[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


other/don't like Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106839455

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: deHC_cHD_00026_.png (1.43 MB, 1728x1075)
1.43 MB
1.43 MB PNG
>mfw
>>
I left after chroma radiance was announced, what did I miss?
>>
File: Debonked.jpg (141 KB, 720x1129)
141 KB
141 KB JPG
>>106844216
>>
>>106844219
Sora2 leak
>>
>>106844216
Debo always gens people standing up, never lying down or sitting. Makes me wonder if Debo is wheelchair bound and that is why he spends too much time genning images on 4chan and treats his thread like it is his personal space or his home.
>>
File: 45795423808576.jpg (722 KB, 3405x1880)
722 KB
722 KB JPG
>https://github.com/dvlab-research/DreamOmni2
>>
File: ComfyUI_naqfs_00010.png (1.9 MB, 968x1320)
1.9 MB
1.9 MB PNG
>>
File: file.png (1.23 MB, 1415x709)
1.23 MB
1.23 MB PNG
>>106844296
what is this bullshit lmao
>>
>>106844310
Mistakes were made.
>>
File: 00338-2142890545.png (2.51 MB, 1248x1848)
2.51 MB
2.51 MB PNG
>>
>>
>>106844357
i want to rape this little slut and fill her holes with my watery cum
>>
>>106844363
>>
>>106844310
>anon is blind
>>
>>106844381
mmm... nyo~
the bitch on the left is supposed to do a 1000 mile stare like the one on the right
see above example
>>
if the anon who made
>>106842691
is still here, any chance you could run it again but without the chicken ^-^
>>
File: radiance.png (2.38 MB, 848x1488)
2.38 MB
2.38 MB PNG
>>106844219
lodestone might have figured out a good "ramtorch" way to train and/or run inference on models with more system RAM than vram

pony v7 auraflow version is disappointing (as many expected) and pony team decides to train qwen

wan 2.5 is api only [for now? some people were pessimistic about that]

qwen image edit got an updated version, it is good. as are the updated lightning lora to speed it up

huyuanimage3.0 won some benchmarks but the aesthetics suck and it's too big for now
>>
File: 00040-3243303301.png (3.72 MB, 1536x2048)
3.72 MB
3.72 MB PNG
>"hey chud! i know how to save the west, come to my house tonight"
well chud? you're answer?
>>
File: 1755453268937375.png (288 KB, 612x586)
288 KB
288 KB PNG
>>106844463
your*
>>
File: 1746814384775127.png (2.83 MB, 1040x1560)
2.83 MB
2.83 MB PNG
>>
>comfyui
yuck
>>
File: radiance.png (2.83 MB, 848x1488)
2.83 MB
2.83 MB PNG
>>106844456
no problem. i forgot: current neta yume lumina is looking pretty good and chroma radiance can do attractive 2d/3dcg 1girl pretty well

not yet replacing illustrious/noob yet but either should be useful to some by now
>>
File: 00046-1634273881.png (3.28 MB, 1536x2048)
3.28 MB
3.28 MB PNG
>>106844463
>"hey chud! according to my calculations, we will need to make at least 10 babies in order to save the west"
cute chudetes belong to the cutest chuds
>>
File: 1749904675518860.png (2.35 MB, 1040x1560)
2.35 MB
2.35 MB PNG
>>
>>106844502
Is chroma radiance more vram intensive because it has no vae? Or slower?
>>
my ai porn account has 2000 twitter followers, how can i convert this into money and leave the rat race?
>>
>>106844508
Scales really badly as you increase resolution.
>>
>>106844528
why are you asking fellow rats
>>
File: 00051-659536982.png (3.34 MB, 1536x2048)
3.34 MB
3.34 MB PNG
"take a peek chud, we must be better chuds and save the west"
>>
>>106844508
no vae. it is doing stuff in pixel space but with speedup tricks (adopted from the pixnerd research) which however still are slower than many vae.
>>
>>106844528
Twitter has been incredibly weak for me. Do you have any special tags?
>>
File: 1754239785900472.png (2.8 MB, 1040x1560)
2.8 MB
2.8 MB PNG
>>
>>106844592
no i just post videos and upload to rule34 ai
>>
>>106844528
95% of your followers are bots
>>
Debo tried trolling and got ignored lmao. Time passes, old clown.
>>
File: 00062-600832562.png (3.28 MB, 1536x2048)
3.28 MB
3.28 MB PNG
>>106844543
>>
File: 1730604188106214.png (2.33 MB, 1040x1560)
2.33 MB
2.33 MB PNG
>>
File: 1745002602637322.png (2.66 MB, 1040x1560)
2.66 MB
2.66 MB PNG
>>
File: 00004-3401928584.png (2.57 MB, 1152x2016)
2.57 MB
2.57 MB PNG
>>
>>106844754
what brainrot is this?
>>
>>106844763
personally i dont like anime (even if western) with the plastic 3d render look, but to each its own.
>>
>>
>>106844528
>He didn't bought Nvidia stocks when this whole AI boom took off.

ngmi. Stop wasting your time and effort trying to get gooner bucks. Just put money into stocks. If you're a pussy, just buy the SP500.
>>
File: 00017-4156378636.png (2.6 MB, 1248x1824)
2.6 MB
2.6 MB PNG
>>106844763
3dcg
>>
File: n.mp4 (1.55 MB, 1120x576)
1.55 MB
1.55 MB MP4
>>106844400
without the chicken the car clearly becomes less suitable for drifting, it's too heavy in the rear
>>
>>106844869
thats fucking awesome love you dude
its cool how you can kinda see through the tint a little bit when it turns
>>
>>106844216
cute
>>
>>
>>106844389
you are unfathomably retarded
you are quoting the wrong prompt. the prompt for the tests are below the images
you didn't even crop out the correct prompt but still missed it lmao
>>
>>106844528
need to grow your audience much larger
it needs to be sufficiently large so that when you can make a paywalled version, like on patreon on something, you can harvest a sustainable amount of paypigs. twitter is riddled with bots and makes you almost no money on its own.
>>
>>106844528
ai explicit porn is lame as hell especially if your generating photorealistic stuff with basic Fellatio and doggystyle acts. It's an oversaturated market and many of the content only feature east Asian and white women. You need to expend into other concepts to grow a larger audience. Shape shifting and body transformation seem to have a hungry audience at the moment.
https://www.youtube.com/@ArzaIce/videos
https://www.youtube.com/@GhostPossession24/videos
>>106844971
would not recommend going to patreon to post photorealistic ai nsfw content. I had several people i subscribed to lost their accounts because of that.
>>
>>106845051
i just used it as an example of paywall services but good to know
>>
>>106844216
what model is this?
>>
Over 100 reactions on my video, bros! I'm making it! I'm popping off!
>>
What is the model this pic was made with?
>>
>>106845072
I actually get mad when I see this shit. Why do you care how many reactions your video gets? It's like bragging about uovotes on YouTube comments. You catered to the lowest common denominator. Good job
>>
>>106845079
Chroma. You're welcome.
>>
>>106844653
>2025
>still gening greasy style
>>
File: 00022-3874857041.png (2.63 MB, 1248x1824)
2.63 MB
2.63 MB PNG
>>
>>106845114
How do you know? Box?
>>
>>106845079
how she holding the beer? chromabros????????
>>
File: 424-3898033858.jpg (214 KB, 746x718)
214 KB
214 KB JPG
>>106844936
guy looks like it's his 9 to 5
"well, back to the grind"
>>
File: 00032-1506236564.png (2.49 MB, 1248x1824)
2.49 MB
2.49 MB PNG
the power of love
>>
File: ComfyUI_temp_ghmhf_00003_.png (1.98 MB, 1024x1344)
1.98 MB
1.98 MB PNG
>>
>>106845160
With enough time one develops an eye for this sort of thing.
>>
I'd like a simple node in comfy to interface with my local llama.cpp server (using openai compat).
I'm mostly interested in multimodal support (Image/Text/Audio Input/Output support) There are quite a few nodes that support interface with an OAI compatible endpoint, do you have experience with any of them?
>>
>>106845284
You meal like speech2prompt or something?
>>
>>106845288
https://github.com/hekmon/comfyui-openai-api?tab=readme-ov-file
just so you understand, I'm talking about nodes like this
>>
Increasing wan2.2 i2v clip length from 5 to 10 seconds used to exponentially increase the gen time. Now the gen time is only 50-100% more.
>>
>>106845356
>Anakin's Force Ghost visits Ahsoka.mp4
>>
>thought I had made a massive breakthrough
>turns out it was just rng
>>
>>106844606
noice
>>
>Wan2.2 animate V2

Hands becoming red eventually

Is "relight" lora to blame?
>>
File: 00058-869541642.png (2.36 MB, 1248x1824)
2.36 MB
2.36 MB PNG
>>
>>106845356
it's still exponential for me
>>
nah but like seriously, you're all memeing, but you're impressed by Qwen Image? Right?
>>
>received another 64gb of ram sticks
>too busy genning to install them

...
>>
>>106845716
i've got a spare 3060 I've been meaning to put in my rig for months, but it's a bit of a faff, and I dunno what benefit it'll even give, so sod it for now
>>
>>106845716
You are not busy, your gpu is busy. You are just pressing buttons.
>>
File: 1759534709124473.png (2.52 MB, 2375x1319)
2.52 MB
2.52 MB PNG
>>106844296
https://youtu.be/8xpoiRK57uU?t=33
>wrong prompt
mistakes were made
>>
File: 1730201862862324.png (651 KB, 1162x373)
651 KB
651 KB PNG
https://xcancel.com/peholderrieth/status/1976338189834141870#m
interesting
>>
File: 100988489038649_00001_F.jpg (2.77 MB, 2000x3000)
2.77 MB
2.77 MB JPG
>>
File: lol?.png (312 KB, 2656x1192)
312 KB
312 KB PNG
>>106844296
https://arxiv.org/html/2510.06679v1
if you look at their mememarks it's "supposed" to destroy Qwen Image Edit? lol who's gonna believe that though?
>>
>>106845836
>GLASS Flows is a very simple algorithm to implement.
Hope so, looks decent on paper
>>
File: file.jpg (851 KB, 3102x1778)
851 KB
851 KB JPG
>>106845874
it does style transfer though
https://pbihao.github.io/projects/DreamOmni2/index.html
>>
>>106845901
True if big. None of the other locals models can do style transfer, image1 to image2, least from what I've seen.
>>
>>106844296
they haven't released the weights yet, if it's a small 1b model it's a nothingburger
>>
>>106845918
>During training, we fine-tune Qwen2.5-VL 7B
>We then train the editing and generation models using LoRA on Flux Kontext to perform multimodal instruction-based editing and generation with the predefined standard instruction format.
so it's just a lora of flux kontext? lmaooo DOA
>>
>>106845936
>https://arxiv.org/html/2510.06679v1
Yep, it's built on Kontext.
Bleh.
>>
File: AnimateDiff_00001-1.mp4 (3.67 MB, 312x286)
3.67 MB
3.67 MB MP4
uguuuAUGH
>>
File: back in my days.png (263 KB, 700x428)
263 KB
263 KB PNG
>>106845936
back in my days when you made a lora of a model you would simply put that shit on civitai and call it a day, now they write paper over it and stuff
>>
>>106845936
>>106845941
So they're using Qwen VL on Kontext? that's... interesting?
>>
>>106845941
I'll reserve judgement until it's released. The style transfer alone looks pretty solid.
>>
>>106845936
>>106845941
Why fucking kontext? are they retarded? QIE is better and has a better licence
>>
>>106845936
>make a lora of Kontext
>name it as if it was a brand new model (DreamOmni2)
fucking frauds, the more I read those papers, the more I realize how many frauds there is in the researching space
>>
>about to get more free shit that has a function that no other image edit model has
>cries about it
never change ldg
>>
>>106845987
>free
they get free advertisment for their work, nothing is free it's an exchange anon, can't believe you're that naive
>>
>>106845987
Its good to be critical anon, and if devs are here shilling their product they need feedback from autists doing model AxB tests all day
>>
File: 104999004.jpg (299 KB, 1280x1536)
299 KB
299 KB JPG
pony v7
>>
>>
>>106845874
That Wu is a productive guy.
>>
File: 1753877650333690.jpg (1.03 MB, 1248x1824)
1.03 MB
1.03 MB JPG
>>
>>106846012
An instant classic.
>>
>>106846012
people waited 2 years for this btw
>>
>>106844430
>pony team decides to train qwen
can't wait for more style cluster nonsense!
>>
>>
>>106845874
you know it's a bad benchmark when 4o is better than nano banana, in no universe this is actually the face
>>
>>106841475

You keep saying you want to learn Blender, 3D modelling, character rigging and animation etc?!

Are you f*ing cereal?!
>>
>>106845356
radial attention?
>>
>>106846012
the power of synthetic data
>>
>>106844430
>pony v7 auraflow version is disappointing (as many expected) and pony team decides to train qwen
even lodestone himself admits that his Chroma project is a failure and is not deserving of a finetune so he went for Qwen, that's brutal, what's even more brutal is that there's still some people in this place pretending that Chroma is fine, the fucking Chroma creator says it's not fine lmao
>>
>>106846012
i don't get
>>
>>106846012
hey at least it's copyright safe
>>
>>106846108
it's even worse than SD3M on making a woman lying on grass
https://www.reddit.com/r/StableDiffusion/comments/1de85nc/why_is_sd3_so_bad_at_generating_girls_lying_on/
>>
>>106846104
>the power of synthetic data
noo you don't get it, we need MORE synthetic data!
https://civitai.com/articles/19986
>We'll continue increasing synthetic content, including our own generation loops, to improve character recognition and especially style blending.
>>
Does wan 2.2 allow for first and last frame or only the first frame?
>>
>>106846141
it does first and last frame
>>
>>106846141
FLVL or whatever it's called allows. Unofficial VACE allows series of frames.
>>
File: 1758987517388024.jpg (672 KB, 1536x1024)
672 KB
672 KB JPG
The power of Pony v7
>>
>>106846170
It genned a pony, and the model is called Pony, what did you expect?
>>
Human v7 when?
>>
>>106846180
>It genned a pony
it looks like shit though lool
>>
>>106846012
>makes the worst model humankind has even produced
>somehow people are still hyped for his next model
dude this community is soo weird
>>
File: 1760092626263575.jpg (835 KB, 1536x1024)
835 KB
835 KB JPG
>>106846170
oh nononononono.....
>>
>All images have been used in training with both captions and tags. Artists' names have been removed and source data has been filtered based on our Opt-in/Opt-out program. Any inappropriate explicit content has been filtered out.

Oh they are fucking idiots, good to know
>>
>>106846193
reposted from lmg award
>>
>>106846170
dayum pony also started its radiance transition, look at all those details!
>>
File: 1730063730589145.jpg (611 KB, 1536x1024)
611 KB
611 KB JPG
Oh my god, this may be the worst result for this prompt i've seen across any model i've tried yet
>>
>>106846197
I'm the same guy writing this post yeah lol
>>
>>106846195
always were, pony v6 was a fluke, now it's obvious enough
>>106846194
OY VEY
>>
Remember Ponyv7 failed because of Iodestone integration into the project, everything he touches turns to failed slop.
>>
>>106846194
meds
>>
>>106846234
>can't take a joke
shalom!
>>
>>106846216
>Remember Ponyv7 failed because of Iodestone integration into the project
lodestone wasn't involved on pony v7 lol
>>
>>106846162
Unofficial 2.2 vace? Tell me more.
>>
>>106846248
But was there.
>>
>>106846170
even in 2022 it would've been considered ugly, jesus christ, how can you fail a project this hard?
>>
>>106846195
why? This is such a retarded decision I don't understand how it can possibly be beneficial.
>>
>>106846261
Look yourself in the mirror and ask the same question.
>>
I had no interest in AI before but I saw some AI videos that impressed me thoroughly and now I wanna give it a shot. How hard is it?
Can I realistically figure it out by jumping straight to video or should I do some image generation first? The guides in the OP for images seem a lot more comprehensive.
>>
>>106846286
hello astralite
>>
>>106846286
but my mirror looks good though
>>
File: 1755516434415490.jpg (368 KB, 1024x1536)
368 KB
368 KB JPG
Ok i'm done. This shit is irredeemable. Won't even bother downloading local model.
>>
Any decent cut to sex loras for wan 2.2, ie the subject of the image is suddenly getting fucked?
>>
>>106846314
can't you just stitch two videos together manually
>>
>>106846313
>irredeemable
do not redeem!!
>>
>>106846277
Well, they obviously didn't filter porn or artists, so I assume it's a far-looking decision to comply with whatever bullshit laws that get passed. As far as I understand the problem was the base model they used and a bunch of architectural voodoo they decided to try which didn't go as expected
>>
File: 1757211681218229.jpg (1.25 MB, 1248x1824)
1.25 MB
1.25 MB JPG
>>106846313
Unedited noob prompt from nearly a year ago in comparison.
>>
Will OpenAI's image gen ever be beat?
>>
>>106846339
>will piss filter image ever be beat
it's already the case lol
>>
>>106846255
https://huggingface.co/alibaba-pai/Wan2.2-VACE-Fun-A14B/blob/main/README_en.md
>>
File: blep.jpg (146 KB, 832x1216)
146 KB
146 KB JPG
>>106844319
>!!!
>GENERATION FAILED!
>Your creation prompt contains NSFW content, which isn't allowed under our policy. Kindly revise your prompt and generate again.

>>106844380
i like it :3
>>106844936
one time a girl at a bar had me "save" her by doing this (i pretended to be her fiance hehe)
>>
Astralite is being harassed on his Discord. I'm reading the comments, and the bronies are furious.
>>
File: cutecutie.jpg (136 KB, 832x1216)
136 KB
136 KB JPG
>>106846344
what if they are specifically into... pissing?
what then???
>>
>>106846354
oh please share some screen I wanna read that so bad
>>
>>106846331
>>106846313
Look almost the same.
>>
File: 1729313953392724.png (2.16 MB, 2858x1259)
2.16 MB
2.16 MB PNG
babe wake up, a new distillation method (by Nvdia) was released, and they used it on Wan
https://research.nvidia.com/labs/dir/rcm/
https://huggingface.co/Kijai/WanVideo_comfy/tree/main/LoRAs/rCM
>>
I miss DALL-E mini's aesthetic
>>
>>106846379
I know they're the biggest company on the planet with unlimited resources, but what's their track record with shit like this?
>>
File: 105006343.png (1.54 MB, 1536x1536)
1.54 MB
1.54 MB PNG
>>106846367
I cant because it would be consider being doxed but ican share this:


Pic related "Definitely the funniest image of Marge Simpson made with the Pony v7 model so far"

"Seeing how the model performs on the CivitAI onsite generator is really alarming, It's comparable to SD1.5 or maybe even some of those niche furry finetunes of SD2.0"
>>
>>106846403
>it would be consider being doxed
just hide the names
>>
File: 1755882784783970.png (94 KB, 665x686)
94 KB
94 KB PNG
>>106846170
https://civitai.com/models/1901521/pony-v7-base
the comments are delivious *grabs pop corn*
>>
>>106846350
>i pretended to be her fiancé
you have never felt the touch of a woman
>>
Please kindly redeem v7 by long & detailed prompting for gorgeous looks thank you
>>
I have a fanfiction I want make it come to life. What would be the best course of action? I have a beefy setup. This is what I have planned.

Generate with wan2.2 t2v (1 frame) the pictures of the characters using a detailed description. Once I have a few pictures that are similar and I like, plug them into ai-toolkit to generate a lora for the characters.

Take chapter 1 and generate a list of scenes (lets say 8-10 scenes per chapter) with the needed shots per scene (6-8 shots per scene) > Plug it into glm4.6 to generate the descriptions for each shot.

Then use wan2.2 t2v and generate each shot.

Use davinci to stitch them together.

Maybe use something like vibevoice to generate voice lines and combine it with Infinite talk to make some of the shots spoken, so I don't have all the scenes silent.

Not sure what do to do with the ambient sounds/music
>>
Even so, I'm glad they released a shit model for defending copyright.
>>
>>106846439
kek
>>
>>106846430
>100k dollars
Really?
>>
>>106846403
SMMMMMMMMMMMMOKIN'
>>
File: 1744304122305066.png (171 KB, 640x640)
171 KB
171 KB PNG
>>106846445
I mean, Chroma cost 150k dollars so that fits
>>
Opinions from /mlp/ regarding v7?
>>
>>106846480
poop emoji
>>
File: meYOW.jpg (185 KB, 1013x1454)
185 KB
185 KB JPG
>>106846435
>not sure what to do about the OST\ambient\music
buy an alesis micron used on ebay
you are now substantially equivalent
within 20-2000 minutes of practicing chord changes
no, the normies will not notice.
>add microcosm hologram to the chain
now the normies and artcore zoomers love you
if you have to play the OST live
just throw some glasses on
get a bit greasy for a few
days beforehand &
drink wine during
your setlist.
u did it! ;D
anon!
>>106846431
uhhhhhhhhhhhhhhhhhhhhhhhh
>>
>>106846480
its shit
>>
>>106846480
i wouldn't expect them to be experts on anything other than cartoon lore but maybe im just out on a limb here
>>
File: 001.mp4 (2.95 MB, 928x640)
2.95 MB
2.95 MB MP4
>>
File: 002.mp4 (1.82 MB, 928x640)
1.82 MB
1.82 MB MP4
>>
>>106846504
>he doesn't know
>>
File: 003.mp4 (3.15 MB, 640x960)
3.15 MB
3.15 MB MP4
>>
File: 004.mp4 (3.15 MB, 832x640)
3.15 MB
3.15 MB MP4
>>
File: 005.mp4 (3.54 MB, 640x960)
3.54 MB
3.54 MB MP4
>>
File: 006.mp4 (3.09 MB, 832x832)
3.09 MB
3.09 MB MP4
>>
>>106846517
jesus christ that has to hurt like a bitch.
these gens are nice. keep going
>>
>>106846071
Omg moar please!!!
>>
>>106846519
no one knows everything anon <3
>>
File: rocketgal420.gif (3.96 MB, 428x588)
3.96 MB
3.96 MB GIF
>>106846548
cute, CUUTE! ;3
>>
Is AudioX the local SOTA for generating audio from videos?
https://github.com/ZeyueT/AudioX
>>
>Content Removed!
>This content violates the platform's content guidelines and has been removed!

how are cloud-based Ai anythings still financially above water?
>>
>recalibrating all pc fans with fancontrol
>it gets to the 5090

AAAAAAAAAAAAAAAAAAAAAA
>>
ponyv7 won
>>
File: 68409227.mp4 (3.67 MB, 816x1104)
3.67 MB
3.67 MB MP4
>>
>>106846776
incredible
>>
File: IMG_3353.jpg (20 KB, 400x400)
20 KB
20 KB JPG
I’m just trying to find the best image model to make degenerate gens of my oneitis using a Lora and all of them kinda suck in their own ways.

>SDXL looks yucky, really bad VAE and likeness is off. Prompt adherence sucks.
>Chroma is better at likeness and prompt adherence, but has been legitimately terrible when it comes to extra fingers/toes and it looks really grainy regardless of workflow/lora


Will we ever get an actually better image model? I’ve tried Wan 2.2 training, but it will not pick up on her likeness.
>>
>>106846802
just use ChatGPT
>>
File: 00000-2555781355.png (3.4 MB, 1728x1344)
3.4 MB
3.4 MB PNG
>>
>>106846802
heal in a more appropriate way
if you're still using her as your ai muse
you are stuck in the past; allow yourself
to finally move on anonkun, only you can
allow yourself to heal; allow yourself to accept
the fact that time can only move in one direction
forward, into the future: which is truly, the only thing you can actually change.
>>
>>106844881
ty. wan can do a lot of cool stuff like that.
>>
>>106846445
nothing really in the scheme of "suspiciously wealthy furries"
>>
>>106846065
yea, there's that... but i guess people do whatever they thought would cover their asses

>>106846106
no... it's the pony team that decided to finetune qwen next with assistance by lodestone? i see literally nothing about lodestone saying chroma is a failure and he's finetuning radiance rn.
the 0.3 snapshot just came out https://huggingface.co/lodestones/Chroma1-Radiance/tree/main
>>
File: ComfyUI_00019_.png (1006 KB, 1328x752)
1006 KB
1006 KB PNG
I put some of my tools online for you fags:
https://github.com/quarterturn/ollama-video-captioner
https://github.com/quarterturn/ollama-captioner
I use them to make wan 2.2 and qwen-image LoRAs
https://huggingface.co/quarterturn/qwen-image-20b-ruri-rocks

I can crank out a reasonably good LoRA starting from video files with very little effort, it's almost all automated. I did one i2v LoRA, holy shit that's really suited for distributed compute, not just a single GPU.
>>
>>106846868
neat
>>
>>106846868
Post the captions to the image you just posted or BTFO
>>
File: ComfyUI_00022_.png (1.13 MB, 1328x752)
1.13 MB
1.13 MB PNG
>>106846662
Because it's a scam, like mortgage hypothecation, or fractional reserve banking in gerneral. Keep the bullshit going, hope no one notices, hide as much money as possible in offshore accounts, and then let the 401K schmucks be the dogshit bagholder.
>>
>>106846885
This is an illustration from the anime Ruri no Houseki. on the left, Ruri, a young girl with medium-length blonde hair tipped with pink, green eyes, is wearing a see-through nightie, and laying on her back in bed, her head on a pillow, looking at Nagi, a blushing, flustered expression on her face. On the right, Nagi, a young woman with long black hair and blue eyes,  is wearing lacy black bra and panties, and is sitting next to Ruri on the bed, looking at her with a loving expression, as she gently, caresses Ruri's face. The scene is that of cozy bedroom, light by bright afternoon sunlight, in a glowing, soft-focus style

OK? And then?
>>
>>
>>106846919
and this bird? its a sloppa species known as sloppabirb
>>
Mmmm, elf witches.
>>
>>106846905
Gooood! Gonna give it a try then
>>
>>106846919
He is just sick, ok? xD
>>
>>106844936
Jake wishes they were that big.
>>
blah blah... is suddenly... blah blah
“is suddenly” is a hotbuzz for qwen edit2509—try it out, Anon, and tell me it's good
>>
File: qwen___0006.png (1.18 MB, 832x1216)
1.18 MB
1.18 MB PNG
>>106846938
i genuinely wonder why people dig into the tiniest details of gens and rip into it like 90% of people don't click on images of women with three hands and don't even notice.

it's a pretty good image (if not generic)
>>
File: ComfyUI_00269_.png (3.45 MB, 1536x2048)
3.45 MB
3.45 MB PNG
>>
>>106846868
not bad. can the gemma3 model do booru tags+natural language instead of just natural language?
>>
File: dante.mp4 (1.95 MB, 640x640)
1.95 MB
1.95 MB MP4
>>
>>106847053
LEEEON HEEEEELPPPP!!!!!
>>
File: ComfyUI_00001_.png (2.06 MB, 1024x1024)
2.06 MB
2.06 MB PNG
>>106846866
>0.3 snapshot just came out
Another anon, ty I missed this

>A photo of a woman standing on a dirt path in a park. Her hands are raised and open. She is wearing a white dress. Her feet are barefoot. She is looking at the camera and smiling.

20 steps/2.5 cfg lms/sgm_uniform

Needs more steps, but most of the artifacts are gone already! Probably won't ever be as sharp as Qwen. Wonder if astral+lode will do a Qwen radiance version..
>>
>>106846633
sexo!
>>
>>106847052
I haven't tried booru but I don't see why not. Out of all the things you can run locally, gemma3 probably "sees" the most detail and is the most accurate. Unlike some of the small image-only models, it can follow a complex prompt describing characters. Maybe google will give us gemini 2.5 flash light weights?
>>
File: 00001-1466628104.png (1.46 MB, 1152x896)
1.46 MB
1.46 MB PNG
>>
File: 1736705615920162.mp4 (2.94 MB, 720x1040)
2.94 MB
2.94 MB MP4
>>
>>106847288
when was the last time you spoke with a real female?
>>
File: ComfyUI_temp_effqj_00002_.png (3.34 MB, 1192x1648)
3.34 MB
3.34 MB PNG
>>106847305
why do chuds get so triggered when they see AI gens? Every now and then I see comments like that
>>
>>106847320
>chuds
only terminally online troons and women talk like that anyway
>>
File: 105269100.jpg (528 KB, 1536x1536)
528 KB
528 KB JPG
AHAHAHAAAHHAHAHAHAHAHAAHAA
https://civitai.com/images/105269100
>>
>>106847363
yeah, its women and troons who always make that kind of comments, but they usually get triggered by realistic 1girls than cartoon ones
>>
>>106847390
ITS MIGU
>>
>>
File: house.mp4 (3.42 MB, 928x640)
3.42 MB
3.42 MB MP4
>>
>>106847479
gate bros...
>>
Somebody know a site where you can pay to use Wan online without the censorship?
>>
All these youtube channels about ai news using ai voice.. You just know they are jeets.
>>
>>106847506
saaar
>>
>ai-toolkit no longer starts
lol, just like last time. I get one or two shots to use it before having to reinstall it. What a shitty application.
>>
File: ComfyUI_00016_.png (738 KB, 1024x1024)
738 KB
738 KB PNG
>>106847068
(You)

>The image is entirely black. There is nothing else on the screen.

In testing, I noticed radiance's black colors suffered. To confirm my suspicions, I ran an experiment..
anon.. black is now a shade purple. An artifact of avoiding 0? Might have to offset the raw RGB values before processing if this is the case.
>>
File: ComfyUI_temp_pqxyc_00008_.png (2.15 MB, 1664x1216)
2.15 MB
2.15 MB PNG
i was too hard on qwen posters before, you can get some pretty nice gens out of it, it's just not the right aesthetic for me
>>
File: 00002-2483459317.png (1.73 MB, 1152x896)
1.73 MB
1.73 MB PNG
>>
>>106847547
>Left is half white half latina
>right is chinese
Did you not specify race?
>>
>>106847552
Wise fwom your gwave
>>
File: wan22___0008.png (1.47 MB, 832x1216)
1.47 MB
1.47 MB PNG
>>106847577
not for the face detailer, left is sdxl (epicrealism), right is qwen
>>
>>106847501
>wan 2.2
rent a gpu
>wan 2.5
idk
>>
>>106847501
Just buy a 3090.
>>
File: 1754375574689870.mp4 (317 KB, 1200x720)
317 KB
317 KB MP4
>>106847438
>>
>>106847729
turn off the porn loras brah
>>
File: bing bing wahoo.mp4 (3.65 MB, 928x640)
3.65 MB
3.65 MB MP4
>>
>>106847751
>not humping her from behind
bing bing wahoo....
>>
File: 00003-1533159549.png (1.37 MB, 896x1152)
1.37 MB
1.37 MB PNG
>>
File: miku protects the city.mp4 (3.59 MB, 1120x576)
3.59 MB
3.59 MB MP4
>>106847390
>>
File: 00004-898169203.png (1.35 MB, 1280x768)
1.35 MB
1.35 MB PNG
>>
>>106847834
she is so talented
>>
>>
File: qwen___0009.png (1.09 MB, 832x1216)
1.09 MB
1.09 MB PNG
>AY body type qwen lora
3hrs to train a lora on 16 images isn't too bad but it clearly causes degradation.

gonna a weeks with chroma and see what it's about.

>>106847853
kino
>>
>>106844936
hey chat is this real?
>>
Any kinda funny artstyle loras for Illustrious? want to try something different than the usual.
>>
>>106847614
>>106847547
Did you also not specify age? why is it making 30 year olds
>>
>>106847870
atou... I KNEEL!
>>
File: IMG_5381.jpg (298 KB, 1179x1513)
298 KB
298 KB JPG
>>106848010
pls understand
we need all the underage ones for our gov funded chomo lairs underground you can keep the only the slags
>hotdog enters hallway
>loose girl behavior incentivized/glorified
>you will never have a normal female due to our brainwashing programs

pic: not related but also laughable clownworld nonsense.
imagine being so retarded that you think this is real via social-media doomscrolling (17 hours a day)
>>
>>106848061
lmao
>>
>>106848061
postcard go to bed
>>
>>106846517
Damn, what model is that?
>>
>>106848145
looks like the new grok video model, not very local of him, but it is a really interesting model.
>>
>>106846331
noob and illustrious will never be surpassed
>>
>>106848151
nah that looks like wan and theres no watermark desu but yeah thats wan
>>
>>106848061
How do I get a job in those underground lairs?
>>
>>106847497
I can finally meet Louise...
>>
File: chroma___0001.png (1.48 MB, 832x1216)
1.48 MB
1.48 MB PNG
i sure hope chroma isn't some kinda meme you guys tricked me into fucking with

>>106848010
ages are 25 and 'milf' respectively. you seem very concerned the model doesn't default to lolicon.
>>
>>106848199
>doesn't default
>default
I've been out of the loop. How is Chroma's training going?
>>
So far Illustrious has been the ONLY model that I have been able to reliably generate good looking handjobs with. A woman's Fingers/hand on a males penis seems to be genuinely difficult in everything else. Chroma, Qwen, Flux. Unless you're doing the very specific POV giant 2 foot dick slop they just turn into a 2022 era body horror AI .

Why is this?

The point of AI is to gen stuff you can't find easily online yet everything after Illustrious seems to only excel at shit you can easily find everywhere online.

Everyone told me to just get ChromaHD and run my previous Illustrious gens through IMG2IMG on it yet it just creates monstrosities.

Fetish bros in shambles.
>>
>>106848287
are you saying you can't find handjob pictures easily enough that you spent thousands on a computer to make your own?

when you own both a hand and a dick?
>>
My portfolio is looking pretty good. Talking to Comfyanon soon.
>>
>>106845356
it does weird shit though.. it kinda goes off the rails when you gen longer clips from my experience
>>
File: ComfyUI_05805_.png (877 KB, 880x1184)
877 KB
877 KB PNG
>>
>>106846194
there's no butterfly at the bottom.. you made that one up
>>
>>106846350
insta friendzoned virgin
>>
>>106846379
Trun, Thurs and GSHOOK lol.. shit still doesn't work
>>
>>106846525
that was pretty gud
>>
>>106848380
nta but a girl from my faculty asked me to "save her" from a guy at a club by kissing her, but I was terrified of relationships and gave her an awkward hug instead. she stopped talking to me
>>
>>106846548
well done
>>
File: chroma___0011.png (1.39 MB, 832x1216)
1.39 MB
1.39 MB PNG
>>106848222
it finished a while back and then split into a few other models HD, flash and something else. I'm not sure who this is for yet, it seems very brittle.
>>
Hello, /sdg/ downloaded chroma1HDGGUFFP8_fp8ScaledHybridRev2, running on swarm, what do you think? I want a clean OC characters anime/2d gens.
What samplers, steps, cfg, loras or chroma versions work best?
Please, share your workflow if you got one!
>>
File: 00001-446186673.png (1.44 MB, 752x1264)
1.44 MB
1.44 MB PNG
>>
>>106848489
I think your cock and balls should be amputated with a blowtorch
>>
>>106848506
sorry im new in this and made the first message to /sdg/ and copy pasted the message
>>
File: 00004-4291796427.png (1.56 MB, 752x1128)
1.56 MB
1.56 MB PNG
>>
>>106848489
*Hello, /ldg/
sorry I send this message to /sdg/ first
>>
>>106848377
There is the lifted hoof
>>
>>106845987
about to get what? qwen edit is already good
>>
Has anyone who’s tried Grok Imagine found even a moderately comparable comfy based workflow (using any GPU available on runpod)? New to wan and I can’t believe how good the grok results were for nsfw photo animation, but public results and moderation flakiness kills the concept… however for me it has set the bar and I wonder what’s even realistic to achieve diy. New to diffusion and been learning lora training for qwen and chroma and having good results with t2i but now that I’ve seen what grok can do with a single image I wonder whether replicating it is a pipe dream even if I throw money at compute. Any tips appreciated
>>
>https://huggingface.co/lightx2v/Qwen-Image-Lightning/blob/main/Qwen-Image-Edit-2509/Qwen-Image-Edit-2509-Lightning-8steps-V1.0-fp32.safetensors
for a lora, is bf16 notably worse? ive been using the fp32 with good results but idk if it's diminishing returns or not.
>>
>>106847288
pretty good one
>>
File: 00025-4019616270.png (1.26 MB, 896x1152)
1.26 MB
1.26 MB PNG
>>
whats the minimum amount of images for a dataset to work well? 20?
>>
>>106848308
Not small penis handjobs where the dick is skinny. Trying to live out my str8 shota fantasies.
>>
>>106847834
yes, yess, yesssss
>>
The comments about new pony on civit are far more entertaining than the model
>>
I tried to chaining 2 x 5s clips. The motion is reset
>>
>>106848577
the difference between having $1 billion infrastructure + proprietary trained models vs $2000 gpu + open source

its not much of a fair battle
>>
>>106848199
>i sure hope chroma isn't some kinda meme you guys tricked me into fucking with
for realism, no its not a meme. for everything else it is tho kek. well... that one anon makes some nice illustration loras desu.
>>
>>106848145
Rook rike wan with slowmo light LoRA
>>
Running my gens through Grok video really spoiled me. I made something and now it is alive, and she is as horny as I was when I made it (or rather, my brain thinks this is true even though I know it is untrue). With just a bit of extra work, you could easily just set up a bot that uses an LLM to make stories in a coomer setting, which automatically produce prompts at key moments, which get fed into the t2i-i2v pipeline and eventually automatic dubbing.

There is going to be a name for this in the future, something like Pygmalion Syndrome, for oneshotting yourself with your own AI outputs and get trapped in a digital world.
>>
File: 00026-3417969163.png (1.43 MB, 1024x1024)
1.43 MB
1.43 MB PNG
>>
Fresh when ready

>>106848716
>>106848716
>>106848716
>>106848716
>>
>>106848650

I get your point, although it’s never clear to someone ignorant of all the details (me) whether the $$$+infra are essential to the use case (generating 1 video from a pic within a few minutes with a userbase of 1) vs at the global scale they’re operating at
>>
>>106848012
>>
>>106848632
Change the dude into second sexy female. Male on the screen deters from fap.
>>
File: radiance.png (2.79 MB, 848x1488)
2.79 MB
2.79 MB PNG
>>106846548
that is very cute!

>>106846543
cool effect

>>106847068
>Another anon, ty I missed this
no problem

>Probably won't ever be as sharp as Qwen.
i'd not expect it to beat qwen or hyimage3 either

>Wonder if astral+lode will do a Qwen radiance version..
with a vae that behaves well during training and inference, there might not be as much practical motivation to do it. but i guess we'll see.
>>
>>106847834
amazing
>>
>>106848145
All wan 2.2 with light. Images are mostly all generate with illustrious (normally wai) occasionally pony if I've used an older one from my folder.
>>
Another one
>>
limit



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.