[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107006468

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Neta Lumina
https://civitai.com/models/1790792?modelVersionId=2298660
https://gumgum10.github.io/gumgum.github.io/
https://neta-lumina-style.tz03.xyz/
https://huggingface.co/neta-art/Neta-Lumina

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: jack2.jpg (3.24 MB, 2432x2432)
3.24 MB
3.24 MB JPG
>>
>>107010364
>4 pics of the same girl
kys faggot
>>
can someone make a better model than chroma already? I'm tired of tard wrangling this shit
>>
Didn't manage to get titties on ltx 2 site with t2v, let's hope is just in site censorship and not the model itself, it would be such a shame because the model feels and looks so fucking great.
>>
give me a new style and girl to prompt
>>
Output from the LongCat Video demo script first stage. 1:04 long! It took about 3 hours. The compression to attach it here is killing it, so here's the original file: https://files.catbox.moe/2gjaqx.mp4

Note that this is only the first pass out of 3 (the longest though). Unfortunately, the inference crashed shortly after beginning the second stage due to a bug in the provided code. I fixed it and and will try another test.
>>
>>107010393
sure
style_cluster_33
iwakura lain
>>
>>107010404
cool. how does longcat work? do you chain multiple prompts or you gotta do all in 1 prompt?
>>
File: wan22_t2v_00001.mp4 (936 KB, 720x720)
936 KB
936 KB MP4
>>107010331
not so fast anon see here
>>107010391

and video is latest test, workflow here https://files.catbox.moe/2i6qkl.mp4
>>
>>107010393
Gen a scene girl
>>
>>107010393
gen a photoreal obese black woman
>>
>>107010408
The example is a single prompt for everything, but I'm not sure if it will do well with a sequence of different events. It looks like it should be possible to prompt each segment differently, but it might get weird when it fuses them together.
>>
>>107010410
do note its 81 frames but at 32 fps because i think that is the base fps for wan2.2 however when you use most peoples lora's they will be at 16fps so adjust accordingly.

Also can bypass those context windows, they are useful for when you wish to do longer videos. For longer videos just increase the total frames and the context window can be adjusted to more frames, think of them like chunks. It still does all frames at once but it won't oom which is an added bonus meaning you can do 1280 x 720 full resolution without much issues.
>>
>>107010364
I wish there was more feet sloppa...
>>
>>107010441
Describe what you want
>>
>>
without slop twerk lora
>>
>>107010404
Could you do a quick short clip nsfw test?
>>
>>107010440
Context Window is actually a frame batch node?!
>>
>>107010514
kind of, yeah... Its actually meant for doing longer videos but it stop's oom's and it works very good. Its really underrated or just over looked desu because its a new beta node guess.
>>
>>107010536
I have had it hooked up before but stopped using it for some reason.
This totally coincides with me starting to oom more often.
>>
>>107010510
Yeah I second this.
>>107010404
have it do drop down and zoom up a ladies skirt, i want to see the brappers it can gen (0.0)
>>
Does Chroma like simple sentences or long paragraphs?
>>
I have a feeling we are gonna be eating so good in the new year or Christmas coming.
>>
how do I make comfyui run a long job in the background while I create new jobs in the foreground? as in, one long job and one short job both running at the same time
>>
>>107010479
never got into wan and alike.

so this does not work well on faces?
it is clearly different when it gets animated
>>
>>107010584
I think it increases gen times, i've noticed when i forgot to disable them, i had only genned 81 frames but i had the context on and set to 41 frame context size. And yeah they gen i'm doing is taking way too long, its been stuck at 33% for a while now... and it just jumped to 67% thank fuck, i thought it was having a fit. but yeah its effecting the gen time a lot, so only use the context window when you need it.
>>
>>107010626
how bad is your autism? they look pretty close
>>
>>107010653
>close
not at all if you wish to supplement your low count dataset for lora training - with precision
>>
>>107010406
>>107010416
>>107010418
any other ideas
>>
>>107010696
Post your gen and I'll decide
>>
a n o n s how's it going?
>>107010615
sentences, yeah. long paragraphs, nope.
>>
https://civitai.com/models/2073885?modelVersionId=2346721
OYYYYYYYYYYYYYY
>>
>>107010719
My fetish
>>
>>107010719
oy vey shut it down!
>>
whys qwen edit create these scan line looking bands across the output?
>>
>>107010794
GPU failure
>>
>>107010805
don't be mean anon,
>>
>>107010833
>generic white girl #4634566356
yawn
>>
>>107010841
>white
oy vey...
>>
File: output_t2v_refine_1.mp4 (3.16 MB, 1280x704)
3.16 MB
3.16 MB MP4
So far it's impossible to get anything animu-looking out of Longcat T2I. It always does some kind of 3DCG thing when I tell it "anime" or "animation" in the prompt.
>>
>>107010866
does it have i2v?
>>
>>107010866
What is your prompt?
>>
File: spagooter.jpg (2.92 MB, 2432x2432)
2.92 MB
2.92 MB JPG
>>
File: output_t2v_refine_1_1.mp4 (3.06 MB, 1280x704)
3.06 MB
3.06 MB MP4
>>107010872
Yes, guessing it will work better based on how Wan responds. Going to try after a couple more T2I attempts.

This deviantart looking thing is the "best" I've gotten so far.
>>
>>107010898
normal wan has the tendency to CGI my anime stills, I guess they trained it mostly on 3dcgi and realism
>>
File: output_t2v.mp4 (1.19 MB, 832x480)
1.19 MB
1.19 MB MP4
>>107010881
Some variation of this, swapping out terms to try to push it:

>prompt = "high-contrast anime of a female space marine wearing a form-fitting robotic exoskeleton. she has a black bob haircut and narrow blue glasses studded with blinking LEDs. her armored suit is gray and blue, with signs of wear and battle damage. over the suit, she wears a long, ragged khaki-colored scarf. she is in a desert with sparse rocky outcroppings and a brown haze in the air. she climps to the top of a jutting pile of rocks, and looks into the distance. far away in the distance along the horizon, a city of domed buildings is visible, clustered together. at the center of the buildings is the cable of a space elevator, a single vertical line rising straight up into the sky. the lighting is harsh, tinted by the atmospheric haze."
>negative_prompt = "Bright tones, overexposed, static, blurred details, subtitles, works, paintings, images, static, overall gray, worst quality, low quality, JPEG compression residue, ugly, incomplete, extra fingers, poorly drawn hands, poorly drawn faces, deformed, disfigured, misshapen limbs, fused fingers, still picture, messy background, three legs, many people in the background, walking backwards, cg, 3d, photograph"

Also, the 1st stage low-detail pass has generally looked better than the last pass for this prompt, ignoring that it's the wrong medium.
>>
How do i generate slop?
>>
>>107008788
> KJ was wrong to assume it needed no codes changes, we gonna need a new node.
It need changes for native nodes, his implementation is correct and works, there are example gens in that github thread.
>>
>use the latent upscale method
>it can't do anything other than a 1.5x upscale

God, open source sucks ass.
>>
>>107010364
Top right and bottom right are very much my shit.
Hnnnng
>>
>>107010933
>retard doesnt know how models are trained
>>
>>107010933
literately upscale dummy
>>
>>107010933
Something else must be fucked up
>>
>>107009152
>>107009129
wan?
>>
Any reliable way to do POV deep penetration with wan i2v? GF been complaining my OC dick insertion shenanigans aren't genning deep thrusts.
I looked for some wan loras to address this but couldn't find any. There are SDXL loras for this, I may try some first-last frame workaround.
>>
>>107010940
>>107010950
>>107010963
It's because of the uneven pixels it's pissy about. The upscale by x node was supposed to fix that, upscaling with manual numbers is impossible to match so you don't get the error.
It's literally dogshit coding.
>>
>>107011018
> There are SDXL loras for this, I may try some first-last frame workaround.
Yes, or vace with deep/shallow frames.
>>
>>107011022
Can you illustrate what you mean with examples? I haven't run into this.
>>
>>107011053
I didn't get around playing with vace yet. By the sounds of it seems to have a lot of potential to do wild v2v stuff.
>>
>>107011073
But it runs 1.5x slower.
>>
>>107011060
All this extra shit just for latent upscale and you're stuck with 1.5x, nothing else works or you get the mismatch error.
>>
File: 1730838550118692.jpg (2.78 MB, 1017x9000)
2.78 MB
2.78 MB JPG
I love the Internet
It's lovely having the ability read the thoughts of the brightest minds in history
>>
>>107011000
Pony v7
>>
File: output_t2v_refine_1.mp4 (3.06 MB, 1280x704)
3.06 MB
3.06 MB MP4
Longcat can cook anime-style noodles! Sort of
>>
Got wicked drunk and passed out last night, woke up just now with a throbbing headache. Last I was at my computer I was genning titty elves with Chroma Flash 12 steps DEIS, 30s per gen, so if that was 10 hours ago then there should be 1200 new titty elf pictures on my hard drive now
>>
>>107011094
>int to float
>float multiply
>flot to int
holy fuck have these retarded node devs ever heard of duck typing (prominet feature of python)???? holy fucking shit
>>
>>107011146
Proof
>>
>>107011106
Making a model gravitate toward a specific color was always a thing. It seemed like leaning toward yellow was the "default" in most models but Sora goes too far for some reason.
https://github.com/hako-mikan/sd-webui-supermerger?tab=readme-ov-file#adjust
>>
God damn, 60s.

>>107011149
Open source, bro.
>>
>>107011146
post your top 5 picks
>>
>>107011146
>there should be 1200 new titty elf pictures on my hard drive now
I kneel
>>
>>107011163
I'm not at my PC I'm lying in bed feeling like complete dogshit
>>
>>107011161
>just make it yourself
I don't need the nodes, my workflows are pretty basic, but unlike these hacks I'm a competent programmer, so when I see garbage nodes such as the ones you posted, I can't help but criticize.
This prompted me to check why there are no basic math operation nodes planned in native comfy and VOILA:
https://github.com/comfyanonymous/ComfyUI/pull/8024
The only other competent person in this discussion says the same thing I'm saying.
>>
>>107011149
nigger retard you can't connect int and float just like that
>>
>>107011212
another retard that doesnt understand how python works.
you can have outputs be dynamically typed (or having a dropdown for casting to your desired type), but everything is mathable together in python, makes absolutely no sense to have 09593421906234 nodes all to do casting and implementing math operations for each fucking type, there's absolutely no fucking need retard. Go learn python fucking monkey retard, kys
>>
>>
anis will saveis
>>
>>107011230
no one will save you
>>
why can't chroma tell the difference between an anus and a vagina
>>
>>107011272
it's a failbake
>>
File: ComfyUI_02802_.png (1.07 MB, 768x1360)
1.07 MB
1.07 MB PNG
goth cart
>>
>>107011106
I think it's great the average normie is so clueless, so the anti-AI mob calms down a bit while open source flourishes. Though Qwen definitely has a bit of yellow tint (Alibaba trains on synthetic data unfortunately).
>>
>>107011223
lo
>>107011247
ra?
>>
>>107011220
you are fucking degenerate learn how comfyui works and treats workflows in particular first mr i know how's better
>>
File: file.png (26 KB, 329x346)
26 KB
26 KB PNG
I mean there already are nodes where you can do all the math and conversions you need (and more)
>>
>>107011272
because it generalized to understand that the only purpoes for a digital anus is for it to be fucked, equating ti to a vagina
>>
>>107011317
the Any Switch nodes in rgthree already support dynamically typed inputs and outputs, you're a fucking retard. You can dynamically type the output depending on the input you're connecting it to, likewise for inputs.
kys nocoder retard
>>
>>107011499
> you are fucking degenerate learn how comfyui works and treats workflows in particular first mr i know how's better
>>
>>107011531
>>107011338
retard
>>
For video, going with a less elongated aspect ratio is good because you can push the resolution more. So for a square, you can get away with 1080p.
>>
>>107011541
>>107011538
>>
>>107011546
>>107011538
>>107011541
>>107011531
gay
>>
vidgen was a mistake
>>
There's no "best practices" in comfyui, you just have to carve out your own little path through the mountain of feces to get to where you need to go
>>
can local video gen add audio and lip sync like grok? I've been having alot of fun with grok but the daily limit is annoying.
>>
ok I got the money ($3k). which gpu should I buy?
>>
Could Qwen edit be use to change the style of something like make anime real or 3D render realer looking?
>>
>>107011558
no. local is very far behind in all fields currently.
>>
>>107011561
Nvidia H100
>>
File: ComfyUI_07467_.png (3.94 MB, 2560x2560)
3.94 MB
3.94 MB PNG
>>
File: output_t2v_refine_1.mp4 (3.1 MB, 1280x704)
3.1 MB
3.1 MB MP4
Longcat T2I has produced something anime-like!
>>
>>107011561
Preorder 4 Huawei Styx 1k40s
>>
File: ComfyUI_07288_.png (3.54 MB, 2560x2560)
3.54 MB
3.54 MB PNG
Where is criticanon? Did he go back to Blazblue? Did anyone make an OC for him?
>>
>>107011599
>chinese shit
be serious
>>
>>107011602
>96GB vram vs MAYBE 32GB if you're lucky
I am serious
>>
>>107011601
<7288
clear your output
>>
>>107008372
I want to train a super resolution model.

I have a collection of images, with the same style and colours, but some are low resolution because I don't have the original images.

My idea is to recreate the high resolution images by training a model. The idea is to tell a model: hey, this is a low resolution image and this is the original high resolution image, and the idea is to guide the model with that information so then I give it the low resolution image and it gives me a high resolution image. I know I will never get exactly the original, but because all the images share the same color palettes it should be more accurate than using an untrained model.
>>
File: ComfyUI_07285_.png (3.21 MB, 2560x2560)
3.21 MB
3.21 MB PNG
>>107011611
I know I need to, but I'm lazy... What you got nigga?
>>
>>107011617
And what are you gonna do different from the 6 gorrillian upscale models that already exist
>>
>>107011626
Give it a cool demo.
>>
>>107011296
https://civitai.com/models/1942216/chroma-the-creepy-the-unsettling-and-the-ugly
>>
>>107011626
>Asks for X
>But why do you want X bruh???
Read my comment again, please
>>107011617
I think it was called GAN? I don't remember the exact AI term.

I basically want to train the model with pairings, but I don't know what model to train or how, please someone help me.
>>
>>107011617
You'll need thousands h100 hours and millions images for that.
>>
>>107011621
what a fucking shitty gen, why upscale this garbage with broken hands? kys
>>
File: 1754568532062009.png (148 KB, 1711x1157)
148 KB
148 KB PNG
wait what? I thought it was a Wan finetune but it's its own model??
https://huggingface.co/meituan-longcat/LongCat-Video
>>
>>107011589
looks inferior to wan 2.2 unfortunately
>>
>>107011671
Because it was a joke to make fun of a guy in a thread in /v/ so I didn't much time on it. I honestky forgot I posted it here.
>>
>>107010364
But of a newb question right off the bat, ignore if you want: Can you have a local language model somehow?
>>
>>107011698
Jesus I typed so fast I seem like a "saar".
>>
>>107011709
go to /lmg/
>>
>>107011558
currently we are waiting for ltx 2 to release weights in november but that's all
https://ltx.video/blog/introducing-ltx-2
>>
Is my GPU dying? Often when I start my computer or wake it from sleep, GPU fails to initialize and I get a black screen. I have to reset multiple times, it takes several tries like revving up a chainsaw but eventually it does work, but it's annoying and worrying as hell. Does open source genning put undue strain on a GPU? Perhaps leaving Flux running for 10 hours generating batches while I'm away at work was too much for my aging 3080
>>
>>107011664
Can't I make a lora?
>>
>>107011844
Just unplug it and blow on the slot
>>
Are there any IPadapters for Chroma, or do you just use the Flux ones?
>>
>>107011844
mine did the blackscreen thing and then it died a few weeks later. i also left it genning so you might have to get ready to buy a new one soon
>>
>>107011223
>>107011247
can you post your workflow? my chroma gens always look blurry or messy
>>
File: 00045-2138738232.png (2.24 MB, 1824x1248)
2.24 MB
2.24 MB PNG
i really love ltx2 video just for the mere audio support and 8sec coherence. can't go back to generating muted slopped video from wan 2.2. want to try grok out but redditors are crying about the excessive censorship.
https://files.catbox.moe/80u7jc.mp4
>>
>>107011984
uhmmm LOCAL video gen? go to /wsg/ or /gif/ sora threads if you want to post your NON local videogens, no one fucking cares here.
>>
>>107011984
uhmmm stay but less talking and more bouncing
>>
File: Ponyv7_20251026_00001_.png (1.84 MB, 1280x1536)
1.84 MB
1.84 MB PNG
migu
>>
>>107012162
I'm a local fag but bored with muted videos. just paid for a month at a discount test the backlog of 1girl images i have on my output folder.
>>
>>107011094
again, I tried this bullshit and the output was slopped as fuck and the background was grainy. 480p high, latent upscaled 1.5x to 720
>>
>>107011115
why you lie?
>>
>>107012372
again, fuck off, you're off topic.
>>
>>107012372
again, stay, but more bouncing
>>
>>107012445
you fuck off, thread shitter
>>
File: 1731919769508867.mp4 (2.73 MB, 920x720)
2.73 MB
2.73 MB MP4
don't know why it tinted green. triple ksampler, unipc/simple
>>
>>107012474
that immediately killed the sitcom aesthetic
..or did it? I haven't watched tv in decades, god knows what they show now
>>
>>107012457
talk about local in the local diffusion thread, faggotron
>>
>>107012457
>posts off-topic shit
>thread shitter
literally kys, your gens are garbage 3dcgi anyway they FUCKIGN SUCK bro, either do anime or realism, not this fucking pixar shit it fucking blows. then it's all cowtits shit too, NO RANGE at all, you're a literal brown subhuman with your fucking bigporn whatever illustrious shitty mix youre using, fucking garbage. literally kys.

>>107012513
browns dont understand that we have multiple threads for everything, I understand not wanting to go to /adt/ or /sdg/ (theyre tripfag/avatar infested that do the daily HI X HOW R U X3 absolute cancer thing, on top of having faggystudio in the op), but there are pretty 100% servicable /de3/ threads here for cloud image, and sora threads in /wsg/
but no, they can't fathom frequenting multiple threads, they think they're regulars here so they shit up the thread with their useless shit nobody wants to see (save for another retarded brown that won't understand WHY someone doesnt want to see non-local shit in local threads).
It's the same thing in /lmg/ btw, you have absolute brownoid retards discussing diffusion/comfy shit sometimes, but they're ''''regulars'''' there so they feel entitled to shit up the thread. browns shouldve never been allowed internet access.
>>
>>107011247
average romanian after night out

>>107011294
heheh cute

>>107012474
so fuckable

My Lord Allmighty, she's so fuckable fuckk.
OOF. This is the type of girl that you wake up mid night and get to fucking while she protests but succumbs to her body quickly.
God fucking damn it everything damn fuck all.
Young bratty bitches God bless em.

She would not have time to do anything around the house. If I saw her cooking Id fuck her. Cleaning? Id ra_e her on the floor. Chilling on her phone? Take phone ram in the wall throatfuck her until shes wet.

Life doesnt get much better than a bratty bitch cocksleeve on the cock daily.

Imagine the freedom and relief of living such a balls emptied frequently, life.

Oh yeah, and the girl aint bad either.
>>
>>107012549
post hands
>>
>>107012544
based, fuck them.
>>
File: 1734011056304513.jpg (1.05 MB, 2016x1152)
1.05 MB
1.05 MB JPG
>>
>>107012544
I'm not that anon. anon was testing a model that will be open source in a month (supposedly) and you have a sperg fit. YOU kys
>>
>>107012578
>open source in a month
it is currently API only, it doesnt fucking matter. Also they can still rugpull and what they release as open might not correspond 1:1 to what they're serving via API. Kindly fuck off.
>>
>>107012586
just fucking end your life retard
>>
File: 1730378377188205.jpg (477 KB, 832x1216)
477 KB
477 KB JPG
>>
>>107012558
ladies first
>>
>>
>>107012684
Never going to get a woman talking like that
>>
File: 1746136140495823.png (1 MB, 1574x1201)
1 MB
1 MB PNG
>>107011161
am i doing it right?
>>
>>107011969
https://files.catbox.moe/6fv9gj.json
>>
>>107012656
>subtle cameltoe
wan is based
>>
File: 00053-4034009164.png (2.3 MB, 1824x1248)
2.3 MB
2.3 MB PNG
>>107012544
what an absolute faggot. i was testing the ltx2 video model on the official site because there is a serious drought and the devs are not releasing the weights until the very end of November. its a open source model so its far from off topic.
https://files.catbox.moe/b55zog.mp4
https://files.catbox.moe/u3lup5.mp4
>>
>>107012910
stop posting your cgi garbage retard, you're still posting OFFTOPIC api shit despite all the mental gymnastic you're doing. kys
>>
>>
>>107012887
Anon do you have a good i2i workflow for Chroma to share?
>>
File: Ponyv7_20251026_00009_.png (1.92 MB, 1280x1536)
1.92 MB
1.92 MB PNG
I continued genning ponyv7, iterating styles.
Using simple/euler/40steps/4cfg randomized style clusters and.
It's fucking garbage. WHILE some of the gens might look nice, the details are absolutely shit. It's like the model is undercooked, and ALL gens have big defects, either in the anatomy (mostly eyes/hands, didnt try FEET gens but I cant imagine them being better) and/or in the image composition. These aren't even hard prompts AND MAN it just fucking sucks.
Posting a realism gen I got when randomizing the styles. Anyways, that's it from me for pony. It's a shame because it had some promise. You can gen at 2MP (which is nice) without needing to upscale, 100~ seconds per gen, but the results are... just not worth it. Back to lumina/qwen/chroma with me.
>>
>>107012994
I appreciate your grit anon
>>
ani might drop a new release today
>>
>>107013173
Go back
>>
>>107012994
unironically he should've just not released it and disappeared from the Internet in shame
>>
File: QIE2509_20251026_00004_.png (1.62 MB, 1328x1328)
1.62 MB
1.62 MB PNG
>>107013229
yup, sad. in other news, this qwen lora is pretty good.
>>
Nooooooo, which animal fucker will save us now??
>>
>>107013253
man, in this space, especially for our precious 1girl, only the most degenerate dedicate time to the craft. These degenerates are sadly furryshits and ponyshits (a worse version of furryshits). I'm not sure about the yume guy, but he's got a SEA name, so he might just be animegoon minded.
>>
The first of 1,696 gens I need to comb through. This is gonna be a lot of work
>>
>>107011857
I haven't tried it yet, batwing clownshark has a style transfer workflow: https://github.com/ClownsharkBatwing/RES4LYF
I like their sampler & chain sampler workflow
If you were talking about copying the likeness of a person then sorry I don't know
>>
File: file.jpg (934 KB, 4173x3000)
934 KB
934 KB JPG
why does qwen image edit zoom out when I increase the pixel count?
>>
>>107013184
go to bed
>>
>>107011632
>no horror porn gens on the loras civit page yet
shame, guess I know what I'm doing when I get home

>>107011272
yeah this is proving to be quite a buzzkill
t. analsex enjoyer
>>
>>107013324
it's baked in the model sadly, you can mitigate some of it if you dont connect the vae to the text encoders and create your own latents and conditioning.
>>
>>107012981
Just a basic working one. If you want I can add the nodes to that Chroma workflow I posted.
>>
>>107011223
>>107011247
The eyes look very sad. Would hug (and also give a bath and a sandwich).
>>
Any of you guys know how to inpaint in comfyUI without the whole image losing quality after a couple of passes? I'm trying to edit several small parts of an image but after 4 or 5 passes the image as a whole starts getting noticeably blurrier, even in parts that were never edited/masked on.
Any nodes/workflows that could solve that?
>>
>>107013484
from what I recall, LanPaint can sort of do it. Generally when you go to/from VAE there is quality loss, no way around it. LanPaint tries to do a smart thing, which is preserving the original image and just replacing the masked part (more or less).
>>
>>107013503
What I'm confused about is how did A1111/Forge use to do it. I switched to comfy months ago for several reasons, but the fact that such a staple function AI generation is so hard to get right with comfy and so simple in the older,arguably less potent work environments is crazy.
I'll check your recommendations along with some other things, thanks.
>>
>>107011852
Have you seen loras for upscalers?
>>
>>107013360

Please do, I would love a WF from you. Your pictures look dope!
>>
>>107013527
welcome to the cumfartui experience where user convenience is barred from entry
>>
>>107013484
don't inpaint with the same seed
>>
>>107013484
also you're doing something wrong
>>
So what chroma and what version of it should I be using with 24gb vram?
>>
>>107013612
I don't know, that's why I'm asking. It should be possible with some guided training but I don't know how to do it.

You should be able to train or make a lora for models in theory, even if it's an upscaling model, it should be traineable.
>>
Does LORAs not work with the GGUF version of WAN 2.2 I2V? Can't lewd loras to work at all.
>>
>>107013796

It definitely does. Screenshot or catbox your workflow?
>>
why is civitai archive useless now? There's no point if the only mirror is the deleted mirror from civitai lol
>>
>>107013784
You'll need lora training code and know what layers to train. At this point, better ask LLMs.
>>
>https://rentry.org/wan22ldgguide
I'm following this to the letter with Comfy but I can't get it working.
After finishing "Loading transformer parameters to cuda" it starts "Sampling 81 frames at 480x832 with 3 steps" and Comfy ends with nothing in the console.
16gb vram + 32 sysram get filled up, but there's no GPU usage until Comfy ends.
Also two of the workflows linked in the guide are the same, one is an empty file, one has the wrong models loaded in.

>torch==2.10.0.dev20251026+cu128
>triton-windows==3.5.0.post21
>>
>>107011844
maybe some capacitators are borked
>>
>rtx 4070
>864x1208
>qwen image edt
>315s per generation
is this normal? takes a long time text encoding, the generation itself its pretty fast
I'm using a fairly simple workflow
there's 2 text encoder nodes, each take 118s, the ksampler takes 94s.
>>
>>107013360
Yes please <3
>>
>>
File: 1677880274397050.jpg (4 KB, 249x157)
4 KB
4 KB JPG
this month has been locally boring. nothing new. I literally haven't touched my comfyui folders, except to add new loras...
>>
>>107014020
comfyui is boring
>>
euler_cfg_pp/simple CFG=1
>>
>>107013972
>fixed
some retard put the clip on the CPU, went from118s to 4s
>>
>>107014047
oof
>>
Does anybody have some tips for Qwen?
Got a workflow with 3 image inputs, couldn't find how to do with only 2.
Do I need loras for NSFW?
I've mostly been changing women's clothes but I'm limited to a "source" image for body composition.
>>
>>107011294
She has three legs
>>
>>107014098
>Got a workflow with 3 image inputs, couldn't find how to do with only 2.
TextEncodeQwenImageEditPlus has 3 image inputs. Use qwen edit 2509 of course
>>
>>107014098
>>107014133
Wait, I read that backwards. Disable the image nodes you don't want with ctrl+b (right click bypass)
>>
File: ComfyUI_I2V_00520.mp4 (3.36 MB, 864x1064)
3.36 MB
3.36 MB MP4
>>107013660
>>107013999
Here, catch.
https://files.catbox.moe/bfrlr8.json
>>
>>107014169
>https://files.catbox.moe/bfrlr8.json
goat
>>
>>107013756
Try base or hd with >>107014169
>>
>>107013281
For copying likeness there's PuLID for Chroma.
>>
File: 1726075449183071.jpg (7 KB, 249x250)
7 KB
7 KB JPG
why can't inpainting on comfy be as nice and easy as it is on forge, why this humiliation ritual
>>
>>107011984
It can't do nsfw so is DOA just like ltx1
>>
>>107014413
hoyl fucking brainlet dude, just pass the mask and youre done. is it too much to figure it out? fucking retards I fucking swear
>>
>>107014413
what
it's even better on comfy
>>
File: QIE2509_20251026_00007_.png (1.51 MB, 1472x1136)
1.51 MB
1.51 MB PNG
would you buy her figure?
>>
chroma flash or chroma radiance
>>
>>107014548
both have their cases, generally id go with flash, radiance if you want to do more abstract/soft/smudgy shit (it's undercooked for now)
>>
>>
>>107013796
They most certainly do. You must be fucking something up real bad.
>>107013849
That guide is shit. If you're using the fp8 model that's like 15gb for the model alone. You have to increase the block swap setting to offload. I dunno why your sysram would fill up though.
>>
Is it possible to fix the banding on Qwen?
>>
File: 1603582965.png (1.52 MB, 896x1152)
1.52 MB
1.52 MB PNG
>>
>>107011984
>6 fingers
>Terrible physics knowledge
>Slopped to hell and back

Why does it have to be such a half-assed implementation of a video model though? It's not even Wan tier. Why can't they get it right?
>>
>>107014413
use swarmui



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.