/g/ - Technology




Unable to Masturbate Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106787650

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
Blessed thread of frenship
>>
okay now i'm loling because i finally realized the OP images have been bad on purpose

you got me in the first half baker not gonna lie
>>
sex with jenny
>>
File: i2843.jpg (381 KB, 1280x896)
>>
File: ice protestor.png (1.97 MB, 1024x1024)
>>106790544
fist prost
>>
any way to avoid degradation when using last image to gen next video in wan 2.2 i2v?
>>
which workflow produces the best gothy Halloween bitches?
>>
>>106790558
She's hot. Notice how she doesn't have the retard groove in her chin.
>>
File: 00054-3889755807.png (1.02 MB, 1344x768)
>>
>MayLi
>Chinese researcher GF
>Jenny
Who's next?
>>
File: 1745732111133228.png (777 KB, 1360x768)
BUY NOW
>>
File: 1567197793783.gif (2.65 MB, 320x240)
>>106790446
>1024x1024
>all that blurry scuff
LMAO EIGHTY BILLION
>>
File: ? - !.png (56 KB, 214x226)
>>106790588
32GB?
>>
File: WanVideo2_2_I2V_00552.webm (3.67 MB, 704x1280)
>>
File: 1736404921670692.png (787 KB, 2000x1600)
>>106790596
>LMAO EIGHTY BILLION
b-but, it's 1st on that memeboard though!!
>>
File: 1638542121251.png (79 KB, 271x270)
>>106790603
mommy
>>
File: 1750150493441272.png (776 KB, 1360x768)
the man is pointing at a screen at a game show, with an Nvidia GPU, and the text RTX 6090 below it. The price "$10000" is below the GPU. The screen has a chart showing RTX 6090 with a very big red bar and RTX 4090 with a tiny blue bar. Keep the man's expression the same.

TRUST THE CHART
>>
Is this a real person or just some AI generated slut?
>>
>>106790621
I hope the chinks release an at least somewhat reasonable vram monster at some point
>>
File: 00063-2728510861.png (2.76 MB, 1248x1824)
>>
>>106790637
the US will ban that card if it exists lol
>>
File: 1731676055772267.png (893 KB, 1360x768)
you WILL give Jensen your money
>>
>>106790603
original pic?
>>
>>106790588
I think it's rumored to be 30% more powerful in raster vs the 5090 and way more in raytracing.
No idea how this will translate in AI workloads.
>>
>>106790654
frame 1
>>
>>106790662
it's going to be expensive as fuck cause it's a new node unlike the 50 series, so it's actually much faster.
>>
>>106790641
how did you get that cool tattoo without it looking weird? and is it easy to make it glow in your wf?
>>
>>106790588
>>106790621
>>106790647
I cannot wait to pay $3500 msrp + tip for 36gb of vram!
>>
>>106790671
try reddit
>>
>>106790645
I will fly there and smuggle it in in my ass if i have to
>>
>>106790645
That'll just accelerate BRICS membership/allegiance. If the physical things exist, they are going to get used.
>>
>>106790645
Good time to not be a mutt
>>
>>106790664
webm compression fucks the quality
>>
>>106790647
I've already given him like 15k in the last 5 years, I'm tapped out bros...
>>
>>106790679
you're right, I forgot you couldn't ask questions in ldg
>>
/ldg/ has a lot of incorrect information regarding Hunyuan Image 3.0. A simple comparison between Hunyuan Image 3.0 and Qwen Image was conducted, and it was concluded that it is not much better than Qwen Image, let alone SDXL. Such incorrect information has misled many people. Hunyuan Image 3.0 is an autoregressive LLM that integrates a diffusion model. Therefore, it has "concepts" and can handle tasks that traditional diffusion models cannot, such as creating similar comics from simple prompts or customizing every part of the image (the number of prompts it can understand far exceeds Qwen Image). Currently, there are only four image generation models with this capability: Google's nano banana, ByteDance's Seedream 4, OpenAI's image1, and finally Tencent's Hunyuan Image 3.0. Therefore, the open-source community has for the first time obtained a model that is extremely close to the closed-source SOTA models. Unfortunately, it has not sparked much hype.
>>
>>106790712
do i have to handhold you you stupid nigger?
>screenshot frame 1 in your video player if you have a micro competent one that includes a screenshotting feature
>use web search of choice to reverse image search
>((???))
>realize she's just your generic brown hoe mommy and you can just gen fifty million of those in any sdxl checkpoint
>>
File: britbongruok.png (458 KB, 716x895)
LOL
>>
>>106790732
You could have written half as much "reverse google search a screenshot of the first frame" or better, uploaded an image and be done with it.
>>
>>106790732
bro wrote 50 words instead of just uploading the pic he had just used to gen the video he posted
>>
seems like it's that time of the month for anons
>>
>>106790755
>s
>>
>>106790741
poor UKeks
>>
>>106790732
i searched it and it didnt look like its based on an existing woman so i just want to get the full original gen image quality instead of crf 19 webm compressed one
>>
>>106790762
SEX
>>
>>106790741
honestly, I don't think any local model is compliant.
>>
File: mermaid-caught-fishnet.jpg (1.45 MB, 2349x1339)
>>106790588
> $10000
Wouldn't shock me at this point
>>
>>106790732
>>106790732
>use web search of choice to reverse image search
Do you know what year it is? Reverse image search has been intentionally crippled, especially for people, by every service for going on five years now.
>>
File: boin.mp4 (1.41 MB, 1120x1440)
>>106790741
UK wanted this to happen. Unfortunately they don't have a loicense for the common internet anymore.
>>
File: 1748315880876983.png (26 KB, 1272x287)
>>106790778
Google went from god like in reverse image to pathetic.
>>
>>106790741
the population of UK is a cattle one, they probably praised for this lol
>>
>>106790745
>>106790749
>chooses to write a response that wastes even more time than i used to write my longwinded tirade
what did you hope to accomplish?

>>106790778
>intentionally crippled, especially for people
half true, half skill issue. i don't have this problem unless it's a subject that just doesn't have many photos as is.
>>
>>106790786
I'm sure nier automata got popular only because of the curves of 2b, I find it really mid compared to nier replicant
>>
>just reverse image search obvious flux chinned ai whore #11billion
What did anon mean by this
>>
>>106790808
Dude, you look like an annoying loser.
>>
>>106790809
I really enjoyed the game, 2B being hot helped but the game was good.
>>
>>106790741
>oi wheres your loicense mate
>>
>>106790792
google is such a snake in the grass.
>>
>>106790588
preordering
>>
Anyone tried using local models for texturing 3D models?
>>
>>106790792
it started even before refusing completely to reverse people, back when if you searched for specific hentai it would go retarded on purpose and only get you "cartoon"
>>
>>106790827
Like, chill dude.
>>
>>106790848
You would want to train a model exclusively on texture maps.
>>
>>106790654
>>
where to sell my 4090, dont need anymore
>>
Looking like the only gpu I can consider at this point, for ai, is the 9070xt, or wait for celestial.

Any chance of a 9070xtx? I know a gre version is rumored, which is WORSE.
>>
File: 1743660402729014.png (1.92 MB, 1697x1308)
>>
>>106790903
>20gb
7 years ago, the RTX titan was launched with 24gb of vram, we are really stagnating at this point...
>>
>>106790915
we will never get another 1080ti
we will NEVER get another high vram gpu for the average consumer that doesn't cost an arm and a leg

ai must die
>>
>>106790915
unfortunately incredible amounts of money via national financial institutions going for cryptomemes and now the at least more useful enterprise AI captured the market, and TSMC/nvidia is both overburdened with demand AND they have only small incentives to improve anything

the drive isn't the same as when it was healthy competition for discerning video gamers with sufficient manufacturing capacity present to manufacture more if you had a gpu that was received really well in the market
>>
File: ComfyUI_00002_.png (969 KB, 1024x1024)
>standing pose of a slender young beautiful northern european woman with floral crown, long black braided hair, hazel green eyes, firm breasts and wide hips, dark makeup, white lace bra, white lace panties, white lace stockings

I'm a very patient and open minded person, but I feel like someone on /g/ is gaslighting everyone about Chroma. Is there perhaps a NSFW version of FLUX instead?
>>
>>106791012
>I'm a very patient and open minded person, but I feel like someone on /g/ is gaslighting everyone about Chroma.
I don't think so, a lot of people are critical of that model (for good reasons)
>>
>>106791002
>the drive isn't the same as when it was healthy competition for discerning video gamers with sufficient manufacturing capacity present to manufacture more if you had a gpu that was received really well in the market
China has all the incentive to rugpull the US economy, which will be trivial to do if they even announce a semi-real competitor.
>>
>>106791012
Skill issue.
>>
Jenny, Jenny, dreams are ten a penny
Leave them in the lost and found.
>>
File: 1754924662603621.png (9 KB, 899x784)
>>106790741
not just the uk. almost all of western Eu is censored. artistic sites are still available, but for famous adult sites, it's the same punishment as the UK. i'm glad they're too lazy to censor niche adult sites
>>
>>106791020
making good GPUs is hard bro, it's gonna take some time before anyone can catch up to Nvidia
>>
File: booooooba_1.webm (3.87 MB, 752x1376)
>>106790603
Is this a real person or AI?
>>
can chroma do gore? what's a good gore prompt?
>>
File: flux_krea_00007_.png (1.07 MB, 1024x1024)
>>106791022
Chroma skill issue. She is literally brown my dude and painted for some reason. This doesn't happen in flux.
>>
>>106791046
>i'm glad they're too lazy to censor niche adult sites
yeah, once they hit pornhub I was like "yeah and?" this site was dead several years ago when they wiped the amateur videos in it
>>
>>106791020
it's not the US economy making these tho, it's kind-of a global network of suppliers ending up at TSMC a company which itself is immensely capable

i do think it's actually somewhat bottlenecked
>>
File: ComfyUI_temp_norxq_00001_.png (2.14 MB, 1440x1440)
>>
File: WanVideo2_2_I2V_00571.webm (3.73 MB, 704x1280)
>>106791052
its ai
>>
>>106790741
OI MATE NO ANIME TITTIES FOR YOU
>>
>>106790786
Now that's a cute kitten.
>>
>>106791110
also Florida?
>>
File: 1742986559148845.png (1.32 MB, 856x1216)
qwen edit 2509 is so neat.
>show the girl from behind
>but it's a gravure video, so what?
you can do this with any image. how would you do this with inpainting otherwise, with the same outfit and proportions?

and you can manipulate ANY object, not just girls.
>>
>>106790786
It's for the best. Explicit images only incite jeets to rape.
>>
>>106791047
True, but then again they said the same thing about frontier models, then DeepSeek fell out of the sky.

>>106791074
Yeah, but right now so much US stock valuation is tied up in AI and NVIDIA specifically, that if NVIDIA stock prices fall just a tiny fraction or even "only" fail to grow "fast enough" then it will set off a series of crashes.
>>
>>106791124
>and you can manipulate ANY object, not just girls.
Girls are the best kind of object to manipulate.
>>
>>106791131
As you said, DS happened and Nvidia dipped but alas, no crashes. Which I deeply regret.
>>
>>106791139
true, that's why everyone who learns to draw starts with boobs first.
>>
>>106790875
thanks
>>
>>106790875
>>106791155
can you also catbox it? or at least give the model and prompt? thanks
>>
>>106791094
kek
>>
File: IMG_0202.png (2.88 MB, 2732x2048)
How is /ldg/ genning on this fine weekend? I’m comfy in bed slopmaxxxing on my ipad with some music going and the dogs snoozing at my feet. This is the life lads.
Running a batch on the new wainsfwv15 to see if there's any big difference.
>>
>he opened his port
>>
>>106791213
Router level blocked from any traffic outside of the LAN. I’m stupid but not terminally so.
>>
>>106791046
>i'm glad they're too lazy to censor niche adult sites
The thing with lists and time is that they can only grow bigger.
>>
>>106791196
I am trying to understand a wan long video workflow to replicate it
>>
>>106791124
>you can do this with any image.
you can, but I find it slops it hard once you start turning them around. And I can never get the panties to be micro enough. Probably a skill issue.
>>
>Running a batch on the new wainsfwv15 to see if its any big difference.
lol
>>
>>106791253
nah, qwen image edit is just slopping shit if it changes the image too much, it is what it is
>>
File: 1608301352326.jpg (87 KB, 536x656)
Do qwen and nunchaku qwen share the same seed outputs?
>>
>>106791244
Once it can be done it’s like magic. I cranked it to one this morning that was like a 10 minute long video of 3 “no sex” OF whores giving a group blowjob. You could tell it was AI cause the cock kept having weird morphs and movement, but still, believable enough and it was 10 minutes long and seamless. Crazy stuff. Not sure how they managed it.
>>
>>106791253
>And I can never get the panties to be micro enough
Not until someone properly trains on various underwear. Its current understanding of lingerie, underwear, and many sexier clothes is quite bad.
>>
>>106791057
>good gore prompt?
Manbearpig.
>>
>>106791260
An anon in a previous thread said he noticed it was “better” or something like that. No better way than to try it for yourself. What am i supposed to do, just not believe anyone on here ever? I love me slop.
>>
>>106791281
That explains some of my issues. I can get SDXL to run panties between pussy lips but Qwen is a hard no.
>>
>>106791277
>10 minute long video
wtf
I'd already be happy with 60 seconds cut into 6-8s segments
Unless they ran the stuff automatically, then yeah you can get 600s stuff, but how did they manage to do it without the result looking hyper-degraded by the 5 min mark
>>
>>106791290
what kind of model is kontext and qwen image edit?
>>
>>106791290
Yeah, and for some reason outside of nudify loras quickly banned from civitai, not much is released for QIE.
>>
>>106791304
I think very clever stitching of clips. I have read on here that you can use the first frame to start and sort of maintain coherence that way with a strong prompt. I don’t do video though so don’t quote me on that.
>>
>>106791277
>seamless
i have never seen this. does it use the last frame as an image2video input and glue the clips together? otherwise there would have to be something more going on. how would it not degrade in quality?
>>
>>106791311
I'm not sure what you mean
>>106791315
Would it be hard to train yourself?
>>
File: mermaid-hair-brush.jpg (1.1 MB, 1768x1768)
Is joycaption still the way to caption images locally for non-booru models? Or has something better come around?
>>
>>106791327
Now that i think about it it had some clever cuts that almost made it seem continuous but thinking back i recognize it now. Like one girl i think took a shirt off and covered the camera doing so, and it didn’t look exactly the same after the shirt came off. Mind you, i had my cock in hand so i didn’t stop to closely analyze it. May have been fake in some other manner than i2v
>>
>>106791321
The issue is that each new clip uses the last image of the preceding one, which means it has gone through VAE decode each time, and that degrades the image.
Think of it like saving a jpg over and over.
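
The "saving a jpg over and over" analogy can be sketched with a toy lossy round trip. This is not a real VAE and not Wan's pipeline: `lossy_round_trip` is a hypothetical stand-in (a mild 3-tap blur) for any decode/re-encode pass that loses a little detail each time, and the 16-sample "frame" is made up for illustration.

```python
# Toy model of generation loss. NOT a real VAE: the blur below is only an
# assumed stand-in for a lossy decode/re-encode pass between clips.

def lossy_round_trip(signal):
    """One lossy pass: a mild 3-tap blur with clamped edges."""
    n = len(signal)
    out = []
    for i in range(n):
        left = signal[max(i - 1, 0)]
        right = signal[min(i + 1, n - 1)]
        out.append(0.25 * left + 0.5 * signal[i] + 0.25 * right)
    return out


def mean_abs_error(a, b):
    """Average absolute difference between two equal-length signals."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def chain_errors(original, passes):
    """Error against the original after each successive round trip."""
    current = list(original)
    errors = []
    for _ in range(passes):
        current = lossy_round_trip(current)
        errors.append(mean_abs_error(original, current))
    return errors


# A hard edge, standing in for a sharp detail in the last frame of a clip.
frame = [0.0] * 8 + [1.0] * 8
errors = chain_errors(frame, 10)
print([round(e, 4) for e in errors])
```

Each pass starts from the already-degraded result of the previous one, so the error against the original frame keeps accumulating instead of staying at the single-pass level.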
>>
File: radiance.png (2.78 MB, 848x1488)
>>106790809
people enjoy either game or even just 2b, but yea replicant was better

>>106791125
i'm sure britain is safe now, <safety> always works

>>106791131
>US stock valuation
i often found that inexplicable, some companies seem valuated like they owned their entire supply chain to the bottom. unfortunately it doesn't usually help us streetside peasants much.
>>
>>106791361
Oh god please don’t be the witchfaggot from /sdg/, that guy is the fucking WORST.
>>
File: ComfyUI_01909_.png (2.69 MB, 1024x1536)
>>
File: ComfyUI_01913_.png (2.59 MB, 1024x1536)
>>
>>106791406
try kl_optimal
>>
>>106791196
wai v15 is great, my go-to anime model, and hassaku is good too.

use this extension for easier prompting/booru tags (what the model is trained on):

https://github.com/DominikDoom/a1111-sd-webui-tagcomplete

so if you type fate it will show all tags related to fate (characters, series, etc). very useful tool.
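
A rough sketch of what that kind of tag completion does, assuming a simple substring match ranked by post count. The tag names and counts below are made up for illustration, and `complete` is a hypothetical helper, not the extension's actual code or API.

```python
# Hypothetical tag list with made-up post counts; the real extension ships
# its own tag files and matching logic.
TAGS = {
    "fate_(series)": 250000,
    "fate/grand_order": 180000,
    "fate/stay_night": 60000,
    "artoria_pendragon_(fate)": 55000,
    "1girl": 6000000,
}


def complete(query, tags=TAGS, limit=5):
    """Return tags containing the query, most-used first."""
    q = query.lower()
    matches = [t for t in tags if q in t.lower()]
    matches.sort(key=lambda t: tags[t], reverse=True)
    return matches[:limit]


print(complete("fate"))
```

Matching on a substring rather than a prefix is what lets a character tag such as `artoria_pendragon_(fate)` show up alongside the series tags.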
>>
>>106791253
for anything bordering on nsfw use the qwen clothes remover lora and just prompt for a tiny bikini or whatever.
>>
File: ComfyUI_01921_.png (1.92 MB, 1024x1024)
>>106791424
>try kl_optimal
Was on 12 steps ddim/simple before. This is 20 steps ddim/ddim_uniform. ddim_uniform was the most coherent background. I went down the line on schedulers.
>>
>>106791342
maybe https://huggingface.co/quarterturn/molmo-flux-captioner ?
>>
File: ComfyUI_01927_.png (1.92 MB, 1024x1024)
>>
>>106790544
https://github.com/temporalscorerescaling/TSR
>We present a training-free mechanism (TSR) to steer the sampling diversity of denoising diffusion and flow matching models, allowing users to sample from a sharper or broader distribution than the training distribution. We validate its effectiveness on 2D toy data and demonstrate its applicability to real-world tasks.
>>
File: greta.png (1.46 MB, 1024x1024)
>>106791342
damnit I only wanted the hat
>>
>>106791465
>ddim
There's a ddim sampler? Where did you get it?
>>
File: file.png (2.6 MB, 2770x1235)
>>106791491
>>
>>106791494
prompt
the mermaid in image A is wearing the hat on the girl in image B
>>
File: ComfyUI_01915_.png (2.49 MB, 1024x1536)
>>106791502
Built into ComfyUI. My workflow is 100% native Comfy Chroma
>>
>>106791491
>>106791506
>another sampler cope
great, you can put it on the pile of thousands of sampling copes
>>
>>106791491
lol
>>
>>106791474
It looks about the same age as joycaption, but may be worth taking a look at, thanks
>>106791494
That's pretty funny.
>>
File: 1744641474552991.png (2.73 MB, 1693x1035)
show the girl from behind.

technology is neat, huh?
>but you can just use the camera mode
the point is you can do this with ANY image.
>>
>>106791556
yeah qwen edit is the first thing I've come across that I can see myself using for real tasks outside of cooming. I kneel to China for this one.
>>
>>106791451
Ty based 1girl appreciator
>>
File: 1733945210647940.png (2.14 MB, 1268x1125)
>>106791556
>>
lumina/yume is pretty nice, any possible speedups? anything lower than 30 steps is pretty bad, something like dmd2 would be nice for batches
>>
>>106791342
joycaption is trash
use qwen vlm of whatever size you can manage, or hire an organic peon if you want the results to actually be good.
>>
>>106791556
try it on the behind image with
>show the girl from the front
or whatever works to turn her around. it should be funny. Try doing this over and over lol
>>
Depending on the lora, it seems that the prompt is just a placebo effect. I did some tests with image2video, describing the scene in detail, and then writing random things, and the results are pretty much the same.
>>
File: 1753035914630271.png (995 KB, 768x1360)
>>106791570
or with the second image node enabled and with an image:

the girl is wearing the outfit from image2.

eve denton, her hips are augmented.

also, this is far superior to inpainting for outfit swaps.
>>
Why does Chroma keep giving me shitty grainy images even if I'm at 1024x1024? Should I risk it all and pull to see if that fixes it?
>>
>>106791605
Give it more steps
>>
File: 0100.jpg (1.04 MB, 2744x1144)
>>106791556
How fine-grained is the control? Can you put two people in an image and tell it to have one look at the other while keeping the rest of the pose unchanged? Can it tweak eye levels? That could be extremely useful for multi-subject images, for which inpainting has always been frustrating, for me at least.
>>106791589
I will look at qwen vlm, thanks.
>>
>>106791608
>>106791605
(or fewer)
(or add a nonsense negative)
>>
File: 1728467183770483.png (944 KB, 768x1360)
>>106791591
had to change the prompt for the back photo, it kept it at the back.

show the girl from the front.

didn't specify asian so it has to guess. still neat, you can prompt whatever details you want.
>>
>>106791608
I'm doing 30 steps already do I seriously have to do that?
>>
>>106791605
>Why does Chroma keep giving me shitty grainy images even if I'm at 1024x1024?
it's a shit model, that's pretty much it
>>
>>106791614
you can do multi image stuff fine, sure. when referencing you can say "image2" or "image3" to refer to the nodes, instead of saying the man/woman (description). it's a really neat model.
>>
>>106791619
Are you using flash HD?
>>
>>106791617
what happens if you say
>show the old woman from the front
>>
>>106791625
I see people making good stuff with it. Are they just rolling the dice and not posting their bad gens?
>>
>>106791645
Anon that’s every model even with the newest ones
>>
>>106791635
No but I've tried it and that model introduces entirely new problems so I'm just not bothering with it for now. I planned on messing with that later.
>>
File: 1749530363522140.png (950 KB, 768x1360)
>>106791637
as I said, you can prompt anything and the transform will do it. or use an image source (image2, image3)
>>
File: ComfyUI_01934_.png (1.75 MB, 1024x1024)
>>106791605
The base version requires a lot of steps. I'm using 20 steps ddim/ddim_uniform for Chroma1-HD-Flash atm. There's a flash lora for the other versions that's pretty new actually https://huggingface.co/silveroxides/Chroma-LoRAs/tree/main/Chroma1-Flash
>>
>>106791663
transform example:

show the woman from the front. she has gigantic boobs and is japanese.

https://files.catbox.moe/8av0pr.png
>>
>>106791665
>The base version requires a lot of steps.
why? after v30 he changed the training method so that the model can render shit at lower steps
>>
>>106791681
Just because it can, doesn't mean it looks good.
>>
File: 1730977505737026.png (1.29 MB, 856x1224)
the woman on the right is dressed as Hatsune Miku, and has long teal color twintails.
>>
What's Qwen Edit's native resolution? SDXL works at 1024, WAI 15 at 1536. what about Qwen?
>>
>>106791658
Try my WF. The chroma cache node is really good.
https://files.catbox.moe/h0y7ua.png
>>
File: 1745791555271504.png (1.21 MB, 856x1224)
>>106791697
the two girls are doing a slav squat.
>>
File: ComfyUI_01942_.png (1.66 MB, 1024x1024)
>>
>>106791726
>slav squat
we need to go way deeper
>>
>>106791745
Deep enough to sit on my shrimpdick
>>
File: 1749462915216166.png (1.43 MB, 856x1224)
>>106791726
change the location to a slum in India filled with garbage, beside a trash filled river. the girls both have their arms at their side.

kek
>>
File: 1729960582720747.png (1.44 MB, 856x1224)
>>106791766
better raven:
>>
cozy bread
>>
>>106791164
sry didn't see this, here's momacita https://files.catbox.moe/3xs695.png
>>
>>106791589
>qwen vlm
have to check it out, joycaption makes so many mistakes it's exhausting
>>
>>106791832
based
>>
>>106791589
>qwen vlm
What version can do nsfw?
>>
>>106791832
lol looks like any of a number of aunts or cousins I have. Turns out the power of AI wasn’t making the girl of your dreams, but just your average Mexican seven, based.
>>
Hi
Can I link Lora in “series”?
For example:
>generation model X
>output linked to Lora of woman in bikini
>output from lora above, linked to Lora of milk bath

And the result would be a woman in a bikini taking a milk bath?
Note: I made up the examples for illustrative purposes only.
>>
>>106791665
I tried some of the loras you linked and I couldn't get them to work that well. I'll try the full flash model later today.
>>
>>106791911
Yes. And assuming ComfyUI, you can make this braindead simple by using lora stackers.
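
One hedged note on what "series" means here: as far as I understand it, chained LoRA loaders do not pass an image from one LoRA to the next. Each one adds its own low-rank delta to the same model weights, and generation runs once on the combined result. A minimal sketch of that idea with tiny made-up matrices (`lora_a`, `lora_b` and all the numbers are hypothetical, and this is plain Python, not ComfyUI code):

```python
def outer_scaled(b_col, a_row, scale):
    """Rank-1 LoRA-style delta: scale * outer(b_col, a_row)."""
    return [[scale * b * a for a in a_row] for b in b_col]


def add(weights, delta):
    """Element-wise sum of two equally sized matrices."""
    return [[w + d for w, d in zip(rw, rd)] for rw, rd in zip(weights, delta)]


# Made-up 2x2 "model weight" and two made-up rank-1 LoRA deltas.
base = [[1.0, 0.0], [0.0, 1.0]]
lora_a = outer_scaled([1.0, 2.0], [0.5, 0.0], scale=1.0)
lora_b = outer_scaled([0.0, 1.0], [1.0, 1.0], scale=0.5)

# "Chaining" just means both deltas end up added to the same weights.
a_then_b = add(add(base, lora_a), lora_b)
b_then_a = add(add(base, lora_b), lora_a)
print(a_then_b, b_then_a)
```

Because the deltas are simply summed, the order of the loaders should not matter in this simplified picture; what matters is each LoRA's strength and whether the two fight over the same concepts.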
>>
>>106791873
>Turns out the power of AI wasn’t making the girl of your dreams, but just your average Mexican seven, based.
a Mexican seven (WITHOUT TATTOOS) is basically a 9 in any big city in 2025
>>
https://huggingface.co/nunchaku-tech/nunchaku-sdxl
so this only works with base SDXL right? is there no way to make it work with XL mixes?
>>
>>106791873
i have been making crazy ai porn for over a year, but typically a pretty face and cleavage is all i need
>>
File: RA_NBCM_00037.jpg (824 KB, 1872x2736)
>>
>>106791124
Reo Fujisawa
>>
>>
>>106791697
>Hatsune Miku
Wonderful gen as always my king.
>>
>>106791990
>an autistic weeb
I'm hearing about this now for the first time
>>
File: mivolo_lolo.jpg (143 KB, 868x780)
mivolo2 works with comfyui when using the timm pull.
the age estimation works great, it would just be nice to have it in a custom node package
>>
still don't get what chroma 2k is
>>
>>106792009
more coherence on higher Mpx count
>>
I remember a node that was able to toggle different groups on/off, but I can't remember its name. Anyone know the name?
>>
>>106792038
Quick Group Bypasser from rgthree
>>
>>106791451
That’s a great extension, I have a lot of fun just messing around with it. One I’ve discovered recently, which has been very fun to use is “doll joints”. Different flavour for my 1girl gacha
>>
>>106792051
Thank you anon, it was "Fast Groups Bypasser (rgthree)", searching bypasser was enough.
>>
>>106792067
only thing to change in the settings is to enable "show all results", so if one term like "fate" has a ton of results, you can scroll the tags.
>>
anyone knows if loramanager is compatible with subgraphs? it's not able to see my lora nodes when they are in a subgraph for some reason
>>
what's subgraph?
>>
>>106791663
interesting. messed up the hands, but still, you could just turn the working one into old, or try some seeds
>>
>>106791665
>ddim/ddim_uniform for Chroma1-HD-Flash
does it support cfg 1?
>>
>>106792145
comfyui feature
>>
File: dmmg_0129.png (1.34 MB, 832x1216)
>>106791065
>>106791012
yeah flux ain't messing up something that simple (cept for the chin)
>>
>>106791451
is there a web version to just try typing tags?
>>
File: 1739138746966306.png (1.31 MB, 880x1184)
put the girl in image2 beside the bicycle in image1.

neat, original was black and white at first too.
>>
>>106792193
Yes, but base or 2k with flash lora is better
>>
>>106792145
something redditors don't use
>>
>>106792215
not sure, extension itself is small though, idk if there is a standalone one.
>>
>>106792223
>2k
I don't know what 2k is. I have Chroma 50 annealed, at q8.
>>
File: ComfyUI_01946_.png (1.8 MB, 1024x1024)
>>106791681
>The base version requires a lot of steps.
>why? after v30 he changed the training method
Couldn't tell you

>>106791920
>tried some of the loras you linked and I couldn't get them to work that well. I'll try the full flash model later today.
F, okay

>>106792193
>cfg 1?
Yes https://files.catbox.moe/i159xk.png
>>
>>106792145
Think of it like a function. You squirrel off a bit of code into its own little package, if you know the inputs you feed in and the outputs you want out. It is good for both organization and reuse of repetitive tasks.
>>
>>106792248
>I have Chroma 50 annealed
Lol that's the aborted version lmao. Use base, holy fuck.
>>
File: FluxKrea_Output_237728.png (2.27 MB, 1024x1496)
>>106790613
I literally haven't seen anyone even attempt to provide specific output examples with prompt of things that e.g. Qwen could not ever do under any circumstances
>>
File: 1755613911132592.png (1.09 MB, 880x1184)
>>106792222
remove the girl in image1 leaning on a bicycle.
>>
File: 15 steps.png (1.69 MB, 1024x1024)
euler, 15 steps of chroma. It can do a lot with 15 steps. Sometimes, paradoxically, the hands are better at fewer steps. Not this time. Trying 50 steps now.
>>
>>106792276
the current current current actually for realzies super final this time I swear guys "HD" is objectively better than Base
>>
File: 1748684133341439.png (1.12 MB, 880x1184)
>>106792281
now that I have an empty bike, can re-enable image2 (the other girl)

the girl in image2 is leaning on the bicycle.
>>
>>106792306
*sitting on, wrong paste.

pretty neat though, doing this even with openpose or inpainting would be hard if not impossible to do properly.
>>
>>106791451
>go to anime model
Here we use real anime models like base noob.
>>
>>106792295
ok keep eating shit retard. Don't ask for help next time
>>
File: 1745967752674657.png (1.16 MB, 880x1184)
>>106792306
same thing, diff image2 image. booba lady kept her beer, dont drink and drive!
>>106792322
it works with any checkpoint you like. including noob. all it does is show tags relevant to your current text.
>>
File: QwenEdit_00120_.png (1.02 MB, 768x1360)
working with more than 1 image really slows qwen edit right the fuck down
>>
Can qwen edit turn a picture into the openpose stick figure?
>>
>autistic slop loving miku spammer can't read
Figures.
>>
File: 1735901408087015.png (1.3 MB, 992x1048)
the girl in image2 is sitting at the table with the girl in image1.

cool.
>>106792354
use aio aux preprocessor node -> openpose in the menu. that gives you an openpose for the image.
>>106792358
base noobai is fine, but wai 15 is really good too.
>>
File: x.png (2.1 MB, 1024x1024)
>>
File: dmmg_0135.png (1.09 MB, 832x1216)
>>106792287
i find it's usually way better to just use refiners if you're happy with everything else at lower steps.
>>
>>106792354
some anon was doing it a few threads back, apparently it works well
>>
>>106792276
Wait, what happened to annealed?
>>
>merge chroma hd and the flash delta weights to see if anon was lying
>he wasn't lying
>it's better than the actual flash checkpoint
so what the FUCK is the checkpoint in the flash repo?
>>
File: comp.jpg (1.8 MB, 3456x2304)
I keep telling you but you don't listen. Non-flash model + flash lora is better than flash HD.
>>
>>106792398
This seems quite lewd tbqhwy
>>
File: 1738534396880877.png (1.27 MB, 992x1048)
>>106792364
>>
>>106792408
I made it in post-nut clarity. Just removed the horse strap-on from the prompt.
>>
>>106792398
If you're talking about the flash loras from silveroxides then those all suck except for the v47-flash-heun stuff. Merging the delta weights is the best option.
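A minimal sketch of what "merging the delta weights" could look like, assuming the delta file stores per-tensor differences under the same key names as the base checkpoint. That layout is an assumption, not something confirmed in this thread. Real checkpoints would be loaded as tensors; plain lists of floats stand in here so the snippet runs without dependencies. `merge_delta` and `scale` are hypothetical names.

```python
def merge_delta(base, delta, scale=1.0):
    """Return base + scale * delta for every key present in both dicts.

    Assumes (hypothetically) that delta uses the same key names and
    shapes as base; keys missing from delta are copied unchanged.
    """
    merged = {}
    for key, weights in base.items():
        if key in delta:
            diff = delta[key]
            if len(diff) != len(weights):
                raise ValueError(f"shape mismatch for {key}")
            merged[key] = [w + scale * d for w, d in zip(weights, diff)]
        else:
            merged[key] = list(weights)
    return merged


# Toy stand-ins for a base checkpoint and a delta file.
base = {"blocks.0.weight": [1.0, 2.0], "blocks.0.bias": [0.5]}
delta = {"blocks.0.weight": [0.25, -0.5]}

merged = merge_delta(base, delta)
print(merged)
```

Lowering `scale` below 1.0 would apply only part of the delta, which is roughly what dialing down a lora strength does.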
>>
>>106792414
Kek
>>
File: 1750569261047348.png (1.17 MB, 992x1048)
my vision is augmented
>>
>>106792415
>Merging the delta weights is the best option.
They are bigger than the entire Q8
>>
we'll get a chroma finetune any day now, right?
>>
>>106792435
Not my problem if you can't use the full weights.
>>
does qwen image edit have the ability to do a face swap that isn't just a photoshop style clone?
>>
File: 1740786216102836.png (1.19 MB, 992x1048)
>>106792432
>>
>>106792449
for that use reactor

https://codeberg.org/Gourieff/comfyui-reactor-node
>>
>>106792449
(like this seems to)
https://github.com/TencentARC/PhotoMaker
>>
upscaling is by far the most time consuming process for image gens. is there anything that optimizes or speeds it up?
>>
File: sana_thunberg.png (1013 KB, 768x1360)
hmm I wanted the face too.
Let me try again.
>>
>>106792480
Use topaz externally.
>>
>>106792432
>my vision is augmented
beer goggles count
>>
File: 50 steps.png (1.73 MB, 1024x1024)
>>106792287
changing steps from 15 to 50 changed the style. Chroma is very random. With Flux that usually doesn't happen, with euler.
>>
aside from money, what is stopping me from taking sdxl and training it on t5xxl until it werks?
>>
>>106792491
is that base model or using a lora? pretty good, that's the eva artist I recognize the name.
>>
>>106792488
sanna missing few white lines of party powder
>>
>>106792501
There are already merges with Gemma
>>
>>106792509
can i use any model with that or are you talking about the anime models
>>
>>106792516
Why would you use sdxl for anything but anime?
>>
>>106792497
Almost like Chroma is a bad model that isn't fit for people to gen with and was instead released with the expectation that someone else would make a finetune that fixes it, which will likely never materialize.
>>
>>106792504
lora, needs more/better training
>>
>>106792524
why would I use sdxl for anime? qwen with loras is better for that
>>
>>106792534
Why doesn't WAI or Noob put their billions of datasets into Chroma? You have to contact them personally and share your findings
>>
File: 1749524212225050.png (1.26 MB, 1360x768)
the girl in image2 is sitting at a table beside the girl in image1.

kek, fun model
>>
File: 1747324024491072.png (1.24 MB, 1360x768)
>>106792555
the girl in image2 is sitting at a table in the distance, near the girl in image1.

okay that one is neat.
>>
File: x.mp4 (1.49 MB, 800x1856)
>>106792432
nice
>>
File: 1759407378519971.png (1.24 MB, 1360x768)
the girl in image2 is sitting at a table in the distance facing the camera, near the girl in image1.
>>
>>106792555
Gotta actually put the numbers onto the images?
>>
>>106792587
you can describe the characters but it's easier to describe the node as well, in case you have 2 asian characters for example.
>>
File: 1756889953676607.png (1.05 MB, 1360x768)
>>106792575
the girl in image2 is sitting on a beach chair beside the girl in image1.
>>
File: 1752879526046934.png (1.32 MB, 1360x768)
>>106792605
one more:

the girl in image2 is sitting on a beach chair beside the girl in image1. keep the location the same.

if you say to keep it the same it won't gen a new spot, or you can make one if you want.
>>
>>106790741
>3 month old news LDG already went apeshit over
uh... okay...
>>
>>106791406
CFG fried brudda
>>
>>106792554
I wish we could have Illustrious with T5 text encoder
>>
vibevoice is awesome https://files.catbox.moe/mt1az5.webm
>>
>>106792398
pls poast fullres of top mid
>>
>>106792694
can it do a good bane voice? e5-tts could but I havent tried vibevoice
>>
File: sanna_thunberg.png (1.01 MB, 768x1360)
progress is being made
>>
>>106792696
https://files.catbox.moe/w0fitt.png
>>
>>106792701
probably. i tried voices distorted by radio and it carried over. it does accents well, i used a spanish voice clip and had her talk in english and it sounded like a good accent
>>
File: 1737137243476989.mp4 (889 KB, 720x752)
>>106792411
>>
File: 1757688546359511.png (1.24 MB, 1360x768)
the girl in image1 is sitting in a chair watching a large movie screen in the background, with the text "LDG" at the top of the screen, and a picture of Hatsune Miku below it.

from a random fishing screenshot of stellar blade.
>>
>>106792677
I mean (you) contact them and show them your Chroma discoveries so they make a finetune on Chroma with their datasets
>>
>>106792694
seems pretty good. is there a repository of voices for this one?
>>
>>106792722
>Puts the beer down into the plate.
>>
File: 00078-3783741923.png (2.69 MB, 1824x1248)
>>
Nub question here: I downloaded wan+ComfyUI+controlnet on the recommendation of an anon in a different thread about how to create video. I downloaded it all and managed to get it running, but the model refuses to generate lewd. How do I make it do that? I don't think it's an issue with prompting, so I think the model itself might be censored?
>>
File: 00080-1903473118.png (2.29 MB, 1248x1824)
>>
>>106792248
Why are you using v50 still. Either use Base or 2k.
>>
>>106792823
with controlnet or specific lora it should do quite a lot of lewd

without it, it isn't really trained with sex or the lower human genitals, no. you might get nude boobs but not generally much more
>>
File: notquite.png (924 KB, 768x1352)
not quite
>>
>>106791709
anyone???
>>
>>106791709
I think 1024x1024? But I am not anywhere close to sure about that.
>>
>>106791709
i think it was something like 384 to 3072 in either dimension (API version reference)
>>
File: 00084-575163550.png (2.49 MB, 1248x1824)
>>
File: blur.png (1.02 MB, 1024x1024)
>>106792528
Could be. Is non-annealed better? For no reason I got a blurry gen, yes at 15 steps, but blurry? Just weird. it was a bad-ish prompt (I just like to try random prompts I find).
>>
>>106792781
>is there a repository of voices for this one?
not that i am aware of. grabbing wav files from video games works very well.
>>
>>106792896
qwen native is 1328x1328
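For picking a size at other aspect ratios, something like the sketch below might work. It takes the numbers quoted in this thread at face value (1328x1328 native, 384 to 3072 per side) without verifying them, and the snap to multiples of 16 is an extra assumption. `fit_resolution` is a hypothetical helper, not part of any UI.

```python
import math

NATIVE_SIDE = 1328              # native side length quoted in the thread
MIN_SIDE, MAX_SIDE = 384, 3072  # per-dimension range quoted in the thread
MULTIPLE = 16                   # assumed alignment, not confirmed anywhere


def fit_resolution(aspect_w, aspect_h):
    """Pick width/height near the native pixel area for a given aspect ratio."""
    target_area = NATIVE_SIDE * NATIVE_SIDE
    ratio = aspect_w / aspect_h
    width = math.sqrt(target_area * ratio)
    height = width / ratio

    def snap(value):
        # Clamp to the quoted range, then round to the nearest multiple.
        value = min(max(value, MIN_SIDE), MAX_SIDE)
        return int(round(value / MULTIPLE)) * MULTIPLE

    return snap(width), snap(height)


print(fit_resolution(1, 1))
print(fit_resolution(16, 9))
```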
>>
>>106792854
I don't know what those are. I remembered lodestones and checked to see if there were new chromas.
>>
>>106792931
I just use OBS to grab a ten or twenty second clip from youtube and use a video upload node to take the audio only and plug it into the speaker input.
>>
>>106792934
Whaaat?! You aren't intimately aware of the totally not confusing and oftentimes contradictory information surrounding chroma that is in no way the fault of the maker who seems to be allergic to documentation of any kind?
>>
File: ComfyUI_01962_.png (1.54 MB, 1024x1024)
>>
File: very_mildly_sinful.mp4 (548 KB, 1120x1632)
>>106792931 >>106792945
so it works that well? i suppose that's good too.

if someone makes a collection of samples that work good let us here know tho.

>>106792824
>>
File: random.mp4 (497 KB, 1120x1632)
>>106792824
very random gen but i like how wan understands this glowing cross
>>
File: RA_NBCM_00040.jpg (750 KB, 1872x2736)
>>
When ready

>>106793003
>>106793003
>>106793003
>>106793003
>>
File: radiance.png (2.91 MB, 848x1488)
>>106792934
yea the "release" chromas are chroma 1 base and 2k, what he is working on is radiance (model architecture change without VAE)
>>
>>106793028
girl is hot !!!
>>
>>106792369
buttchin
>>
File: no blur.png (1.08 MB, 1024x1024)
>>106792926
aha. regular non-annealed (aka base?) chroma 50.
>>
>>106792861
I tried it specifically with a lora that was meant to do nudes. Maybe I'm just not cut out for this stuff.
>>
limit


