/g/ - Technology

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106860668

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: 1756555258994530.jpg (263 KB, 1216x832)
1st to 1girl
>sneed and seethe, ramlets
>>
File: WAN2.2_00325.mp4 (3.07 MB, 544x960)
>>106866765
glory to 1girl, the lifeblood of /ldg/
>>
https://poal.me/8buvc2
https://poal.me/8buvc2
https://poal.me/8buvc2
>>
File: IMG_1487.jpg (1.61 MB, 1808x3216)
>>106866715
great thread, made by me, Nick
>>
>>106866765
this took RAM?
>>
>>106866745
our chinese allies are hawking these threads, my nunchaku post got deleted too.
GRIM
>>
File: 00001-2976551893.jpg (979 KB, 2048x2560)
>>
>>106866848
Yes, my technical skills have just plateaued at using SwarmUI with LoRAs.
Anything more complicated than that confuses and angers me.
>>
File: radiance.png (2.87 MB, 864x1488)
>>
>>106866888
radiance is at the same time good and fucking garbage, idk how the furry did it
>>
File: 1736957004037194.jpg (186 KB, 1280x1280)
>>106866858
Yeah there's usually at least one insurgent who eventually shows up and tries provoking me to get myself b& whenever I post here.
You get used to it after a while.
>>
>>106866888
I don't /get/ radiance. it's technically very cool, but I don't understand it. I don't know how to prompt for it, I have no context for its training or how the model thinks, and I have no idea how to control the output.
it's like I'm talking to a friggin alien.
guess I'm sticking with it until I come up with some better determinative tests.
>>
File: 00004-4051304265.png (3.7 MB, 2048x2560)
>>
is it possible to have comfyui output a command that i can copy paste then run in the command prompt or something?
>>
File: Yuja01.png (2.77 MB, 2048x1152)
>>
reeeeeee i just want to stitch clips together to make a long video but IT CHANGES THE COLOR FOR NO REASON AND COLOR MATCHING DOESN'T WORK
>>
>>106867222
What software are you using to edit them? Anyway, you can use Photoshop to take a still frame from both clips and make a simple LUT, then import that.
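
A "simple LUT" built from two still frames can be sketched as per-channel histogram matching. This is a stand-in technique, not whatever Photoshop actually exports. It assumes you already have one color channel of each frame as a flat list of 0-255 ints (loading the frames is left out), and the function names are made up for illustration.

```python
from typing import List, Sequence


def build_channel_lut(source: Sequence[int], reference: Sequence[int]) -> List[int]:
    """Return a 256-entry LUT mapping source levels onto reference levels."""
    if not source or not reference:
        raise ValueError("both channels need at least one pixel")

    def cdf(values: Sequence[int]) -> List[float]:
        # Cumulative distribution of the 0-255 levels in one channel.
        hist = [0] * 256
        for v in values:
            hist[v] += 1
        total = float(len(values))
        running, out = 0, []
        for count in hist:
            running += count
            out.append(running / total)
        return out

    src_cdf, ref_cdf = cdf(source), cdf(reference)
    lut, j = [], 0
    for level in range(256):
        # Advance to the first reference level whose CDF reaches the source CDF.
        while j < 255 and ref_cdf[j] < src_cdf[level]:
            j += 1
        lut.append(j)
    return lut


def apply_lut(values: Sequence[int], lut: Sequence[int]) -> List[int]:
    """Remap every pixel of one channel through the LUT."""
    return [lut[v] for v in values]
```

Build one LUT per channel from the last frame of clip A (reference) and the first frame of clip B (source), then run every frame of clip B through it.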
>>
File: 00053-2789875215.png (803 KB, 768x960)
>>
>>106866888
Looks like SD1.5. Are you getting paid for this or what?
>>
>>106867222
have you tried kijai's color match node?
>>
>>106866800
I can't figure out how to get this shit to work, I never get results anything like that.

you using text2vid or image2vid?
>>
what's the simplest workflow/template to do text/image to gif?

using ComfyUI
>>
>>106867222
>>106867307
i tried photoshop, davinci resolve, and the color correction node.

i have not heard of LUTs, i only used the built-in color matching. are you saying i can derive something from the colors of the original video and then apply it to the second video?
>>
>>106867433
>lay both clips side by side
If I had photoshop installed I could show you if you gave me two still frames but I don't have any software at the moment.
>>
File: 00054-3348602719.png (901 KB, 768x960)
>>
>>106866820
You neglected to include the only model worth talking about right now
>>
File: WAN2.2_00351.mp4 (3.84 MB, 608x496)
>>106866800
>>106867380
I'm currently doing only t2i
>>
>>106867588
guess my 3070 just ain't cutting it.
>>
is there a way to like save seed settings into an image or whatever so if you lose them you can reload the image and get the settings back?
>>
File: file.png (2.46 MB, 1280x1280)
>>106866905
>>
File: 1744258792533054.png (952 KB, 768x1280)
>>106867841
Thanks, I hate it.

https://civitai.com/models/539391/go-do-a-crime-or-meme-concept
>>
>>106867485
actually i just found a decent solution using a smooth cut in davinci resolve. the color matching isn't 100% there but it's not obvious that there is a cut when i blend the last two frames with the first two frames
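
Blending the last two frames of one clip into the first two of the next can be sketched as a plain linear crossfade. Resolve's Smooth Cut does its own thing internally, so treat this only as the simplest version of the idea. It assumes each frame is a flat list of 0-255 ints and both clips have the same resolution; the function name is hypothetical.

```python
from typing import List, Sequence


def crossfade(tail: Sequence[Sequence[int]], head: Sequence[Sequence[int]]) -> List[List[int]]:
    """Linearly blend the tail frames of clip A into the head frames of clip B."""
    if len(tail) != len(head):
        raise ValueError("tail and head must contain the same number of frames")
    n = len(tail)
    blended = []
    for i, (frame_a, frame_b) in enumerate(zip(tail, head)):
        # Weight of the incoming clip ramps up across the overlap
        # without ever hitting exactly 0 or 1.
        w = (i + 1) / (n + 1)
        blended.append([round((1 - w) * a + w * b) for a, b in zip(frame_a, frame_b)])
    return blended
```

With a two-frame overlap the incoming clip gets weights of 1/3 and 2/3, so the blended frames replace the last two of clip A and the first two of clip B.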
>>
File: ComfyUI_temp_ptyda_00001_.jpg (888 KB, 1536x1536)
>>
>>106867833
this happens by default
it saves the entire workflow not only the seed
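
If you want to pull the settings back out without opening ComfyUI, here is a minimal sketch that reads the text chunks of a PNG using only the standard library. It assumes the image is an untouched PNG straight from the default save node and that the workflow sits in a tEXt chunk under the key "workflow"; if your file was re-encoded (for example to JPG) the chunk will be gone.

```python
import json
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_png_text_chunks(data: bytes) -> dict:
    """Return all tEXt key/value pairs found in raw PNG bytes."""
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    texts = {}
    pos = len(PNG_SIGNATURE)
    while pos + 8 <= len(data):
        # Each chunk: 4-byte big-endian length, 4-byte type, data, 4-byte CRC.
        length, ctype = struct.unpack(">I4s", data[pos:pos + 8])
        body = data[pos + 8:pos + 8 + length]
        if ctype == b"tEXt":
            key, _, value = body.partition(b"\x00")
            texts[key.decode("latin-1")] = value.decode("latin-1")
        elif ctype == b"IEND":
            break
        pos += 12 + length
    return texts


def load_workflow(path: str) -> dict:
    """Read a PNG from disk and parse the embedded workflow JSON (key is assumed)."""
    with open(path, "rb") as fh:
        chunks = read_png_text_chunks(fh.read())
    return json.loads(chunks["workflow"])
```

Dragging the same PNG back onto the ComfyUI canvas does this for you; the script is only useful for batch inspection.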
>>
Wan animate is dead right?
>>
File: 1720744198642191.jpg (28 KB, 600x570)
generating 1 frame takes like 5 seconds so why can't it generate 10 frames in about a minute and make a gif, or just dump the raw frames so I can make the gif myself?

it takes like 15 minutes and it never comes out right
>>
>>106868051
it's pretty shit yeah
>>
>>106868094
Same reason you can't generate a still image 1girl by generating a million 1x1 pixels in isolation.
>>
File: file.png (3.43 MB, 864x1488)
>>106866895
It is very good and has ~the right training data (unlike most corpo models) but he has comparatively little compute resources. That's basically how.

To me this result is still great.
>>
File: radiance.png (3.06 MB, 864x1488)
>>106866945
>it's technically very cool
to some degree it's "making do" but it is cool, yes

>I don't know how to prompt for it
primarily booru tags though you might also want some descriptive sentences. but it didn't learn "everything" a booru has

>I have no context for its training
he's primarily training furry and regular booru data, I think also some misc other stuff

>I have no idea how to control the output
i have no such issues, but indeed it doesn't do complex multi-character interactions, just mostly 1girl
>>
File: radiance.png (3.47 MB, 864x1488)
>>
File: 00055-3552743803.png (1.58 MB, 1280x768)
>>
File: guardianmed.png (2.52 MB, 1727x1320)
>>
File: radiance.png (3.14 MB, 864x1488)
>>106868094
> just dump the raw frames and I can make the gif
that's basically what almost all standard video workflows already do, except the last node runs a video codec instead of dumping a gif... but you do have the frames

as to why it's not as small/fast as generating individual frames: each new frame is connected to one or a few of the previous frames AND the prompt, which it needs to logically continue. that's just more memory usage and/or more computation.

the models that can sensibly do this "thinking" also tend not to be the smallest, fastest models. apart from that it isn't much different, though
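
Once the frames are on disk, turning them into a gif yourself is one ffmpeg call. A minimal sketch that only builds the command: the frame naming pattern, output name and 16 fps default are assumptions, so match them to whatever your workflow actually writes.

```python
import shlex
from typing import List


def gif_command(frame_pattern: str, out_path: str, fps: int = 16) -> List[str]:
    """Build an ffmpeg argument list that turns numbered frames into a GIF."""
    # Generate a palette from the frames and apply it in the same pass,
    # which avoids the banding of ffmpeg's default GIF palette.
    palette_filter = "split[a][b];[a]palettegen[p];[b][p]paletteuse"
    return [
        "ffmpeg", "-y",
        "-framerate", str(fps),
        "-i", frame_pattern,
        "-vf", palette_filter,
        out_path,
    ]


cmd = gif_command("frames/frame_%05d.png", "out.gif")
print(shlex.join(cmd))  # copy-paste this into a terminal
```

Pass the list to `subprocess.run(cmd, check=True)` if you would rather run it from Python.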
>>
>>106867271
he does it for free even
>>
File: radiance.png (3.11 MB, 864x1488)
>>
File: yakub.jpg (45 KB, 776x776)
So is SDXL still the best model to do this?
I am trying to make Wan 2.2 porn videos based on photorealistic NSFW images of fictional characters in different sex positions. There are loras for different positions and acts for Wan 2.2, but as I understand they need the reference image to be in that position already for good results.
So I need to make a good base image for i2v using character LORAs.
Flux can do this, but I can't find a way to consistently get rid of the "plastic" Flux look, which completely ruins it for me. Maybe I just suck at this, but it also seems worse than SDXL at combining multiple loras together (character + sex act etc.). The Krea spinoff is better for realism, but it lacks the LORA library and its compatibility with base Flux LORAs is limited.
Flux LORAs can be converted to Chroma and it doesn't need any sex position LORAs as it knows NSFW out of the box but it needs a lot of luck with limbs as Chroma is too schizo until someone makes a finetune with decent anatomy. Hopefully in the future but doesn't seem to be an option now.
Qwen seems promising but there are currently few to no character and NSFW LORAs?
The realism mixes of Pony/Illustrious aren't good for this because they don't have a lot of the specific "character" loras I am looking for, if you know what I mean.
So this leaves me with good old SDXL?
I haven't really used non-booru finetune SDXL for more than a year so I am a bit lost as to what current meta is.
I am following 1girl rentry in the OP and downloaded Lustify. Didn't really have too great results though. (Bad hands plus not replicating the character accurately enough, though I haven't tested too much) Maybe I just don't remember prompting meta, parameters or whatever for base SDXL too well.
Regardless does anyone here have any checkpoint and other recommendations for this task?
>>
File: radiance.png (2.56 MB, 864x1488)
>>
File: radiance.png (3.03 MB, 864x1488)
>>106868523
your personal sense of 1girl aesthetics, required base poses and so on matter, but a realistic SDXL derivative might well be the best base, yes
>>
>>106868523
>it needs a lot of luck with limbs as Chroma is too schizo
what positions get fucked up the most? I'm making photo loras so I might as well throw explicit stuff in one of them
>>
>>106868523
1) Do one LoRA-less gen whose only job is to produce controlnet inputs: openpose (high strength) + depth maps (med/low strength) is my guess, play around.
2) Feed the controlnets to your character-LoRA-friendly model of choice.
3) Upscale/crop to the Wan 2.2 target resolution using your preferred skin texture model (e.g. an SDXL lora on IL for the skin, which is okay because you're doing i2i with low denoise)
4) Use this as your first frame for Wan.
>>
File: radiance.png (3.13 MB, 864x1488)
>>
File: file.png (1.93 MB, 1328x1328)
>>
>>106868499
>Lord VRAM, ancient Egyptian
>turn rocks into crystals
>inscribe ancient glyphs to make them think
>invent computer
>seal technology away for thousands of years
>early 21st century, tech discovered again
>humans use language models to teach it to generate art
>tfw they only use it for anime porn
>>
>>106868580
why is literally me Elijah Wood
>>
is there a NSFW version of this thread ?
>>
>>106868586
>tfw they only use it for anime porn
Truly the apex of human civilization.
>>
File: 00056-189150352.png (1.53 MB, 1280x768)
>>
Is there any place where people share datasets for celebs?

>>106867588
How did you manage the 70's/pseudo-VHS look here? Is it all post-processing?
>>
>>106868564
Almost all of them. The torso is fine but any combination of arms, legs, hands and feet turns out deformed to varying degrees the majority of the time. The faces can also get fucked in my experience if there is more than one character in the image.
>>106868569
Controlnet is interesting but I am not sure I need it. I was referring to missionary, cowgirl, etc. with positions rather than uber specific positions of limbs.
Might do that if I end up bored but it's not a priority now.
>Use this as your first frame for Wan.
That's the problem part.
I need a model that has:
NSFW knowledge or LORAs
Has character LORAs and can replicate them accurately
Good realism
to be already in the position of the sex act in the first frame.
>>
>>106867949
Can you show how you did it? I also have resolve and have no idea how to blend the colors.
Colors changing over one video, then even more across multiple videos, drive me insane, especially as no one seems interested in correcting that.