[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Copyright Safe Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106844207

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
How does Illustrious keep winning without doing anything?
>>
File: 00013-3514250720.png (1.4 MB, 752x1128)
1.4 MB
1.4 MB PNG
>>
>>106848731
no weights no care
beyond that, basically all open weight anime models are useless beyond noob and variants right
not sure that's an illustrious win so much as an everyone else loss
>>
>>106848731
they didn't get the genius idea of fucking with the training halfway through and actually completed training unlike the competition
>>
>>106848731
lumina will save us
>>
Hello, **/ldg/** downloaded chroma1HDGGUFFP8_fp8ScaledHybridRev2, running on swarm, what do you think? I want a clean OC characters anime/2d gens.
What samplers, steps, cfg, loras or chroma versions work best?
Please, share your workflow if you got one!

In /sdg/ gave me some help >>106848657 but looking for more suggestions!
>>
>>106848748
how can it when everyone is fine-tuning an unfinished model? pre-training is the most important stage
>>
noob is fucking trash, lumina sucks and pony just committed suicide

illustrious reigns
>>
>Qwen image / edit
>Wan 2.2
>Chroma
>Neta / Lumina
>Illustrious / Noob / SDXL
>Pony

Localsisters ...
>>
>he hasnt tried yume v3.5
>>
File: 00000-509227090.png (2.85 MB, 1024x1280)
2.85 MB
2.85 MB PNG
>>
Sorry for the repost, I ended up at the end of the last thread…

Has anyone who’s tried Grok Imagine found even a moderately comparable comfy based workflow (using any GPU available on runpod)? New to wan and I can’t believe how good the grok results were for nsfw photo animation, but public results and moderation flakiness kills the concept… however for me it has set the bar and I wonder what’s even realistic to achieve diy. New to diffusion and been learning lora training for qwen and chroma and having good results with t2i but now that I’ve seen what grok can do with a single image I wonder whether just how close anyone can hope to come with available models. Default comfy wan 2.2 i2v on an L40S gave me some interesting results but not even in the ballpark of grok but that’s my current starting point for learning. Any tips appreciated
>>
why would i use neta lumina or noob when illustrious has everything already, i dont get it.
>>
>finally get the motion I'm after
>it fries the parts into plastic

REEEE
>>
>>106848783
It's better than base Illustrious 2.0 and certainly better than 0.1 / the other versions. The author finally cracked it, it seems.
>>
File: loicence.png (73 KB, 808x921)
73 KB
73 KB PNG
is nunchaku a meme for QIE?
>>
File: ChromaGTA6_00005_.jpg (645 KB, 1176x1672)
645 KB
645 KB JPG
>>
>>106848859
illustrious doesnt have a chad vae
https://civitai.com/models/1790792/netayume-lumina-neta-luminalumina-image-20
>>
File: 00002-3390168000.png (2.33 MB, 1024x1280)
2.33 MB
2.33 MB PNG
>>
>>106848783
just tried 3.5, the anatomy is still dogshit. this shit is worthless
>>
File: ChromaGTA6_00008_.jpg (702 KB, 1176x1672)
702 KB
702 KB JPG
>>
>>106848987
post your prompt and settings and if youre lucky ill tell you what youre doing wrong
>>
I'm literally on drugs rn
>>
>>
>>106849000
>and if youre lucky ill tell you
nah, im not going to play gacha on whether you feel like helping or not, faggot.
>>
>>106849028
no worries ill accept your concession
keep at it tho i believe in you anon
>>
>>
>>106849007
model?
>>
>>106849039
you never planned to even remotely help even if i complied, bitchboy. i already deleted it too, dead model. i havent seen a single image made with lumina that looks good and you wont post one either since you know they sucks kek
>>
File: 00004-2813936462.png (1.99 MB, 1024x1280)
1.99 MB
1.99 MB PNG
>>
File: ChromaGTA6_00011_.jpg (726 KB, 1176x1672)
726 KB
726 KB JPG
>>
File: 1000003021.png (2.28 MB, 1536x1024)
2.28 MB
2.28 MB PNG
>>106849085
show me illustrious attempting to do 5girls all individually prompted as well as this
https://civitai.com/images/105196050
>>
>>106849117
yikes
>>
>>106849129
we all know illustrious couldnt get close to that without regional prompting even with the fuck ups kek
>>
>>106849020
this is what i want
>>
>>106849117
you cant GENUINELY fucking tell me you think this looks good
>>
>>106849156
lol i was thinking the same thing
>>
File: Untitled.png (1.69 MB, 896x1056)
1.69 MB
1.69 MB PNG
>>
>>106849156
>>106849133

you could post an illustrious version of 5girls but you know it wont even be close
>>
File: ChromaGTA6_00013_.jpg (589 KB, 1176x1672)
589 KB
589 KB JPG
>>
>>106849117
>>106849133

SDXL will always be best because it is the most logical and directive model for tagging. These newer models with enormous tag plus prose prompts are tedious for editing. They are not logical. With SDXL I feel like I am using software with checkbox and sliders where "1girl, crouching, happy, yellow hair" gives me exactly that. With Neta or Chroma I have to prose slop it. I will never know the better way to prose it, whether verbose or direct or whatever.
>>
>>106849117
Okay now make yuri porn out of this. If you can't do this, it's shit.
>>
>>106849179
you dont need to use NLP with it. tags only works fine
>SDXL is the end all be all
kek
>>
>>106849194
>kek
Retard.
>>
By the time Neta gets as friendly and useful as SDXL, we will already have multiple models made open weight like NovelAI 4.5 with its style and character transfer tools and all this Neta stuff will be pointless
>>
File: 00027-3879850952.png (485 KB, 752x1128)
485 KB
485 KB PNG
>>
File: 1000003024.jpg (36 KB, 448x176)
36 KB
36 KB JPG
>>106849211
yeah its IMO the best version. substantially less overt "default style" as well
>This version is a pre-trained model (I’m not sure what to call it, but it’s basically a continuation of the previous work by the Neta team, using the Neta Lumina v1.0 model). To clarify further, versions 2.0 Plus and 3.0 were fine-tuned from this pre-trained model. My workflow involves using the best checkpoint from this pre-trained model at that time and fine-tuning it.
>>106849232
it already is, for me. youll have to wait for a shitmix though it seems
>>
File: ChromaGTA6_00015_.jpg (829 KB, 1176x1672)
829 KB
829 KB JPG
>>106849179
>Chroma I have to prose slop it.
Just sentence or two with natural language, rest can be tags.
>traditional illustration of a sitting demon girl next to a tree, morrigan aensland from vampire \(game\). 1girl, demon girl, green eyes, long hair, green hair, large breasts, cleavage, leotard, pantyhose, bare shoulders, head wings, bat wings, low wings, purple wings, bat wings, bat print, bridal gauntlets, detailed skin texture, bursting breasts, river, brown oak tree, leaf, Expertly drawn and painted pin-up style artwork of a demon girl with wonderful details. Pastel colors in the background.
>>
>>106849256
fullres of pic here whoops https://civitai.com/images/105200664
>>
>>106849179
>With Neta or Chroma I have to prose slop it.
That's purely a flux problem, and old school t5 hybrid models. But mostly a flux problem. No recent models with good prompt comprehension has it, which includes HiDream (which gen extremely bad quality images, to be fair) and Qwen Image.

With recent models, natural words behave like your keywords/tags. If you write "a gorgeous angel with angel wings three meters right to that point with huge golden halo doing a middle finger", you will, actually, get an angel, with angel wings, three meters right to that point, with a golden halo, doing a middle finger. Crazy I know.

Flux is so bad in prompt understanding we used models to be able to purple prose Flux in just the right way in order to make it understand, but it's not a general problem. Just a Flux one.
>>
File: 00005-2739731255.png (1.33 MB, 1216x832)
1.33 MB
1.33 MB PNG
>>106849169
hmmm
>>
File: IMG_1487.jpg (1.61 MB, 1808x3216)
1.61 MB
1.61 MB JPG
>>106849236
>>
>>106849272
post metadata and prove that was purely prompt
>>
>>106849283
https://files.catbox.moe/f5ki67.png
>>
>>106849272
That 4ch VAE looking real rough desu
>>
>>106849292
ah well I always use oekaki as a style tag which is the main offender I think
>>
File: wan22___0001.png (1.57 MB, 832x1216)
1.57 MB
1.57 MB PNG
>>106849117
it's an ambitious prompt
>>
>>106849290
i wont point out how you used a shitmix but for some reason i can only see the negative prompt in my text editor
>>
File: ChromaGTA6_00017_.jpg (862 KB, 1176x1672)
862 KB
862 KB JPG
>>
File: chroma___0039.png (1.21 MB, 832x1216)
1.21 MB
1.21 MB PNG
i think i'm done with chroma, going back to flux
>>
File: timewaster.png (580 KB, 1859x525)
580 KB
580 KB PNG
>>106849325
>>
File: Screenshot_2772.png (657 KB, 1823x484)
657 KB
657 KB PNG
>>106849325
though this only worked since all these characters are frequently drawn together. I doubt i could get 5 characters from different franchises and especially not 5 OCs
>>
>>106849343
ishygdds
>>
>>106849256
wait 3.5 is completely different... lets see how many pissing korbos differ!!!!
>>
>>106849325
>i wont point out how you used a shitmix
When will anon learn that comparing slopmerges to actual tunes is retarded
>>
File: ChromaGTA6_00019_.jpg (741 KB, 1176x1672)
741 KB
741 KB JPG
>>
>>106849375
it has an updated booru dataset as well. up to september
>>
I tested the latest (definitive?) Chroma 1 HD version of Chroma using the provided workflow, and the results where lackluster. But it's nothing like the last checkpoint I used, so I must be doing something wrong.
>>
File: chroma___0041.png (1.32 MB, 832x1216)
1.32 MB
1.32 MB PNG
actually. lodestone might have cooked here
>>
Is it possible to control lora influence in output in a gradual manner? Like, "Only start applying lora after x amount of steps" and things like that.
I tried searching around and couldn't find any info or nodes that would manage that, so I imagine it's just a thing that doesn't really work like that?
>>
>>106849463
Use Base or 2k
>>
File: 00005-2589835627.png (1.1 MB, 960x768)
1.1 MB
1.1 MB PNG
>>
>>106849472
Yeah, downloading base now. I don't know how I got to that HD thing.
>>
File: ChromaGTA6_00024_.jpg (701 KB, 1176x1672)
701 KB
701 KB JPG
>>106849472
I've been using Chroma-DC-2K-T2-SL4-Q8_0
>>
>>106849503
link? google and HF search is not helpful. I thought HD was the recommended version?
>>
>>106849517
damn I wonder if this is the version that was deleted
>>
File: file.png (96 KB, 290x301)
96 KB
96 KB PNG
always converge
>>
Is there something like an llm node that will cocnvert a flux style prompt to a tag based prompt?

>>106849263
Oh, that's good to know tyyy
>>
File: file.png (1.74 MB, 1024x1536)
1.74 MB
1.74 MB PNG
I like the new neta yume
>>
>>106849550
Any decent instruct llm will do that for you.
>>
File: chroma___0042.png (982 KB, 832x1216)
982 KB
982 KB PNG
>>106849483
the documentation says to use it, that's how i got there.
>>
>>106849464
Resembles Chris-chan quite a bit. If Chris-chan was a milf.
>>
lmao, Sora Pro killed everything >>>/wsg/5994477
>>
>>106849576
Just pay $2 per gen
>>
>>106849576
It's impressive for what AI can do right now, but if that was a real anime, I wouldn't get past that shit opening.
>>
>>106849533
I still believe
>>
File: 00158-3661929894.png (742 KB, 960x768)
742 KB
742 KB PNG
>>
>>106849464
this looks like the most stereotypical midwestern woman ever
>>
>>106849465
if you do a two step sample this is trivial to do
>>
>>106849525
>>106849517
>>106849503
https://huggingface.co/silveroxides/Chroma-Misc-Models/tree/main
The memeversions are in a separate repo
>>
Just forget about Chroma since he's moving on to Qwen now
>>
File: r.png (2.51 MB, 848x1488)
2.51 MB
2.51 MB PNG
>>106847834
awesome miku!

>>106849329
the looks work here too
>>
>>106849623
Doesn't doing 2 step + 2step samplers work completely differently than 4steps though? As in, it doesn't actually process the image the same and would give a botched result
>>
File: chroma_flux__0004.png (1.59 MB, 832x1216)
1.59 MB
1.59 MB PNG
>chroma 60%, flux 40%, flux facedetailer
i can live with this, except the weird haloing

>>106849607
raised on tater tot casserole
>>
>>106849651
>The memeversions are in a separate repo
It's the best version so far, meme or not
>>
File: chroma_flux__0005.png (1.58 MB, 832x1216)
1.58 MB
1.58 MB PNG
>>106849666
nah i do it all the time, literally what i'm using with these images. just have to make sure if the VAE changes you reencode the image to the correct one
>>
File: 00159-1975300475.png (829 KB, 960x768)
829 KB
829 KB PNG
>>
>>106849660
the situation is funny though
>Pony fag: "Hey lodestone, want to join us and work on V8? we won't use Chroma btw, your finetune is shit, Qwen is the perfect candidate"
>Lodestone: "AWESOME, CONSIDER ME IN"
>>
>>106848927
I'll never understand Americans linking watermelons to black people
>>
>>106849687
cool
>>
File: ChromaGTA6_00026_.jpg (717 KB, 1176x1672)
717 KB
717 KB JPG
>>106849464
korn fans used to be younger, damn
>>
File: 00160-3009570065.png (953 KB, 960x768)
953 KB
953 KB PNG
>>
>>106849790
>>106849278
>>
>>106849767
everyone used to be younger, anon
>>
>>106849712
Because people have eyes? Black people unironically disproportionately like grape juice, fried chicken and watermelon. Anti-racists are such fags, you can't see anything with your eyes you're so mind broken.
>>
>>106849671
>>106849680
>>106849767

Their grandfathers lost the Winter war
>>
https://github.com/dvlab-research/DreamOmni2?tab=readme-ov-file
the lora has been released btw
https://huggingface.co/xiabs/DreamOmni2/tree/main
>>
>>106849834
>Winter war
Liberate Petsamo !
>>
File: ChromaGTA6_00028_.jpg (551 KB, 1176x1672)
551 KB
551 KB JPG
Harry x gta6
>>
File: chroma_flux__0011.png (1.41 MB, 832x1216)
1.41 MB
1.41 MB PNG
>>106849767
jonathon davis is 54 anon

>>106849680
example workflow
https://files.catbox.moe/55ag6j.png
>>
File: 00161-3571522318.png (952 KB, 960x768)
952 KB
952 KB PNG
>>
>3 Oct, 2025: We've combined DualParal with the Wan2.2-T2V-A14B model.
https://github.com/DualParal-Project/DualParal
>>
File: 1730458698153793.mp4 (3.73 MB, 832x480)
3.73 MB
3.73 MB MP4
>>106849875
https://dualparal-project.github.io/dualparal.github.io/
this is so ass, change the moment of the video several time and the color changes drastically everytime
>>
>>106849825
>disproportionately like grape juice, fried chicken and watermelon
That's just southerners in general you mouthbreather. The whole working south run on fried chicken and Grapico, and if you're not spitting out watermelon seeds at least a couple of times a year, you're not living.
>you can't see anything with your eyes you're so mind broken
You can't see anything with your eyes because they're too close together and misaligned because you were born retarded,
>>
>>106849897
yeah you're a mind broken leftists that can't see reality any more because you're terrified of being called a racist, no need to reply, you're a fag and you'll die from your brain rot
>>
>>106849907
chill
>>
File: 1730702122828964.png (147 KB, 1080x414)
147 KB
147 KB PNG
Will Alibaba save us again?
>>
>>106849525
QRD?

>>106849503
Is this the 2k version?
>>
>>106849907
>everyone whos not retarded is a leftist
>>
>>106849916
no, these people keep shooting people over believing everyone is a racist nazi, I'll chill when they're removed from all social media
>>
File: 00162-1821561776.png (964 KB, 960x768)
964 KB
964 KB PNG
>>
@106849922
d*bo
>>
>>106849917
only if they realise that their SaaS stuff is only relevant if it can beat or match western SaaS
that shit they pulled with Wan 2.5 left me assblasted
especially when Sora, Grok Aurora/Imagine, etc. dropped that totally mogged it
>>
File: radiance.png (3.23 MB, 848x1488)
3.23 MB
3.23 MB PNG
>>106849875
i am not even sure this actually works?
>>
>>106849166
Very cool
>>
>>106849939
to be fair I can see them giving up making video models, their only goal was to be SaaS Competitive, now that Sora exists, might as well give up, the mountain is way too high >>106849576
>>
>>106849917
yus long live china
>>
>>106849890
>>106849940

kek, yeah it just looks like last frame method. there's more wan goodies but nothing usable has yet been released for comfy :(

https://github.com/TencentARC/RollingForcing
https://github.com/NVlabs/LongLive
https://github.com/dc-ai-projects/DC-VideoGen
https://github.com/mit-han-lab/radial-attention
https://github.com/dvlab-research/Jenga

>>106849917
>Wan2.5 5b model
>>
File: chroma_flux__0018.png (1.45 MB, 832x1216)
1.45 MB
1.45 MB PNG
>>106849666
nice trips, but also this link is for you
>>106849855
>>
>>106849935
/d.bo/is
/@\d/is
>>
File: 00047-3519788344.png (803 KB, 1224x768)
803 KB
803 KB PNG
>>
File: ChromaGTA6_00031_.jpg (729 KB, 1176x1672)
729 KB
729 KB JPG
>>106849919
>Is this the 2k version?
yeah, only version I use
>>
>>106849967
>nothing usable has yet been released for comfy :(
can't comfy implement that by himself?
>>
File: 00163-2520552454.png (702 KB, 768x960)
702 KB
702 KB PNG
>>
File: r.png (2.91 MB, 848x1488)
2.91 MB
2.91 MB PNG
>>106849967
DC-VideoGen seems quite interesting if it is gennerally working, but yes I can imagine there's really not enough time or motivtion to implement all the stuff people come up with.
>>
Gemini or JoyCaption for wan captions?
>>
>>106850014
Shieet
>>
File: radiance.png (3.04 MB, 848x1488)
3.04 MB
3.04 MB PNG
>>106849967
radial attention too, that seems very useful if it works as indicated

people come up with some cool stuff. thanks for the linkage.
>>
https://www.reddit.com/r/SoraAi/comments/1o2z011/sora_2_overhyped_and_underdelivers_while_wan/
the fuck is this cope? even the ledditors don't fall to that bullshit on the comments lmao
>>
>>106849983
cute
>>
>>106850052
Sora won't ever show you a blow job and $2/meme seems like a steep price for anyone that isn't a Youtube grifter and even they don't make anything themselves, they just steal other people's posts.
>>
>>106850064
>they just steal other people's posts.
ironic, don't forget Alibaba stole billions of videos to make Wan, I won't die on that hill, that would make me hypocritical
>>
is it just me or is comfyui's subgraph feature completely broken, inconsistent, and unusable?
>>
>>106850014
Melancholy.
>>
>>106850006
not anymore. comfy just pushes PRs nowadays and increments the version. the project is in bloat freefall constantly and the runtime fps keeps getting hit constantly. the new frontend being "faster" is a complete lie since they didn't even fix the fps counter. how the fuck can you claim it's faster if you have no receipts? saying it's an OS is the most delusional thing as well. nobody knows what the fuck to do with the repo anymore and it just keeps getting worse
>>
File: chroma_flux__0026.png (1.55 MB, 832x1216)
1.55 MB
1.55 MB PNG
>>106850002
this is a cool one, anon
>>
>>106850076
seems to work for me.
>>
>>106850014
Fear of a Snack Planet.
>>
>>106849917
Please be a music gen model. Those who tried using Qwen-Omni and uploaded real songs for it to describe know what I am talking about.
>>
>>106850083
Syurptitous.
>>
>>106850096
for those of us who didn't, what do you mean?
>>
File: file.png (28 KB, 699x264)
28 KB
28 KB PNG
>>106850089
maybe its an issue related to the CR upscale image node then, but nothing seems to go wrong when i use it outside a subgraph so its weird
basically the links between inputs and the nodes inside seem to get jumbled because when i made this subgraph at first it worked just fine, but when i drag in the workflow again and try to edit the upscale factor it gives me the upscale mode options instead, even though those inputs are correctly linked inside the subgraph
>>
>>106849503
>>106849381
>>106849329
Can you share your metadata? Getting into Chroma >>106848754 and learning different prompt and setup methods. Your gens are clean and sharp, you seem experienced. Would appreciate if you share it!
>>
File: 00165-3347342587.png (912 KB, 768x960)
912 KB
912 KB PNG
>>
>>106850127

It can describe songs with insane accuracy, it knows its instruments, genre/style, bpm, it can timestamp parts of the song (chorus, bridge, verses etc), can transcribe song lyrics accurately even knowing where the chorus etc are, knows which parts of the song certain instruments kick in etc

Try it yourself any audio under 3 min on:
https://chat.qwen.ai/

They can easily train a Suno tier model with the data they likely have that they used to train Qwen-Omni.
>>
>>106850059
>>106849278
>>
>>106850151
also half the time i cant even leave the subgraph view because escape stops working
>>
>>106850151
nothing jumbled here

>>106850206
also I can leave via escape and clicking the parent graph. is the frontend package up to date and the browser nothing too crazy? i don't really know what's going on on your end tho
>>
File: SPOILER_104983866.jpg (146 KB, 1024x1536)
146 KB
146 KB JPG
Guess which model
>>
>>106850073
>you watched a Disney movie once so you can't ever make animated movies
argument
If you don't understand the difference between copy and pasting a movie and learning to draw your own movie you can't be helped
>>
>>106850227
pony v7 (some people waited 2 years for this btw)
>>
>>106850236
must be what nigbo and Nick are using
>>
>>106850233
sora can literally recreate the cowboy bebop music Opening but go on king >>>/wsg/5993868
>>
>>106850189
Every AI company tips their hand based on what they release as tools, even Sora 2 was obvious after ChatGPT released Whisper showing they were working hard on accurate transcriptions. Even Dalle-3 was preceded by a decent VLM (closed) model.
>>
>>106850236
Yeah! ^^
>>
>>106850244
And it can also reimagine Cowboy Bepop's opening theme as an opera bad faith shill. :)
>>
File: 1760097671771741.png (415 KB, 949x2298)
415 KB
415 KB PNG
>>106850227
Part2 guess which model^^
>>
File: sc4.jpg (196 KB, 1159x919)
196 KB
196 KB JPG
>>106850187
too lazy to clean workflow but this is what makes Chroma gens clean and sharp. ignore tile size setting
>>
File: 1739295360410078.gif (177 KB, 220x165)
177 KB
177 KB GIF
>>106850254
reading those comments is so satifying desu, it's like seeing the bad guy lose on a movie, that never gets old
>>
>>106850260
Thanks, prompts same as your earlier posts? short sentence + tags format?
>>
File: HOLY COPE.png (73 KB, 805x379)
73 KB
73 KB PNG
>>106850254
>noooo you don't get it, it took him 2 years to make that model but it's unfinished, just 2 more weeks bro ;-;
>>
>>106850276
yeah, check civitai for prompts
>>
>>106850260
Model? Any loras ? any tags to give 2d that quality?
>>
>>106850223
pulled after the epsilon scaling feature came out and that made my group node malfunction so i changed it to a subgraph
ill try updating all the packages and see if that does anything
>>
>>106850260
nta, how long does it take you to gen? it takes me 70 seconds with upscaling on a 5090 for chroma.
>>
>>106850288
>he doesn't know that all the hate is just the petra spammer
>>
>>106850288
>ts
I'm seeing this everywhere now. Are people too fucking lazy to write "this" or does it mean something else?
>>
>>106850308
it means tiny sneed
>>
>>106850227
100% chroma. I'll gen in 6 batches at a time and 2 of 6 of them are mangled just like this or even worse, kek
>>
>>106850254
Can you do the same with a Chroma model?
>>
>>106850254
This is when you include Iodestone in your work team
>>
File: radiance.png (2.5 MB, 1488x848)
2.5 MB
2.5 MB PNG
>>106850227
it's unfortunate that the AuraFlow training didn't work out, they maybe kept trying a bit too long instead of trying other models.
>>
>>106850317
no it's pony v7
https://civitai.com/images/104983866
>>
>>106850288
It's wild when you see lots of new models coming out training on a tenth the resources and trained in several months. In the time he's stalled he could've trained a full model from scratch, even Auraflow was trained relatively successfully in a fourth the time.
>>
This is a troll upload, right? He HAS to know the lora is fucked, right?
>>
>>106850308
as I understand it the original meaning was negroidspeak for "this shit" but even that was beyond certain zoomers, who use it to simply mean "this"
>>
File: radiance.png (2.46 MB, 848x1488)
2.46 MB
2.46 MB PNG
>>106850318
the civitai feedback for chroma is good, same for neta-yume lumina IIRC
>>
File: 00166-2940830031.png (731 KB, 768x960)
731 KB
731 KB PNG
>>
File: ChromaGTA6_00037_.jpg (593 KB, 1584x1176)
593 KB
593 KB JPG
>>106850302
this took 147 sec with that i2i workflow. 4070ti super (16gb), no optimizations. With sdxl resolutions it takes way less time

>>106850302
using gta6 lora I trained
>>
>>106850354
Whih model are you using? Chroma 2k?
>>
>>106850354
1girl, full body , smiling, looking at viewer, beach
>>
>>106850076
It is buggy for sure, but right this second it is more useful than not. I have encountered a couple of persistent bugs with placing certain custom nodes in a subgraph, how it interacts with bypasses etc. But for all I know, it is a bug with the custom nodes and not the subgraphs themselves, because you can get things like Impact Switches just suddenly deciding to not take inputs if too many switches already exist etc.
>>
are all the flash chroma checkpoints supposed to look like shit? i'm using the recommended 8 steps with heun and they come out horribly.
>>
>>106850360
https://huggingface.co/silveroxides/Chroma-Misc-Models/blob/main/Chroma-DC-2K-T2-SL4/Chroma-DC-2K-T2-SL4-Q8_0.gguf
>>
I bought a 6000 blackwell for 1girl SDXL slop. I'm built different.
>>
File: 00167-1493705108.png (1.2 MB, 768x960)
1.2 MB
1.2 MB PNG
>>
>>106850367
1faggot, will never be a real woman, typing a comment
>>
>>106850370
Thanks!
>>
If you used pony at any point (even the """good""" version) you are worth less to me than cloudkeks
>>
>>106850373
Kino
>>
all this slop will be forgotten tomorrow, like tears in the rain
>>
File: radiance.png (2.74 MB, 848x1488)
2.74 MB
2.74 MB PNG
>>106850369
i think one anon managed to pick some subject matter and settings where it's ok but most of the (not that many) chroma users including me don't use them

probably just use a non-flash checkpoint if it doesn't work for your prompt
>>
>>106850377
quietly devastated right now
may not recover
tell my 1girl I love her
>>
>>106850390
Honestly, that itself is kind of hot. Thousands upon thousands of women needed to take their clothes off on the internet for me to produce my own endless stream of personalized private porn catered specifically to my tastes. It's unholy, genuinely some kind of social catastrophe, and it is so hot.
>>
>>106850393
yeah, im just using the non-flash ones, i was just curious if i was missing something but it seems to be the general experience.
>>
Why these generals got all branched, sdg, ldg, animedg, etc
>>
>>106850470
for shits and giggles
>>
File: 00007-1749613537.png (1.57 MB, 1024x1536)
1.57 MB
1.57 MB PNG
>>
anything new in the local scene aside from qwen edit
>>
>>106849825
I'm not American, the watermelon thing is very American.
>>
File: ChromaGTA6_00044_.jpg (692 KB, 1128x1648)
692 KB
692 KB JPG
>>
>>106850490
nope
>>
>>106850327
Cant view it, I live in united englanistani but I believe you.
>>
>>106850470
sdg was cancer so ldg was made
cancer from sdg tried to split the general multiple times to kill ldg. at one point there was landscape ldg, realism ldg, etc. now there's just the anime one which is barely alive and only used by like 3 anons.
>>
>>106850484
that's a cool effect, what did you use? or catbox if its easier and you dont mind
>>
>>106850373
When did hasan piker get a new pet?
>>
>>106850535
'dark theme' and oekaki
>>
>>106850509
Anime general was likely made by anons from russian 2ch's /ai/, it even had their info in the first post initially.
>>
>>106850549
Yours looks a lot more painterly, are you using a specific lora? Mine came out more scary with those words.
>>
>>106849465
Should be possible. Something like keyframes,
>>
>>106850590
prompt was:
1girl, skinny, solo, bikini, wide-eyed, wavy mouth, full body, looking at viewer, beach, oekaki, impressionism, painterly, dark theme, jaggy lines, oekaki
with negative: shiny skin, simple background
5.5 cfg. using my own unreleased mix. 28 steps, euler beta
the style is wildcarded so there was two oekaki
>>
>>106849842
anyone tried this lora? does it make kontext better than qie?
>>
>>106850636
I'm not sure it's that simple, it's also using the text encoder Qwen VL (wheras Kontext classic uses T5) so...
>>
>>106850659
I was checking the pipeline, it's still using the dual clip (t5+clip l), it's using the VLM to encode the input OR videos but yes, this needs a dedicated node in comfy most likely. I downloaded the weights at least
>>
>>106850613
I keep getting a fucking spotlight on the character kek. Thanks for the prompt, i'll mess around with it.
>>
>>106850337
he likes 'em skinny what can he say
>>
>>106850707
tomoko?
>>
File: 00171-1885543971.png (1.16 MB, 960x768)
1.16 MB
1.16 MB PNG
>>
What's the state of NovelAI compared to local gen UIs like (re)Forge and Comfy? Many an AI Discord server seems fond of aggressively shilling the former, (NovelAI). I haven't used it since early v3.
>>
>>106850719
>tomoko?
ye, wanted to see if i could get it like yours where she was barely visible. i'm just going to force it with a lora.
>>
>>106850760
I'll join in the dark ladies at the beach genning once I finish this batch of 20+ images of girls pissing in a toilet
>>
>>106850769
I got it to work. Your pissgirl's right leg is FUCKED bro.
>>
>>106850752
Kill yourself.
>>
>>106850677
lool, now you have to load t5 + qwen vl? this is ridiculous, that's why it was a dumb idea to go for kontext, QIE has qwen vl naturally
>>
>>106850778
man im not sure I like this new neta model desu, I have way more fucky anatomy compared to testv4 or v3, generally less stuff MAKES sense, like I'm genning a lot of ladies with panties on the side but still wearing panties, or fused fingers way more than before. or even the pic I posted you see the fucking background? like it compeltely lost cohesion. I'll experiment a bit more, but I might jump back to testv4
>>
>>106850796
doesnt really matter that much desu, you'll be parking the models to ram anyway... you have at least 96gb of ram right?
>>
File: 00172-4042499012.png (1.1 MB, 768x960)
1.1 MB
1.1 MB PNG
>>
File: ChromaGTA6_00045_.jpg (496 KB, 1128x1648)
496 KB
496 KB JPG
>>
>>106850798
i usually just drop the model right away if there's too many issues with anatomy. im just sticking to wainsfwv14 for the time being since 15 gives me too many issues with hands. good luck with your piss adventures.
>>
File: 1737908391038267.png (838 KB, 928x1120)
838 KB
838 KB PNG
Howdy faggots.
Has anything new dropped in the last two weeks?
>>
Can someone explain GGUF/quantization to me as if I'm a drooling retard? I get that it makes it easier to run for lower end hardware, is there any benefit to using it if you have 24gb of vram? I've never had to use it so I've never had to look into it, but I'm curious if I've been missing out on cozy genning speeds
>>
>>106850881
If you can load the full or fp8 version of the model there's no need to use quants
>>
File: 00173-2209518086.png (1.31 MB, 768x960)
1.31 MB
1.31 MB PNG
>>
>>106850870
>Has anything new dropped in the last two weeks?
Check your pants.
>>
>>106850917
>Check your pants.
whoa, what was lumina doing there?
>>
>>106850881
Quantization: if you round the numbers that make up the model, then you sacrifice some precision (the decimal places you rounded away) in exchange for using less space and therefore a speed increase.
GGUF: run on CPU instead of GPU

If you have 24 gb VRAM, you don't really need either, except maybe quants for increasing batch size.
>>
File: 1736927840159366.png (808 KB, 1024x683)
808 KB
808 KB PNG
>>106850917
I'm like 12, so they still haven't dropped yet...
But I'll have you know I've been posting here since I was 1 years old.
>that's how I know all the jokes.
Anyways, what's new?
Has anything surpassed illustrious yet?
>>
nothing will ever surpass illustrious
>>
>>106850944
>Has anything surpassed illustrious yet?
for some anon, yes. others will have to wait for a jeetmix unfortunately.
>>
nothing will ever surpass NTR mix
>>
File: 00070-2827369593.png (1.5 MB, 1224x768)
1.5 MB
1.5 MB PNG
>>
File: 00174-1392688547.png (953 KB, 768x960)
953 KB
953 KB PNG
>>
>>106850969
that thing sure breasts boobily
>>
>>106850891
q8 is better than running fp8.
>>
File: radiance.png (2.56 MB, 848x1488)
2.56 MB
2.56 MB PNG
>>106850944
>Has anything surpassed illustrious yet?
not overall

but for specific uses sure - wan, chroma [radiance], qwen [image edit], neta yume lumina and if you can run it on your H100/H200 hunyuanimage3.0 are better at various things
>>
File: radiance.png (3.25 MB, 848x1488)
3.25 MB
3.25 MB PNG
>>
nobody will read this so ill just say it, i havent even tried to use noob because i dont know what the heck epsilon and v-pred means
>>
>>106851006
Hecking wholesome nigger chungus
>>
>>106850944
>Has anything surpassed illustrious yet?
No. Newer models may have better prompt adherence but they're not fine tuned for anime and therefore lack the vast knowledge XL finetunes have, not to mention the near infinite loras XL has for it.
>>
>>106850943
>>106850891
Thanks doodz. So no speed gains, just strictly space occupied in vram
>>
You should be asking has anything surpassed Noob, not illustrious.
t. day 0 XL user and day 0 illustrious 0.1 user
>>
>>106851021
Googah!
t: caveman who made the paintings
>>
what noob should i be using?
>>
File: 00175-1856583297.png (1.37 MB, 960x768)
1.37 MB
1.37 MB PNG
>>
>>106851084
shuma nigerath
>>
File: IMG_2370.png (695 KB, 678x907)
695 KB
695 KB PNG
>>106850613
>>
>>106850979
youre meaning the degradation of q8 is less than fp8, correct?
>>
>local thread free and open source
>gatekeeps his workflow
>gatekeep his lora
>>
lilbro cannot stand koff posting in the blessed thread of frenship
>>
>>106851124
nicholas is a reject from /sdg/ as debo kept posting one random image in the OP and not nick fe's works
>>106851131
>MrCatJak
>>
>>106851122
yes, q8's quality is closer to fp16 than fp8.
>>
>>106850780
I just want to weigh my options.
>>
File: radiance.png (3.17 MB, 848x1488)
3.17 MB
3.17 MB PNG
>>106851006
v-pred version changed stuff under the hood of SDXL and you need to pick settings that work

the images are often like they have a wider color gamut
>>
File: screenshot.1760133231.jpg (101 KB, 583x321)
101 KB
101 KB JPG
>>106851149
>>106851122
The AI has spoken. Q8 is better.
>>
File: 000006.jpg (1.03 MB, 1488x2000)
1.03 MB
1.03 MB JPG
>>
>>106851124
he just tried the alibaba style kek
>>
>>106851071
chads use vpred 1.0 while those who require a bit more handholding use... i dunno cyberfix or something
>>
There are so many samplers these days. I am completely at a loss as to what is best so I just keep using euler. People make comparisons but it all seems so fickle and unreliable. I tried res 2s / bong_tangent as people were saying that was best with newer models and was surprised at how consistent the image was from seed to seed but I don't know if that's a good thing actually.
>>
File: 00176-2023337502.png (851 KB, 960x768)
851 KB
851 KB PNG
>>
>>106851124
I haven't seen a non-template workflow posted here in months. i dont think anyone shares unique workflows anymore.
>>
How is NoobAI better than Illustrious?
>>
You know what to do.
>>
>>106851308
the king is NAI
>>
>>106851318
Yes, NoobAI
>>
>>106851318
>NAI
uhh, lilbro, that's what unc said
>>
>>106851246
Whenever I do anon asks for the dozen or so nodes I wrote myself and even then he can hardly make heads or tails of my spaghetti kino
>>
>>106851340
>>106851336
I meant NovelAI
>>
https://files.catbox.moe/v1pnmx.png
>>
>>106851357
Why is the king begging in the local models thread? Are you too much of a pussy to go against OpenAI or Google?
>>
File: file.jpg (585 KB, 1440x2560)
585 KB
585 KB JPG
>>106851367
>>
NAI is the king of sending your cunny promptos to the alphabet boys.
>>
>>106851382
who rattled your cage?
>>
File: radiance.png (2.6 MB, 864x1488)
2.6 MB
2.6 MB PNG
>>106851239
feel free to run huge test series to get your statistical evaluation, but it'll probably kind-of vary by model anyhow

simply use some of the samplers you think are good?
>>
File: 18jOn7i.jpg (275 KB, 604x906)
275 KB
275 KB JPG
AI will never replace 3D artist :)
https://x.com/gleb_alexandrov/status/1976382622688543215
>>
>>106851405
A, B, C
Easy as 1, 2, 3
Or simple as Do-Re-Mi
A, B, C, 1, 2, 3, baby, you and me, girl!
>>
File: 1733892382426393.png (398 KB, 750x571)
398 KB
398 KB PNG
what's the recommended version of python for a stable experience genning wan videos of comfy?
>>
>>106851418
I genuinely hope the 2d to 3d ai pipeline gets perfected in the coming years because i have a few vidya ideas i really want to try but i'm stuck at base prototyping because making custom assets is costly.
>>
There used to be something called GauGAN on Nvidia's playground. I loved how ugly it was. Nvidia's Canvas is too "good" -- but every time I try to get one of these numerous GauGAN repos working, I fail miserably. Any advice? Anyone do this successfully?

Altneratively, in the absence of having an RTX card, is it possible to install Canvas on a cloud server?
>>
File: .png (1.37 MB, 848x1488)
1.37 MB
1.37 MB PNG
>>106851418 >>106851445
for the sake of discussion: qwen image edit, kontext, wan and others *already* usually generate the other perspectives you can't see

the techniques to turn it into a textured 3d model are not high resolution enough, but that probably won't be that way for long. maybe it already isn't with a H100/H200.

that said xitter is retarded
>>
>>106851448
Understand that I don't have much to offer you boys but hopefully karma will pay off
>>
>>106851418
i find it amusing that, in order to prove no ai use, 3d artists will post very highres images with multiple angles which in turn makes it easier to train on their style. or theyll post the unbaked versions which also helps in training.
desu no one sane is saying itll replace them, just as digital photography never replaced analog.
>>
When ready

>>106851472
>>106851472
>>106851472
>>106851472
>>
>>106851458
there are still problems with consistency and while it is impressive from a hobbyist perspective it looks like shit compared to having objects in blender. I feel like endgame AI image/video gen will be a frontend agent for a 3d engine that will build a scene with AI generated objects/textures/shaders etc and just render it. 100% consistency, infinite length, much lower compute/memory requirements.
>>
>>106849272
>Koakuma
Not enough love for her
>>
>>106851313
I want to become hentai creator. Give me some good free A.I. to convert pictures to video. No programing, just simple creator for monkey.
>>
File: ComfyUI_00414_.mp4 (2.06 MB, 1024x1024)
2.06 MB
2.06 MB MP4
>>
>>106851740
what a goofy resolution
>>
File: ComfyUI_00415_.mp4 (2.61 MB, 720x1280)
2.61 MB
2.61 MB MP4
>>
>>106849825
That is jus negro american thing, here blacks like cocos and fish
>>
File: NetaYume3vs35.jpg (3.36 MB, 2512x1712)
3.36 MB
3.36 MB JPG
The difference between NetaYume 3.0 and 3.5 is a bit harder to define than 2.0 Plus vs 3.0, but I do think 3.5 is another modest improvement overall. Main thing I've noticed is eye proportions for both male and female characters make a bit more sense in 3.5, and it adds some nice relevant details in appropriate contexts where 3.0 didn't, like the sword here. Prompt (sans boilerplate / neg) was just `masterpiece, best quality, very aesthetic, a 2d digital anime illustration of a samurai warrior in traditional armor, standing in a cherry blossom garden.`
>>
File: ComfyUI_00416_.mp4 (878 KB, 1280x720)
878 KB
878 KB MP4
>>
>>106851740
>yap yap yap yap yap
>>
>>106852671
always.. i hate that so much
>>
File: ComfyUI_00417_.mp4 (937 KB, 1280x720)
937 KB
937 KB MP4
>>
File: ComfyUI_00419_.mp4 (638 KB, 1280x720)
638 KB
638 KB MP4
y so ded



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.