Discussion of Free and Open Source Text-to-Image/Video ModelsPrev:>>107356595https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/kohya-ss/sd-scriptshttps://github.com/tdrussell/diffusion-pipehttps://github.com/ostris/ai-toolkit>Zhttps://comfyanonymous.github.io/ComfyUI_examples/z_image/https://huggingface.co/Tongyi-MAI/Z-Image-Turbo>WanXhttps://comfyanonymous.github.io/ComfyUI_examples/wan22/>NetaYumehttps://civitai.com/models/1790792?modelVersionId=2298660https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>Illustrioushttps://rentry.org/comfyui_guide_1girlhttps://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
Surely zimage is running a form of DyPe natively, right?If not, the DyPe devs surely can make it work since zimage and flux seems to work similar.
1girl, naked
How did the chinks make a distilled/turbo model before making base model? Makes no sense
>>107358401make != releaseDumbass
DyKe
>>107358401casual racism outside of /b/ is too edgy4me
>>107358368No matter how much you spam this board, no matter how many models you shitters get, it will never get as good as real art and you will never fill the void with slopping computer-hallucinated garbage. Soon the novelty won't be enough and you're going to kill yourselves.
>>107358401 common ownership of the means of production
>>107358425We heard you the first time Sam
>>107358425An hero soon, artcuck
>>107358425>Soon the novelty won't...4 more years lil bro
Friendly reminder to support based and uncensored Chinese local models against censored API menace and "safe" western slop.>>107358401Because they made the base first and then distilled it? They just released the base first.If they are still training, I assume they would do some finetuning or reinforcement learning, not training from scratch,But more likely they are just running some tests or whatever.>>107358425Literally no one here gives a shit about it being real art seething retard.We just like making shitposts and cooming.
>>107358425rent free lmao >>107358295
>107358425You sound really upset, have you tried learning about new things instead of getting angry at them?
>>107358445*They just released the distill first.
>>107358445>If they are still training, I assume they would do some finetuning or reinforcement learningthen what they will release can't be called "base" anymore?
>>107358388>taking panels through two llms and zimage
>>107358425Hey buddy I think you got the wrong door, xitter is two blocks down.
>>107358445>no one here gives a shit about it being real art/ldg is the vanguard of new art. society will think of it as "early classic" Trump era renaissance art.
>>107358472>Calling all space niggas
>>107358463Maybe the main model was lacking in artistic stuff, and they wanted the distill to be realism oriented, so they decided to distill and release the model first before further finetuning the model on artistic stuff? Or some shit like that.I dunno could be a lot of reasons, I don't see the conspiracy.
>>107358425kek sloppers really get mad about the truth
>>107358445I will support based uncensored Chinese API models
>>107358472this model has a killer combo, realism + details, impressive for a 6b model
>>107358425>the novelty won't be enough200k+ generated and 10k saved, the more time goes on the more ideas i have and the more ideas i get the more models i have to test those ideas on.If I don't feel like spending much time on finding a good balance of settings on a new idea I load up some of my older favourite images and gen a few hundred similar images with a new twist or model.Or taking one of the thousand saved images online, push them through a VLLM to get a few different descriptions of it, and then feed them into multiple different models to see what they come up with.Any image or video model or lora that comes out for it means I regen all my characters and past prompts with those which takes days.And this is all before we even have a proper text2edit model without a VAE that is gonna destroy the barrier of entry for getting the exact image you want without worrying too much about even prompting anymore.Unless you are a brainlet with no imagination having access to create anything can't ever become boring.
>>107358315Nice
>>107358388I need to work on my light lora combos..
>>107358566man he is literally me
>>107358425lolol
>>107358425This isnt the correct take. The correct take is that art itself peaked with ILLUSTRIOUS and NOOBAI.Remain underfed so called "artist".
>>107358580>1072x1072wtf, what gpu are you rocking anon? how is it so expressive? new wan movement lora?
>>107358598all illu gens have the same shit shading
>>107358605shitmix issue
>>107358566desu the effect is not really strong, but once again I shouldn't expect this kind of shit on the non base model
>>107358580I guess gradient backgrounds just enhance the color/brightness shift no matter what.>>107358603Painteri2v, fill out the prompt."the woman looking at the viewer and smiles while closing her eyes tilting her head and then opens her eyes and looks at the hamburger that she is holding with both of her hands and fingers and then opens her mouth wide and begins to eat the hamburger as she puts it towards her mouth and with each of the several bites she takes of the burger pieces disappear from the burger with bite marks as crumbles of bread fall from the burger and her mouth and a ketchup mark is left staining her cheek and she continues to eat the burger violently as she shakes her head and eventually the entire burger has been devoured by the toman and she leans back and lets out a loud burp as her lips shake from the force of the burp."Unending sentence, no commas, no periods.
vramletbros should try out SDNQ https://github.com/erosDiffusion/ComfyUI-ZImageDit this one worked for mehttps://github.com/EnragedAntelope/comfyui-sdnq seems brokenquality is pretty much the same
>>107358670>Painteri2vholy shit thanks!
>>107358596>got half the shit wrong>didn't even look at the piece of garbage he generated>still posts his slopSasuga saar
>>107358558Didn't read this ran vomit.
assuming I don't have dGPU, is it futile to run something like this?
>>107358705Yes.
>>107358673>SDNQ in cumfartactually nice, I want the generic one tho, ill check it out
>>107358705I look like this
>>107358425imagine spendin basically your whole life on drawing multiple hours a day to land work for shit money as a cog in some globohomo company just so you can at least draw for a living and then almost overnight ai automated drawing before it even automated programming, and it can create images that would take you a lifetime of practice and time to do, that now any normie can get in 5 seconds for freebrooooooooooootal life
>>107358753I don't know but you sound like you have a massive learning disability.
me personally i dont feel bad for artists, but i wouldnt wish whats happening to them to happen to other people, i just dont give a shit and ill keep on cooming with local image modelsyou'll never stop me
>>107358673Was the catch for unofficial SVD implementations is that they lack the fused kernel or whatever so you get less quality/performance?Not like I really need it for 6B though.Still waiting for Wan 2.2 copechaku implementation however.
so sick of ranfag being a drama nigger
>>107358766draw this in 5 seconds paintpiggie >>107358763oh, you cant? rip
>>107358763>it starts turning into hebrew halfway insounds about right kek
have y'all folx found a solution for translating english prompts to chinese in comfy?
>>107358775>less performance?yes, there is no speedup for me even though i have to offload bf16but the quality dropoff isnt EXTREMEonly good for vramlets maybe, pretty sure SDNQ supports older pre-RTX cards
>>107358795my chinese gf does it for me
>>107358775>Still waiting for Wan 2.2 copechaku implementation however.https://huggingface.co/wanvideoquant right on time
>>107358806how's her honeypot gig going?
>>107358815>its himllmxy bros... WE WONNED?
OOOH AHAHAHAHAHAHAHAH FLUX KEKS ITS FUCKING OVER AHHAHAHAAHHAWE ARE FUCKING BACKI KNEEL XII FUCKING KNEEL XIhttps://github.com/nunchaku-tech/nunchaku/issues/809https://github.com/nunchaku-tech/nunchaku/issues/809https://github.com/nunchaku-tech/nunchaku/issues/809
>>107358823but FLUX needs nunchaku way more than Z-image
>>1073588232026 will be year of the pooh
this time next year we'll all be speaking chinese
>>107358823man I just checked and this faggot still didnt merge in official qwen loras support.not that we need it anymore thanks to ZIT.
I managed to integrate the prompt enhancer part on ComfyUi, I'm currently using Qwen 3 VL 4b instruct (there's a thinking version on the list but when I use it it also adds the thinking part on the prompt so Idk what to make of that)I started with this short vague ass prompt>Hatsune Miku eating popcorn while skateboarding, depicted in the visual style of a PlayStation 1–era game screen, some game Uiand it gave me something detailled, that's pretty coolI also used the official system prompt for Z-image so we're in the clearhttps://huggingface.co/spaces/Tongyi-MAI/Z-Image-Turbo/blob/main/pe.pyyou can test it out herehttps://files.catbox.moe/mzo0r6.jsonPS: Update your transformer's package or else it won't regognize Qwen 3 instructPS2: Install flash attention so that it gets faster
>llama 3 70b tunes are still the best for rpam i missing something? i know moe is all the rage now, but smaller models suck ass compared to a dense 70b. even deepseek has more isms than l3 70b. what are you guys using now that would be better?
>>107358580I was just using the light 1030 high model (not the lora) and it definitely had faster motion than the moe lora, at least for my sexy massage gens.
>>107358840to the victor go the spoils
>>107358815>nvfp4Welp, still waiting.
>>107358856very nice! thank you for sharing it anon <3
>>107358865/lmg/ is that way
>>107358865>>/g/lmg
Arr rook same.
>>107358425half this thread are actual pedophiles by the waynot lolifag type, like actually jerk off to kids type
>>107358913I wish they stayed in sdg
>>107358931They created /ldg/ though.
>>107358605Nope
>>107358425This would have made sense if we never moved on from the flux plasticslop era. But we finally just did that.
>>107358931>obsessed
>>107358938härkönen
>>107358938kek
>>107358954cummed to
>>107358949>posting the same image for 3 years straight
>>107358905>Arr rook same.Art imitates life.
>>107358856>PS2: Install flash attention so that it gets fasteryou can get it here for the windows fagshttps://huggingface.co/lldacing/flash-attention-windows-wheel/tree/mainFor pircel the prompt was>A magazine cover featuring Hatsune Miku involved in drug trafficking.
>>107358856Nice, I tried to figure out how to do this earlier but settles on manually doing things.
>>107358949who is living rent free in your mind schizo?
>>107359002ment to link >>107358974
lul
>>107359014>>107359019braindead retard kek
>>107359002Stop posting these trash images. Isn't your discord enough for you? Ask nigbo to join you, you can stroke eachother off 24/7 then.
why is ran spamming this thread
>>107359029Who is nigbo? Why are you so mentally ill?
>>107359002
>>107359047Why is it so blurry?
>the netayume bugmen are seething at SDXL chads again
>>107358856>8-bit NES style depicting Donald Trump battling 2B from Nier Automata, user interface elements, dramatic effects, particle effects, dynamic poses
>>107358905>using anons simple prompt > LLM rewriting the prompt into a bigger one > prompt read as qwenvl despritive and joycaption booru tags merging into the same prompt > samplerHNNNNNNNNNNNNNG
>niggerjak tricked into downloading cumfart again because she's illiterate and doesn't know forge has it
>>107359055>>107359075>>107359090samefag pajeet
>>107359054netayume niggas perpetually malding
>>107358856>An illustration book describing how a model called "Z-Image turbo" managed to be more popular and loved than a model called "Flux 2 dev"kek
The miracle of Chinese SOTA only did so much to keep the XL sloppas at bay... They've finally returned but for how long?
>>107358856Any way I can customize samplers of the enhancer? (Temperature, min p, etc)
>>107359127yes, with the qwenvl advanced node
>>107359137Thanks I will give this a shot.
welp this model going to taken down and censored
>An ultrasharp, high-resolution aerial photograph looking straight down (top-down view) onto the Shibuya Scramble Crossing. The focus is laser-sharp on a beautiful Caucasian woman with a radiant, genuine smile, who is waving up at the camera. I can't get her in the middle :(
>hes still samefagging
fuck this nigger lumina 2 ass model doesn't work in fp16 dtype
Why is everyone pretending like this wasn't known information since the first hour the model was available?
>>107359119kek, that one is good
why is he so fucking mad
>>107358866>>107359055WTF????
Can I show the prompt between these two with a node? Like a Show Any sort of deal?
portrait model btw
>>107359237Portrait of a landscape, yeah?
>>107359237is this where teletubbies from?
>>107358856>Doesn't work for NSFWFuck, it doesn't even want to describe a 2d pic as if it was an human
>>107359237Neat
Is there a "batch load prompts from all images in the folder and gen with those prompts in the current workflow node"?Whats the best way to achieve this?
>anons getting 3 day bans for replying to sfw ai generated imageryit's so grim
>>1073592863 day? It's a perma ban.
>>107359262oof, is there some uncucked qwenvl finetunes there?
>>107359290You don't get permas if it's AI
>>107358856>Create a four-panel manga explaining the concept of gravity.what he did was dangerous though lool
>>107359229Kinda figured it out. But I am locked out of editing the prompt. Can I have this automated process be sent to the final prompt node but it stops to not generate, allowing me to adjust the prompt manually?
>>107359300I literally got a permaban message for merely replying to an image lol
>>107359286just dont be a pedo, simple
How are there still no loras? It’s been a week, hasn’t it?
>>107359316kys nigga it hasn't even been 48 hoursporn has warped your brain
>>107359313seriously.. im fucking glad they're banned child rapists and wannabes
>>107359237landscape model btw
>>107359330hot
>>107359324so much this. trans rights are human rights btw
>>107359337shouldnt trannies be your friends given they are the highest likelyhood of being a pedo like you doe? uh oh
>>107359330Kinda impressive that this works at all.
Local Diffusion
don't do the crimeif you can't do the time
>Z doesn't understand facesitting.it's over
>>107359453base model when
>>107359471possibly in 2027
>>107359471it's still cooking
I wonder if with z edit it will be worth to vibe code a photoshop clone
How much vram do I need to train loras?
I wanted longcat but this is fine too.
>>1073594906
>>107359471Sunday
>>107359547>Sunday
>>107359446is this reverse psychology or something?a photo of a woman, sitting on top of a man's face on a bed. the man is lying below her, his head covered entirely by the woman's buttocks.
ostris posted, i'm training a lora right NOW
>>107359591>ostris posted, i'm training a lora right NOWthis shit will be useless in 2 days when base will be released lool
>>107359490Depends on what model that is.I am also guessing it doesn't really support that resolution.
>>1073595066 what?
>>107359506>>1073596136 7
tried to train a lora with the fork of a literal pajeet and it did nothinghttps://github.com/pyros-projects/diffusion-pipewhy do I keep falling for these scammers, I just wasted money on renting a gpu setting everything up and the loras don't work, I don't know why I even tried this, there are literally no other loras too lol
>>107359622Now generate this image with Z-image
>>1073596136 vrams
>>107359622kek
>>107359639
Nano Banana Pro vs Z Image Turbo, who wins?
>>107359605Chroma, I'll just train 1024x1024>>107359605:3
>>107359592>implying i carei'm posting on /g/ while the sun is out, you think anything i do matters?
>>107359687nano banana's jacket doesnt exist, too weird looking
>>107359687Z-image didn't do the expression
>>107359687nano wins since you can tell z only uses chinese background in its datasetz wins since people can run it so it's good enough
>>107359687Nano looks off but can't put my finger on why.
>>107359719the lighting looks like its shot in a studio
>>107359719nigger is smiling
>>107359622
>training a lora using a distilled model
>>107359687zutt wins because nigger is jaywalking (trve)
Still haven't figured out why it takes so long while guides say it's a matter of a few seconds.
>>107359719Runpod fags and locals cant compete with Nano and Grok imagine
>>107359777coz its using reasoning
>>107359777how long does it take>>107359784no he's using the instruct model not the reasoning model
>>107359782damn, this was made with nano banana pro?
>>107359782back in your cage cuck
>>107359784>>107359787You can see it above the node, 108s, rtx 5090.
buy an ad
>>107359632>I just wasted money on renting a gpuAAAAAHAHAHAHAHAHAHAHAHAAHAHAHAHAAHAHAHAHAHAAHAHAHAHAAHAHAHAHAAHAHAHAHAHAAHAHAHAHAAHAHAHAHAAHAHAHAHAHAAHAHAHAHAAHAHAHAHAAHAHAHAHAHAAHAHAHAHAAHAHAHAHAAHAHAHAHAHAAHAHAHAHAAHAHAHAHAAHAHAHAHAHAAHAHAHAHAAHAHAHAHAAHAHAHAHAHAAHAHAHAHAAHAHAHAHAAHAHAHAHAHAAHAHAHAHAAHAHAHAHAAHAHAHAHAHAAHAHAHAHAAHAHAHAHAAHAHAHAHAHAAHAHAHAHAAHAHAHAHAAHAHAHAHAHAAHAHAHAHAAHAHAHAHAAHAHAHAHAHAAHAHAHAHAAHAHAHAHAAHAHAHAHAHAAHAHAHAHAAHAHAHAHAAHAHAHAHAHAAHAHAHAHAAHAHAHAHAAHAHAHAHAHAAHAHAHAHAAHAHAHAHAAHAHAHAHAHAAHAHAHAHAAHAHAHAHAAHAHAHAHAHAAHAHAHAHAAHAHAHAHAAHAHAHAHAHA
>>107359807lmao, and this is with flash attention 2?
anyone trying?
>>107359782can grok do this?>>107358866>>107359055
>>107359812wait people do that really? LMAOYou can rent my GPU >>107359632 ;)
my whole identity is me owning gpui have literally nothing else happening in my life :( anyone else like this?
>>107359815Yeah, guide was using like a xx60 card.
>>107359802Yes i have been making tv seethe all week
>>107359820I'm sorry, I can't comply with that request.
>>107359827based, gotta remind them how AI will replace hollycuck
>>107359782>>107359827I've already seen these on twitter
>>107359825The system detected potentially unsafe content. Please try again later or adjust the prompt :)
>>107359825yes, me, i have a poorfag gpu and i get really insecure when people shit on vramlets itt
>>107359782>>107359827Z-image edit could unironically have this level if it was a 14b model instead of 6
>>107359844really lol i made these particular ones.
>>107359855fat bfl fingers typed this
>>107359825my whole identity is local modelsi feel you anon
>>107359825Don't worry, you are based and redpilled. You can generate all the child porn as fast as you like you want unlike all the api cucks and vramlets.
>>107359782it did its best, it can't do old images with the grain unfortunately
>>107359817
>>107359855Burgers cannot comprehend the slimness of the Chinaman
>>107359687Wan 2.2 T2I + distill lora
>>107359883anything special you need to do? setting up my venv right now
>>107358856>having to use Qwen 3 vl 4b for encoding the text>having to use Qwen 3 vl 4b instruct to rewrite the promptwhy do I feel this is retarded, if the text encoder was the instruct model it could've do the both of them
>>107359894looks good 8B or 16B?
>>107359894>overbright plastic shitlool
>>107359906Wan2.2 AT2V 14b low noise modelQ8
>>107359782>>107359827just you see anon! this is how z-image base will look like
>>107359894how many steps
>>107359899nope, so far so good
>>107359931white girl lora?
>>107359931what resolution are you training on to reach 15.4gb? block swapping?
which node to extract a clean prompt string from an image?
>>107359944>>107359931nevermind i just noticed. pictures for ants
>>107359931are you using ramflow by chroma?
>>107359894he's literally me
>>107359894giff workflow anonman
>>107359827:(
>>107359880maybe with image edit it will be better when it comes out, i used a image and said add crew with 80s equipment and grain and etc.
howdy Z bros
>>107359940angel youngs body type lora, still have the dataset handy from flux>>107359962no sense in training a huge lora if it doesn't work>>107359969nope. i think 24GB should be fine
>>107359218Nice!
>>107359994What scheduler sampler
>>107359894The lighting is absolute shit.
>>107360003euler bong tangent
>tfw ukrainian>the model is literally called ZI hate chinese so fucking much
>>107359827
>>107360071>Soul>Souless
>>107360071>original vs netflix remakethe woman is literally black on Z-image turbo lmao
>>107360071May I suggest this node
>>107360069Named after agent z
The fact that you can even compare them at all is funny
>>107360071this shows that even though z-image is really realistic it's still a little bit slopped, can't wait to see if the base model will improve on that
>750/3000 steps>body, tattoos and hair already learntwhat the fuk
>>107360120Are you training on 24vram or cloud?
>>107360120>what the fukand this is only the turbo model, the base model will learn this shit even faster
>>107360118define slopped
Prompt i used in case any turb cucks want to try is1994 Paramount Pictures soundstage, medium close-up behind-the-scenes photograph during filming of Star Trek: Generations, exact same framing and lighting direction as reference image but now shot from only a few feet behind the camera, Panavision Panaflex Platinum 35mm camera with anamorphic lens very close in foreground, camera operator's hands visible on follow focus, cinematographer leaning in, Rick Berman or Jonathan Frakes standing right next to camera watching small CRT video assist monitor, actors clearly visible and large in frame, 1990s crew in polo shirts khakis bandanas Nike sneakers fanny packs, Mole-Richardson 10K and Dino lights close by, C-stands and sandbags in foreground, thick
>>107360130define deez nuts
>>107359812yes I wasted a whole 2 bucks, nvm I'm trying ai-toolkit now
>>107360137>pircellmao, did you ask the model to zoom out from the original image?
>>107360069
ai-toolkit released support to train z-image loras has anyone tried?
>>107358557>>107358472What model is that? On a work trip so stuck phonelurking
>>107360153did you not read the thread? someone is training something right now
>>107360153check the thread you peabrain
>>107360137>1994 Paramount Pictures soundstage, medium close-up behind-the-scenes photograph during filming of Star Trek: Generations, exact same framing and lighting direction as reference image but now shot from only a few feet behind the camera, Panavision Panaflex Platinum 35mm camera with anamorphic lens very close in foreground, camera operator's hands visible on follow focus, cinematographer leaning in, Rick Berman or Jonathan Frakes standing right next to camera watching small CRT video assist monitor, actors clearly visible and large in frame, 1990s crew in polo shirts khakis bandanas Nike sneakers fanny packs, Mole-Richardson 10K and Dino lights close by, C-stands and sandbags in foreground, thick:(>>107360155Z-image turbohttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo
>>107358580Can you make the same video with Frieren?
>>107360158very helpful anon wow incredible why didnt i think of that? theres zero documentation on how to do shit how about someone shares
>>107360058Z is godly
>>107360150yea just upload any image and say behind the scene image, or medium close up and etc.
It does actually feel like the model gets better at understanding expressions and emotions if you translate the prompt to chinese first.
>>10736012824GB 3090 and i can sorta gen in comfy while i train the fucking lora?> https://files.catbox.moe/3j1t21.png
>>107360180can't wait to try that on z-image edit
>>107360120have you trained on nudes?
>>107360185oh hi! arent u the anon that made the hailey rose lora?
>>107360197dataset is nudes with face removed so we'll see >>107360185low-res test gen in catbox but we're very early in training
Is sage and flash attention two different versions that can be installed at the same time, or does one take over the other?
>>107360179
>I changed the order of the prompt by translating to ching chong language sigh
So what i get about this model is that my entire library of loras will be obsolete within few months, freeing a shit load of space on my PC.
>>107360229my thoughts exactly
https://xcancel.com/bdsqlsz/status/1994336717587845601#mhmm...
>>107360206yes sir
>>107360219
>>107360249Yes, its distilled alright, seems like its already destroying the quality of the modelI'm starting now too
>>107360229That AK/pistol hybrid is absurd but I love it.
>>107360267>seems like its already destroying the quality of the modelyes, that's why it's essential to get a good base model, just so that we can actually train it
>>107360273It defaults to AK every time you prompt a gun without specifying it. Sovl.
Is there any way to gen a video from image on 1660S, even if it takes a long time?
>>107358368>Z-Image-TurboI wish it could generate dicks
>>107360252>>107360249good to see you back here! been a while
>>107360229Yes, unless it trains like shit, this model will own the entire image gen market
>>107360284the end game is to train a lora of your own dick
A "distilled" high step model would be better than Z-Image-Base and I assume they're training just that
>>107360320no its a non distilled base model which means many here will seethe because itll be more difficult to get good images. on the upside, presumably, the quality ceiling of base will be higher than turbo
>>107360267good luck with the training. i wouldn't take the sample images as any indication of final outputs though.>>107360290thanks, i've been going hard dumping all my flux loras to civit. moving to ZIT, since Flux.2 is way too big for me to train locally.
>>107360320no distilled, distillation makes the model too hard to finetune
>>107360340what resolution are you training on, im training at 768 and 1024 but seems overkill, its going to take forevermy_first_lora_v1: 2%|#3 | 53/3000 [07:05<5:28:13, 6.68s/it, lr: 1.0e-04 loss: 1.275e-01
>>107360290>>107360340yes have been following you there. interesting to see how your bake will go on this one man!
Emoji sort of work.
>>107359980Why is the res so low?
>>107360337>which means many here will seethe because itll be more difficult to get good imagesAnon will almost immediately claim its DOA because their 1:1 comparison with their prompt and settings tuned for Turbo will look like ass on base
>>107360366based
>>107360366lol
>>107360366use facemask emoji
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo/discussions/26
Fresh>>107360388>>107360388>>107360388>>107360388
>>107360366lmao thats crazy
>>107360358training this one at 512 just to get it out the door, then i'll attempt higher ones and fine tuning settings depending on what i get
>>107359719he is not crossing in a designated place
>>107360279Yep, with grok imagine or sora 2
>>107358466How did you make this? Didn't realize AI could make cute feet like girl on the left kek
>>107360366sort of? looks like they work 100% to me