[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: archivist.jpg (103 KB, 672x936)
103 KB
103 KB JPG
Fuck it, I'm gonna start saving stuff. Better late than never.
I'm just going to get the biggest HDD/SSD I can find and start filling it up. Is there anything I need to know beyond that? Because if there is then I don't.
>>
Isn't that what AI does though? Just archive things? Isn't that what you're doing right now? Isn't it the same thing?
>>
I once heard someone say something about eggs and baskets
>>
File: 1748549203578232.jpg (177 KB, 400x566)
177 KB
177 KB JPG
>>108483313
AI won.
>>
>>108483313
No, not even a little.
>>
File: 1772170137402855.webm (1.53 MB, 800x600)
1.53 MB
1.53 MB WEBM
>>108483306
If you have a decent CPU (ie 8-cores or more) then you should consider using AVIF/AV1 to encode images/video at low quality settings (ie Q40/CRF 50) so you can fit them on cheap 16GB (or smaller) flash drives before you stick them in your fridge.

This gives you a "better than nothing" scenario in case of ransomware/physical computer theft outcome. You'll hate yourself for stooping to YIFY quality but you'll hate yourself even more if your giant external HDD suddenly dies and takes out everything you care about.
>>
>>108483372
WHY 16GB capacity flash drives (or smaller)? The reasoning goes that as SSD density went up, their longevity took a nosedive as the NAND traps became smaller and smaller.
>>
>>108483306
what i did was buy a JBOD array on ali, hook it up to a rpi3b i had laying around, and filled a transmission server up with everything i wanted to archive.
you can get fancy with raid and backups, i'm not too concerned about a drive failing though.
throughput rate on the pi is pretty garbage, but it works for stuff i'm not going to care about until long after it's finished downloading.
>>
>>108483372
>>108483387
This is good, just keep in mind not all processors support hardware decoding for Av1/AvIF. And software decoding will have your device maxing out the CPU.
>>
>>108483417
Not if it has 8-cores (or more). Sucks that humanity is still like a decade away from having AVIF/AV1 be viable to the poors who are using like dual-core celeron computers right now but if you can encode AVIF/AV1 quickly then you can decode it just fine.
>>
>>108483372
>>108483416
You're speaking in tongues to me. I trust it's good advice, but if I make it too complicated I know I'll end up not doing anything at all.
>>
>>108483440
Basically you know how YIFY makes movies really small? If you did that but used new compression technology, you could make them even smaller.

Small enough to fit on many cheap 16GB flash drives. Throw them into your fridge and now people stealing them or losing them to hackers is very very unlikely.
>>
>>108483306
It's junk without actual relevance to your life.
Touch actual grass.
>>
>>108483470
I'm not at all worried about getting robbed, and someone hacking my storage sounds really unlikely. And I'm not so strapped for cash I need to buy little 16GB drives. I've already got a couple 2TB HDD's laying around.
I'll keep this in mind, though.
>>
>>108483306
you need a raid/nas so a drive failure won't kill your data
>>
>>108483505
I'll keep that in mind too. Are drive failures that common with external storage?
>>
>>108483553
you should plan on a drive having a roughly 2% chance to fail in a given year
>>
>>108483553
Think about it this way. It will fail eventually. So is the plan to curate all this data, and then keep it for only as long as the drive will last? Maybe it'll fail after a year, maybe 15 years, but it won't live forever. Archiving data is an active process that involves maintaining the data on more than one media. Even having just 2 copies is infinitely better than 1 if you buy a new drive for backup as soon as one drive fails. There is risk both drives will fail within the same window, but hopefully you see the principle here: one is none, two is one
>>
>>108483577
>>108483597
I guess that's wise. God dammit, there's too much to worry about.
>>
>>108483616
There are boxes like synology where you put in multiple drives and it figures everything out for you. Or you can put multiple drives in an old PC and find some software to use on it.
>>
>>108483649
Is there any reason to not just get two storage drives and save stuff to both of them? Or is this nas/raid thing just automating that?
>>
>>108483306
>is there anything I need to know
Yeah, 2UU for a 1/1 that taps to draw a single card is pretty inefficient card advantage. You can do a lot better
>>
>>108483306
You can start small, but keep backups. Don't be a fag and run a huge RAID0 and lose everything when a single disk shits the bed.
>>
>>108484464
But getting two draws a turn instead of one is dope. Notice it doesn't say "sacrifice this", so you can repeat it until your opponent destroys it, and even then he'd be wasting a sorcery for that.
>>
>>108483306
On HDD/SSD, use a file system that handles bit rot like btrfs. If you archive to optical disc (recommended), dont forget to put parity files on them. I never actually have bitrot on optical discs (I have been doing it since I was a kid, got some 25 years old discs without hash errors at this point) but better be prepared.
>>
>>108483306
congrats on picking the worst time to start
>>
>>108483372
>>108483436
>>108483417
>oy vey pollute the internet with shitty re-encodes for jewgle

how long have you been shilling Tel Aviv and samefagging for, Daiz? where's your trip code?
>>
>>108485797
You have very poor reading comprehension.
>>
>>108483387
In theory - files stored on multiple, smaller drives = you lose less in case of one of them failing.

If one of 20 16GB drives will fail, you lose only small portion of your collection. If that ~320GB will be on one disk, it's failure will be much worse.
Obviously there's limit in how practical is this result, but for smaller, emergency store with only most precious of data this is the way to go.
>>
>>108483306
yt-dlp works on most video hosting sites, some require configuration like passing through a cookies file which just means copy pasting the cookies from your browser into a txt file
If you want a config file automatically give you the best audio and video track then combine them into an mkv unironically ask the free google AI widget they're clearly scraping github documentation for popular projects regularly
gallery-dl works on a surprising number of sites if you want to mass grab images and can be run without any config needed on most sites
>>>/t/ actually used to have pretty good (non porn) archival threads no idea what it's like today
If you can afford multiple HDDs (ideally 8+) setup a ZFS RAIDZ2 pool just to have a bit of leniency when it comes to hard drive failure.
Good luck and most importantly if you see something that you could potentially want to see again in the future download it immediately. I've been datahoarding for close to 20 years and I can't tell you how often I saw a video or article, decided fuck it I can archive this later only for it to be taken down within minutes of me watching/reading it.
>>
>>108486086
Oh yeah if you want LOCAL copies of a website, article, whatever instead of relying on some online archiver try httrack.
>>
>>108485949
>>108486030
You ok, Daiz? Why change your trip code so many times and often post without it?

https://desuarchive.org/_/search/text/celery/username/pixdaiz/
https://desuarchive.org/_/search/text/celeron/username/pixdaiz/
https://desuarchive.org/_/search/text/celery/username/ss2/
>>
File: 1766751787546659.png (44 KB, 1080x356)
44 KB
44 KB PNG
>>108486030
Assuming you don't write like 1TB of data to them and blow through their P/E cycles, 16GB drives were built on like 40nm lithography, making controller defects very rare just due to the sheer size of the NAND charge traps . The failure rate especially if refrigerated (slows down electron leakage exponentially) is close to like 0.1 per 100. But yeah, losing 100% of you data across a dozen 16GB flash drives is very unlikely especially since burglars aren't interested in what's in your fridge...

Picrel for anyone interested in encoding AVIF images. 8 cores is the bare minimum. Maybe you can scrape by with 6 cores if you only have a few hundred images at most but I wouldn't really recommend it...
>>
File: 1773404661551752.webm (3.19 MB, 1280x512)
3.19 MB
3.19 MB WEBM
>inb4 "don't le use le avif, it's le bad, because some shizo spammer say's it's LE BAD!"

>80 = very high quality. Distortion not noticeable by an average observer in a side-by-side comparison at 1:1 from a normal viewing distance. This corresponds to the typical output of cjxl -d 1.5 / -q 85 or libjpeg-turbo 4:2:2 quality 85.

https://github.com/cloudinary/ssimulacra2


File size: 254 KB (~29% smaller than JPG)
https://files.catbox.moe/ptvr88.avif
--sharpyuv -s 6 -q 72 -d 10 -y 420 --cicp 1/13/1 -a tune=iq

File size: 284 KB (~20% smaller than JPG)
https://files.catbox.moe/isrw29.jxl
-q 91 -e 7 --override_bitdepth=10

I will add that you'll get shit results if you encode AVIF from GIMP, you have to use libavif directly. I think eventually these parameters will become defaults but better to always use them just in case.
>>
>>108483315
>you have to eat all the eggs
>>
File: RecordsUnread.jpg (102 KB, 570x796)
102 KB
102 KB JPG
>>108483306
I save many things I think important but ultimately archiving things you're not interested in for the sake of archival is a fool's errand.
>>
>>108485462
If you're using it on your turn you're already doing it wrong. You use it at the end of your opponent's turn so that it's untapped during your turn. But there's also better card draw available - sensei's divining top, rhystic study, and insight engine all do it better.
>>
>>108484464
She's from 7th edition. Cut her some slack.

>>108484588
>You can start small, but keep backups.
I think that's the plan. I've gotta get started before I start worrying about fancier things.

>>108485719
>archive to optical disc (recommended)
Why?
>>
File: 1769801847074728.png (251 KB, 850x1023)
251 KB
251 KB PNG
>>108487641
Opticalfags are retards that don't know what happens when you store flash drives inside your refrigerator.
>>
I wonder why the schizo spammer known as Daiz has been posting the same propaganda picture in every thread for almost a decade now instead of the "abundant evidence" that he says gets made every day.
>>
File: 1749828305862566.webm (728 KB, 640x480)
728 KB
728 KB WEBM
>>108487836
Do you have a response to >>108486223 ????

Why is the AVIF image smaller in filesize vs the JXL one? I thought you said AVIF = rancid dogshit and JXL = the final solution to the image format question?
>>
>>108486223
>don't le use le avif, it's le bad, because some shizo spammer say's it's LE BAD!
Whatever his reasoning is, he is right. AVI is trash, designed to make green grass appear not too blocky on a TV screen. It suits sports shows that are streamed very well. It will likely butcher the quality of your stuff because it was designed for a different kind of content.
>>
>>108487799
Hot air trapped there after closing the door would condende on them and the droplets would cause corrosion?
>>
File: 1751777401987854.png (1.05 MB, 1080x1456)
1.05 MB
1.05 MB PNG
>>108487970
At best his reasoning is incredibly outdated just like 4chan using server software from fucking 2012.

>Using libaom's --tune iq or SVT-AV1-PSY's Tune 4,
>AVIF is generally the most efficient image codec on the Web for quality per bit.
>AVIF's quality per bit is generally better than JPEG XL

https://wiki.x266.mov/docs/images/AVIF

If the schizo spammer would take his meds he would at least point out this damming AVIF problem:

>AVIF does not have progressive decode. This is a common weakness of video-based image codecs.
>While there is a hacky way to do progressive AVIF by encoding a low fidelity frame & then a high fidelity frame
>in an animated AVIF at a high framerate so the low fidelity frame is loaded & plays first,
>this is far from ideal for the average user & adds to an already burdensome encoding process.

JXL doesn't need hacks for progressive loading.

>>108487986
Ah yes, the fridge environment known for desiccating the living fuck out of anything not in a ziplock bag is going to corrode a flash drive...
>>
>>108483704
Nas and raid is just automating it, plus it can spread out. So you could have 6 drives with 4 drives worth of storage and 2 drives of redundancy. So if you had 6 10TB drives, you would have 40TB of space you could write to, and you could lose up to 2 disks without losing data. If you lose a disk you just put a new one in and it rebuilds to that drive.
>>
>>108483372
Just burn whatever you want to keep to some m discs
>>
HDDs/SDDs, and RAID, are NOT archives. Those are for files you want to access soon or frequently. DO NOT STORE YOUR SHIT ON FLASH DRIVES as some people in this thread has mentioned. USB flash drives are like SD cards in that they can randomly lose data, and WILL lose data if not regularly plugged in and given power.
If you want to truly archive stuff for the long term, then there's only one really viable solution. Gold archival-grade DVD-Rs. They are literally rated to out-live you, so they will last for as long as you want to keep the data. 4.5GBs per disk, and roughly $2 a disc, but you can shove them in a closet somewhere and know they will still have the data ready to go when you need them.
I've got two 2TB drives on my PC for data I need quick access to, four 1TB drives on my home file server in a mergefs 4TB block for my collection of video files to watch from any computer on my home network, and everything I want to keep long-term is burnt to gold archival-grade DVD-R's. Anything not burnt, I consider to be 'could be lost'.
>>
>>108483313
Sorry for being obvious, but AI is lossy compression with probabilistic retrieval. People usually want to archive stuff in its original form and easily retrieve exactly what they store.
>>
File: 1773109447913423.jpg (49 KB, 750x563)
49 KB
49 KB JPG
>>108490062
That's why I said to store flash drives in your refrigerator NOT your oven/hot car garage. Refrigerated NAND is arguably on par with LTO tape.

>Colder storage temperatures significantly improve SSD data retention for unpowered drives by slowing down electron leakage from flash cells.
>While high temperatures accelerate data decay, keeping SSDs in a cool environment can increase data retention from days to decades.
>keeping SSDs in a cool environment can increase data retention from days to decades.
>increase data retention from days to decades.
>>
File: 1761195786802409.png (680 KB, 1920x1080)
680 KB
680 KB PNG
>>108490062
Also you're confusing DVDs with blu-rays. DVDs are EXTREMELY DOGSHIT for long term data storage. Some can develop pinholes even in storage away from sunlight because they use an organic layer that just fucking rots whenever it feels like it lol.
>>
>>108483306
Bravo! Welcome to the club! Last summer I bought 16TB external HDD and started collecting hilichurl and goblin porn, also started getting into pokemons recently. Maybe you should know about rsync in case you want to have extra backups on other disks or remote servers.
>>
File: 1750277566888312.png (309 KB, 460x624)
309 KB
309 KB PNG
>>108489828
I know that on paper this should be the best option but it's just such a huge hassle, like you have to baby them around else 1 small scratch = GBs of data corrupted.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.