[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/t/ - Torrents

[Advertise on 4chan]

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • There are 65 posters in this thread.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


File: file.png (70 KB, 499x367)
70 KB
70 KB PNG
This is a collection of all the HTML files from yuki.la as of Feb 2021, packaged by Anonymous !wJbF8ZWUxk.

It was a process of scraping over multiple months and completed nearly the time that yuki.la vanished for good.

Each board is collected into 2 tar.gz files, containing either the entirety of a board's HTML files, or a portion. After each file is extracted, you will have all the files of the board.

Unfortunately, no thumbnails or images were saved before yuki.la went offline, but it is possible to find images using other archive websites.

If you'd like to support my efforts, you can donate ETH to: 0xD952EeD3a5f10A891f962f9bC411fFa41272F78A

More information regarding 4chan archives can be found at https://bibanon.org/

Cheers! I hope you find this collection useful.

magnet:?xt=urn:btih:42b3aa33de93b705ef1074776b27f50765891b86&dn=yuki.la&tr=udp%3a%2f%2ftracker.openbittorrent.com%3a6969&tr=udp%3a%2f%2ftracker.opentrackr.org%3a1337%2fannounce
>>
Shit, I've been putting off setting up my seedbox for months, but now this is something that's really worth spending the effort for
>>
Are you from archive.is LPySg?
>>
>>1033298
Yes
>>
Lack of images suck. Request section has continuous stream of hot girl pics. I'm sure someone has huge archieve of them. That kind sould should share magnet of them.
>>
>>1033260
Nice anon
>>
>>1033260
You're a hero, OP.
>>
>>1033260
I love you archive kun
>>
bump
>>
>>1033260
fuck bibanon. i hope that they all get suicide bombed
>>
>>1033260
>Unfortunately, no thumbnails or images were saved before yuki.la went offline, but it is possible to find images using other archive websites.
then it's worthless
you should have at least saved image urls from the site, i think it was using imgur as a front end for image hosting.

pray to god somebody has a solution to undo the loss.
>>
>>1034377
fuck off discord goon
>>
>>1033260
you do know there's archived.moe and https://archive.wakarimasen.moe archiving everything? latter even having full search?
only issue is they are not so old

archiveofsins.com has /t/ since 5years+
>>
>>1034400
The image urls are saved you fuck. It's the raw HTML from the threads.
>>
>>1034446
>archived.moe
-outsources links from other archive
-most of the search is unavailable aside slow or coom boards
>https://archive.wakarimasen.moe
-another fresh FoolzFuuka archive that hasn't been around for years and could kick the bucket any time like the previous ones
-also doesn't have any of the old stuff yuki.la had either
>archiveofsins.com
unrelated
>>
>>1033260
>If you'd like to support my efforts, you can donate ETH to: 0xD952EeD3a5f10A891f962f9bC411fFa41272F78A

No one cares about your shitty efforts.
Go beg elsewhere faggot
>>
>>1034622
you have to be over 18 to use this website
>>
>>1034622
Iunno there's plenty of peers on this torrent.
>>
bruh it took 16 hours to decompress /a/
though my HDD is ship
>>
>>1033260
Do I have to download the whole collection or can I choose what files I want?
>>
>>1035206
choose
but make sure you have a lot of space
/a/ alone might be over 300-200 gb
>>
File: file.png (12 KB, 1362x34)
12 KB
12 KB PNG
>>1034400
>>
File: file.png (29 KB, 948x434)
29 KB
29 KB PNG
>>1034400
>>
>>1035246
>>1035256
I knew i was wrong
was thinking of (ugh) 4chanarchive
>>
>no images
>400+ GB

wot
>>
>>1035310
4chan has had literally billions of posts, it adds up
>>
>>1034622
lol you want a medal or something with that post?
>>
>>1033260
What happened to yukila and will it ever come back?
>>
>>1035980
No one really knows, and probably not.
>>
>>1036936
It's really a shame how inconsiderate some of those archive owners are.

They know how valuable the info on those sites are to some people and while we can't force them to keep doing something they don't want to/can't afford to do they should at least have the decency to dump the data somewhere and not just disappear without a trace like that

Also I am pretty some of them they get tons of donations anyways so it's not like they are simply doing people a favor.
>>
>>1036936
Ah, well, thanks anyways. What I'm having to do now is search for stuff in that torrent collection, get the thread numbers, then look those threads up on archived.moe. Even then it's still not as good because it only has thumbnails instead of yukila's saved images.
>>
What are our alternatives? wakarimasen is pretty good, but it lacks a lot of old stuff
>>
>>1033260
Bump
>>
Bump for epic thread
>>
>>1033260
I unpacked the archives and fuck. there are so many html files that my computer will take weeks to transfer them onto my external hard drive...
>>
>>1038959
why didn't you download them directly to your external hard drive?
>>
>>1038959
Yes it's a huge collection. It took almost half a year to complete the download.
>>
>>1033260
holy shit I can't believe I didn't see this until now! thank you so much anon with that weird apostrophe thing that isn't actually an apostrophe at least in english
I also recall you saying that you would upload it to archive.org when it was finished zipping. Is that still in the plans? Or am I just retarded and can't find it?
>>
>>1039081
Yes, I have tried uploading to IA but kept getting disconnected by their servers. Filesize may be too big, but I wanted to get it into multiple hands as soon as possible so a torrent was what came to mind.

Would of hated for my hard drive to die or something like that before getting it uploaded.
>>
Thank you anon!
>>
>>1039124
oops, wrong tripcode
>>
>>1039081
backtick
>>
Someone make a website and upload all the stuff there
>>
>>1035310
>>1035323
I don't know why this was deleted AND intentionally excluded from archive.org. I also can't find the 4chan article about it either, but from this dead link: http://chrishateswriting.com/post/68794699432/small-things-add-up
>By migrating to the new domain, end users now save roughly 100 KB upstream per page load, which at 500 million pageviews per month adds up to 46 terabytes per month in savings for our users. I find this unreal.
This was simple from changing image domains from img.4chan.org to i.4cdn.org to save 50 bytes.
Just that accounted for 46TB of extra text data a month.
>>
Bump
>>
>>1033260
What happened to yuki anyway? Will it come back?
>>
>>1039752
>>1033260
Also, not an option for me to back it up all of it now, but maybe some day. Thank you for the effort.
>>
>>1034377
>>1034401
Okay what did I miss?
>>
Bump
>>
how long does it date back to?
>>
Come back pls... yuki, the search on other archives is garbage
>>
>>1040317
Dunno, yuki was kind of annoying with how it loaded, but the good thing was how extensive it was. I wonder what really happened to it. As far as I know they didn't communicate or take donations...
Also, wish more archival sites let you give donations without me have to bother with memecoins.




>>
Guys tell me the date of these archives so i can know if i should bother downing and seeding
>>
>>1040735
February 2, 2008 to sometime in February 2021
>>
>>1040881
ok thanks for the info
>>
bump
>>
any more torrentable 4chan dumps?
>>
>>1041838
check bibanon and related sites, as well as internet archive (which generates a torrent file for most initial uploads, though obviously the torrent can't be updated if the collection changes)
>>
So what happened?
>>
>>1042518
It’s dead
>>
>>1042789
but why
>>
>>1042815
he car door man hook horn
>>
do any archives at all allow searching for symbols? seems like yuki.la was the only one that allowed this. like i cant search for things like "a****" , it either brings up irrelevant results or none at all.
>>
Is there any archive atm that plays webms on /gif/
>>
How do you search now for /t/ archives?
Archived.moe doesnt have search activated
>>
>>1043599
archive of sins but their search feature is momentarily disabled
>>
>>1040317
>tfw didnt even know yuki had a search engine until it went down

>>1040619
>wish more archival sites let you give donations without me have to bother with memecoins
this, I'm too much of a fucktard to figure it out and plus its annoying
>>
What was yuki.la?
>>
>>1044740
4chan threads archive
>>
>>1044740
one of the few archives that saved .webms from longer than 5 years ago and seemed to be one of the oldest surviving archives up until it died over a month ago.

A lot of people have wanted to help out to bring it back because of that, but no one knows anything about it other than it existed and the email for the manager/admin is lost because it was also from yuki.la
>>
>>1044930
the archiver never interacted with anyone and never had a contact email or anything like that either, or else people would have contacted him ages ago when things started to go down
>>
>>1044935
his contact email was hosted or however it's called from yuki.la servers. so once that went down, no one could email him.
I forget what error code the website was giving out that weekend, so maybe someone could have tried to reach out on early Sunday and Saturday, before it was just GONE.

sad that he never attempted to reach out.
>>
>>1044954
This, hell, pls archivers, set up proper donations (not just memecoin ones), if you must.
>>
Bump
>>
bump, thank you very much for the archive, man I miss yuki like you wouldn't believe
>>
>>1039809
I would also like to know
>>
>>1033260
Thanks anon, keep it up! Fucking super glad you archived this, so thanks!
I can't donate now but I'll keep this seeded forever.
>>
>>1034400
Bullshit it's worthless, if it has all the image hashes then it can be rebuilt from image archives.
A billion images are useless without their threads.
Having a nice archive like this sitting around is invaluable. In time I plan on archiving all archives, scraping all the images I can, and compiling them into a mega-archive.
The software I'll need to handle it will need to be written from scratch but my programming is improving and I'll be there soon. For now, I'm just happy to see people saving everything they can.
>>
>>1040317
>tfw arch.b4k.co is fucking garbage
>rebeccablacktech fucking merged with fucking desuarchive
>>
>>1033260
oh thank fuck
i miss yuki.la so fucking much
you would've though anons would have learned from other past archive failures like the old archive, fireden, fireden 2, 4tan etc by now
>>
Stuck at 95% anyone else?
>>
>>1046628
After I posted this it resumed again.
Thanks to whoever archived this but I wish there were pictures.
>>
bump
>>
Thank you for doing this anons. I enjoyed reading old threads from times past
>>
>>1034622
reddit ritard niger
>>
thank you for your effort archive anon
>>
Got the whole thing sitting on my NAS, can't donate but I'll seed forever.
Thanks OP, you're an absolute legend
>>
Bump
>>
Would make a backup, but need more space...
>>
last bump
>>
That's absolutely amazing, thank you for your efforts.
>>
Bumping till my new drive gets here.
>>
Bump
>>
Bump
>>
>>1033260
I quickly wrote up a couple simple python (tested on 3.7.9) scripts to extract all threads and posts matching a criteria.
It should be pretty easy to repurpose them to fit anyone's needs. You could even build a database with the scraped data for more efficient storage.

Script 1
>move all HTML files that match a regex to a different folder
https://pastebin.com/rZKZW7V2

Script 2
>scan all HTML files, match regex in the thread subject field, extract every post, clean and split the posts and save them to a single column CSV
https://pastebin.com/0u8etcZX

The regex are probably not perfect and I used the CSV in an AI project so you probably want to take out the part that cleans and splits posts and save more columns than just the post text.
It should be easy to get them to work on tar files rather than folders of html files but I haven't tried that.
>>
>>1052675
thank you friend
>>
bump
>>
>>1040317
Yuki's search sucked. I used desuarchive to find files and posts, and then went to the thread/post on yuki for the image file.
>>
RIP
>>
>no images
>no webms
nooo my memerinos
>>
File: 1574887227398.jpg (7 KB, 250x201)
7 KB
7 KB JPG
>>1033260
Great praise OP.

Thank you for your efforts. A part of 4chan died when yuki.la vanished. Does anyone even know what happened to yuki? Can't believe the best archive would just vanish like that without much warning or a chance to rescue more of what it had archived.
>>
>>1036937
The only existing /v/ and /vg/ full image archive from 2015-2019 are now on fireden's onion service that could go down at literally any time. It'd take years to scrape it all and there's no dumps. Real downer eh?
>>
I wrote a scraper for yuki.la at one point. It used this niche file format called nozomi, named after a Japanese train service, which stored all thread OP post id's for a given board in a single file. Its kind of an odd choice, but it offloads some pagination and lookup work to the client.
Another website which goes by the name nozomi.la, uses this same niche nozomi format, but to greater effect. Each tag has its own nozomi file, and using a single set intersection operation across any number of nozomi files, a client can find images which satisfy all tags fairly quickly and then request those from the server one by one.
My guess is that I the same person made both websites. They had the same TLD, use the same tech, and have similar interests.
I could previously find a github or gitlab which explained what nozomi was but I can't find it anymore. Maybe it was privated when yuki went down.
>>
>>1057149
I would love more detail about how it worked.





Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.