[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/t/ - Torrents

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


File: liveleak-625636968.jpg (49 KB, 600x300)
49 KB
49 KB JPG
I found a 1.2TB liveleak archive on archive.org and spent the last half a year off and on trying to extract it. It haas content from 2019 until the shutdown in 2021.
https://archive.org/details/archiveteam_liveleak
The archive consists of 50+ .megawarc.warc webarchive files which hold the videos and some metadata. There are also .idx , .xml , .sqlite , .json and .cdx files for each .megawarc archive. These archives were compressed into individual .gz files, which only saved a small amount of data, so I discarded the packed version.
I extracted the archive using warcat.
https://github.com/chfoo/warcat
The videos are stored with an ID in the filename, so maybe one of you can try to make a torrent with correct video titles.
I am posting it here, because you guys can probably do something with it.
The torrent below has the full unpacked archive (1.2TB) and a extracted folder that only contains the videos (0.97TB). I have port forwarding enabled and I'll seed for as long as I can. The videos are in a .mp4 format, but labelled with a wrong extention. You can view them normally in VLC without renaming.

magnet:?xt=urn:btih:495625d4a64dfe796eac7e0c06d6511e0031f6d4&dn=liveleak&tr=udp%3A%2F%2Ftracker.opentrackr.org%3A1337&tr=udp%3A%2F%2Fcoppersurfer.tk%3A6969%2Fannounce

.torrent: https://files.catbox.moe/m8xe2b.torrent
>>
>>1317947
Fucking Sweet!
TY Anon!
>>
Please seed this
>>
Bump and thank you OP
>>
Thank you!
>>
>>1317947
waiting to see if someone can make a proper torrent with titles and tags
>>
>>1317947
Alright seems doable, however would need to extract the warc files again since they contain the names, tags, description, when it was added and which files (720p, 360p, thumbnails) are related.

It would probably take ages on my pc but i'll give it a try tomorrow and see how it goes.

Next step would be to rename them and create a very simple local webpage to scroll through them by tags.
>>
in it's current state not much can be seen from the archive. Unfortunatelly I only have spinning rust to extract this, so it will take more than forever
>>
>>1317947
idk if anyone is still around, but i have a working program now which finds the names and so on for each clip, puts all tags for each in a database for easy access, changes the names to end with .mp4 and puts all videos in yearly folders instead of hundres of small ones.

The ~10GB example i tried had 549 unique videos for each a 720p and 360p version and 3800 Tags.
265 files which could not be matched to any title/description.
Still slightly bothered by those unidentified ones, however unsure if there simply is no related html page inside or those have a different layout.

Anyways maybe tomorrow, won't process them yet without having an answer to the remaining unidentified files.
>>
File: ok (2) - Copy.png (294 KB, 932x523)
294 KB
294 KB PNG
>>1328690
Still unsure why we are interested in this stuff but the sorting works now.
Just don't know what to do with related pictures and thumbnails, probably same as with videos.
Tags can be searched easily with the database, one file two tables. This one archive gave around 1400 videos and only 25 which couldn't be connected to anything, good enough. Those 25 will be moved to their own folder. Those 1400 videos had over 4000 tags lol which are also stored related to the filename.

Now just download the other archives and let automation do the rest.
>>
File: ok (1) - Copy.png (84 KB, 1469x583)
84 KB
84 KB PNG
>>1328725
>>
File: ok (3) - Copy.png (27 KB, 333x634)
27 KB
27 KB PNG
>>1328725
>>
File: 1725363518660139.jpg (31 KB, 558x750)
31 KB
31 KB JPG
bump
>>
File: dog.png (967 KB, 1920x1040)
967 KB
967 KB PNG
>>1317947
Local webpage is working now, the design well lets not talk about it.
All videos are search-, and playable in the browser only three additional files needed (one html, one phyton for the local server and the database file).
Description opens with a click. Search looks through tags, description and titles.Still need to let the cleanup program run over the archives, will make a torrent when done.
It will only keep the best quality video and picture and delete everything else.
>>
>>1328913
nice! any plan on launching it soon?
>>
>>1328929
Still downloading the remaining archives lol hope it'll be done in a few days.
For clarification, all files needed to run the site offline / locally on your machine will be included in the torrent. Once the sorted files and database is shared maybe someone with more html/css/js skills can make a nicer frontend.
>>
>>1328965
desu front end is a bloat, just let it rip, CLI only if needed
>>
>>1329097
12KB no bloat and search field comfy.
~100GB to go, down speeds slow af, maybe tomorrow.
>>
File: Unbeiugiuguignannt.png (36 KB, 1616x220)
36 KB
36 KB PNG
Hello OP here again, sorry for seeding this with such a slow home connection. I have uploaded 9.4+ TibiBytes so far. Glad to know that this material is actually being used! Please seed it for some time, because otherwise it will always stay slow. I will continue seeding as long as possible.

>>1328965
Please make sure to create a new thread for the new torrent!
>>
Bammmmping for long lost live leak. Fml. I come back to this shit.
>>
>>1317947
One of the more boring archives sadly.
In total slightly over 43000 unique videos at around 719GB and only very few interesting ones.

We would need a earlier archive with more fucked up content to be good, it looks like pornhub after the purge.
Most are clips you could add to a short news article or reddit post and are forgotten right after.

For example some shit trump said, corona news, cat sitting funny, protestors blocking the street,
a traffic jam in new delhi, palestinians march in solidary with tiny bit of violence mixed in.

I mean sure if it's a event happening right now those clips would be nice to have but now? idk
>>
bump
>>
Bump
>>
>>1317947
how did archive org get the videos anyway ? in general it shouldn't be able to get videos or am i wrong ?
>>
bumping
>>
>>1328913
please open source this, I need it to watch downloaded videos.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.