#datahoarder

Found my first DVD that just straight up will not read because the disc has delaminated. I have "a" transcoded copy of these episodes, but I'm working on re-transcoding to AV1, adding the "No laugh track" audio I missed last time, etc., and I'm planning to keep ISOs of these discs for long-term use. I may have to re-order a copy of just this season so I can rip it properly.

Just discovered ArchiveBox: FOSS, self-hosted internet archiving.

The way the web is going, with the US government redacting and outright erasing historic content, publishers segmenting content by region (and also sometimes redacting/censoring it), and CloudFlare shitting all over everything, I think it's time for me to start my #archiving and #DataHoarding journey.

#SelfHosting #SelfHosted #DataHoarder

github.com/ArchiveBox/ArchiveB

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more... - ArchiveBox/ArchiveBox

#TechHelp please (calling any #DataHoarder out here):

is there a way I can archive web pages (not the entire website, just single pages) locally en masse, kind of like Adobe Encoder? Just give it a list of links and off it goes?

Bonus points for:

Retaining all of the media therein (like video, images, links, audio...)

Either being able to get around paywalls, or letting me know which pages have them (so, I guess including that info in any error messages)

Being able to omit ads (but I don't care too much about this)

Having some sort of UI (Windows user; extremely uncomfortable with command line-type stuff)
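For anyone who can tolerate a small script, the "give it a list of links and off it goes" part can be sketched in Python using only the standard library. This is a rough sketch, not a full archiver: it saves only the raw HTML of each page (no embedded video, images, or audio), it does not get around paywalls, and it does not strip ads. The names `archive_pages`, `slugify`, and `fetch` are made up for this example, and treating HTTP 401/402/403 as a likely login wall or paywall is an assumption; many paywalled pages return a normal 200 response and would not be flagged.

```python
import re
import urllib.error
import urllib.request
from pathlib import Path


def slugify(url: str) -> str:
    """Turn a URL into a file name that is safe on Windows and Linux."""
    name = re.sub(r"^https?://", "", url)
    name = re.sub(r"[^A-Za-z0-9._-]+", "_", name).strip("_")
    return (name[:150] or "page") + ".html"


def fetch(url: str, timeout: float = 30.0) -> bytes:
    """Download the raw HTML of a single page."""
    request = urllib.request.Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read()


def archive_pages(urls, out_dir, fetcher=fetch):
    """Save each URL into out_dir and return (saved, failed) lists."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    saved, failed = [], []
    for url in urls:
        try:
            body = fetcher(url)
        except urllib.error.HTTPError as err:
            # Assumption: 401/402/403 often (not always) means a login wall or paywall.
            failed.append((url, f"HTTP {err.code}"))
            continue
        except (urllib.error.URLError, OSError) as err:
            # DNS failures, timeouts, refused connections, and similar.
            failed.append((url, str(err)))
            continue
        path = out / slugify(url)
        path.write_bytes(body)
        saved.append((url, path))
    return saved, failed
```

Calling `archive_pages(list_of_links, "archive")` writes one `.html` file per link into an `archive` folder and returns the links that failed together with the error, so pages that refused access are at least listed. For media retention and a web UI, a dedicated tool such as ArchiveBox is probably a better fit than a script like this.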

I now have my own fully searchable mirror of #Kiwix hosted on my home server, including #Wikipedia. You can check it out at:

kiwix.marcusadams.me

Content may change as time goes on since I literally "just" got it set up and working, and you should definitely prioritize the original resources and donate to the folks hosting it. But IF some of these resources become unavailable from their original source, feel free to use my mirror as long as it's up.

#TIL about the Internet History Initiative (@IHI). It's a website that focuses on historically relevant public data sets. As a #datanerd and #datahoarder of #internet data, I appreciate that something like this has spun up.

However, I am shocked that I hadn't heard of it until now, even though it has been online since January 2024! Will definitely start keeping an eye on it.

Edit: Forgot to link the website: internethistoryinitiative.org

Internet History Initiative: Curation and preservation of historical Internet measurement datasets
#infosec #data #DNS