helvede.net is one of the many independent Mastodon servers you can use to participate in the fediverse.
Velkommen til Helvede, fediversets hotteste instance! Vi er en queerfeministisk server, der shitposter i den 9. cirkel. Welcome to Hell, We’re a DK-based queerfeminist server. Read our server rules!

Server stats:

171
active users

#datasets

2 posts2 participants0 posts today

#ListenBrainz / #MetaBrainz I'm confused. Aren't sponsors the true customer? Why use this? 🤔

On one hand #Music: "Listen together", "Ethical forever"

On the other: #DATASETS

"Some of the world’s biggest platforms such as Google and Amazon, use our data"

"We ask commercial supporters to support us in order to help fund the creation and maintenance of these datasets."

"The following organizations make use of the data-sets published by MetaBrainz"

"Unicorn tier: #Google, #Amazon, #Spotify"

STAT: Gold-standard maternal mortality database in limbo as CDC staff placed on leave. “As part of the sweeping layoffs that rocked the Department of Health and Human Services on Tuesday, the entire staff that oversaw an annual survey to better understand infant and maternal health — and that was considered the gold standard in the field — was placed on administrative leave. The Pregnancy […]

https://rbfirehose.com/2025/04/02/stat-gold-standard-maternal-mortality-database-in-limbo-as-cdc-staff-placed-on-leave/

This data may vanish under Trump, so we charted it
Some of most valuable #datasets in human history vanished from #US #government websites, felt like watching Library of Alexandria go up in smoke
Many have gone on record describing #Census Bureau’s #American Community Survey as wonder of modern world
Another loss? #HouseholdPulse survey, online survey that provided week-by-week data on income losses, economic struggles and precarious mental health
washingtonpost.com/business/20
archive.ph/mB512

The Washington Post · This data may vanish under Trump, so we charted itBy Andrew Van Dam
Continued thread

"Some federal #datasets are nearly irreplaceable. Hurricane Helene helped drive that fact home in September, when it flooded much of western North Carolina and temporarily knocked NOAA’s NCEI headquarters in Asheville offline. Scientists found they were unable to complete certain kinds of analyses until the databases were back up and running."

scientificamerican.com/article

Scientific American · Scientists Scramble to Save Climate Data from Trump—AgainBy Chelsea Harvey

Hey #DataScience people!

I am about to start my first “Introduction to Data Science” course at #University, and our professor asked us to team up and think about a project that we want to do.

Nevertheless, since I don’t know anything about the topic yet, I would really appreciate any tips of entry-level data science projects that I could do with #OpenData #DataSets in #Python!

Probably, we will be using #pandas. Since you’re here, any additional learning resources or general suggestions are much welcome, too!

Thanks ❤️👾

(Not sure how useful it is, but this is the course link: ois2.tlu.ee/tluois/subject/ULP)

ois2.tlu.eeTLÜ ÕIS

The #wasm build of #sqlite opens up great new possibilities in the #browser, especially when coupled with the Origin Private File System (#opfs). I've used it to implement importing huge #csv or #jsonl #datasets right in the browser. Import, validate, search, edit, close the browser and continue tomorrow, stream the database (with on-the-fly compression!) to the server when ready – it's all possible!

Another interesting use case I came across today: use it in #golang to get rid of #cgo!

[1/2]

Continued thread

Here, we display the performance of the algorithm on the DVSgesture dataset. For this gesture recognition task, the online HOTS accuracy remains close to the chance level for about 100 events. More evidence needs to be accumulated, and then the accuracy increases monotonically, outperforming the previous method after about 10.000 events (at an average of 9.3% of the number of events in the sample) :

Under the carpet of this article, Mark Zuckerberg is hiding the fact that—contrary to traditional free/open source software—the freedoms to study and modify #Llama models are asymmetric. Meta can exercise both with much more ease than everyone else, because they are the only ones with access to the training datasets (and full training pipeline): about.fb.com/news/2024/07/open

#Opensource #AI needs open training #datasets.

Oh, and scientific #reproducibility needs them too. #openscience

Meta · Open Source AI Is the Path Forward | MetaMark Zuckerberg outlines why he believes open source AI is good for developers, Meta and the world.

The collection of vast #datasets has become crucial for discovering the secrets of the #universe.

What are the challenges and innovations in managing #astronomical #data and how do we deal with #metadata in #astronomy?

➡️ Read the interview with a #BigData #astronomer and an expert on #archiving data from #satellite-mounted astronomical instruments:

🔗 rug.nl/digital-competence-cent

📷 Credit: ESA/Rosetta/NavCam – CC BY-SA IGO 3.0

The Institute for Dissent and Datalove is a loose collective comprised of hackers, artists, activists and tinkerers. It overlaps with networks of solidarities involved in active defense of free speech and free/libre technologies, technology critics and political interventions.

The Institute for Dissent and Datalove has so far mostly been used for operations of de/re-contextualization of large datasets, de-formatting of formats and playful use of liberating algorithms.

It tries to criticize and deconstruct itself, while remaining grounded in uncompromising collective practices of autonomy and solidarity.

We even have a website: dissent-and-datalove.institute