#benchmarks


Hello clever Fediverse, I’m currently diving into a completely absurd #Rabbithole, and thanks to #LlmStudio and #Ollama my M1 MacBook has rediscovered its fan… Right now, local #LLM use tops out at 8-12b models for me (32GB RAM). Are there #Benchmarks anywhere that would please talk me out of the idea that an M4 with >48GB RAM will be drastically better? Or would something else entirely be smarter? Or a different hobby? It has to be mobile (reachable), because my lifestyle is too unsettled for a desktop. Recommendations welcome in the comments.

🔔 New Essay 🔔

"The Intelligent AI Coin: A Thought Experiment"

Open Access here: seanfobbe.com/posts/2025-02-21

Recent years have seen a concerning trend towards normalizing decisionmaking by Large Language Models (LLM), including in the adoption of legislation, the writing of judicial opinions and the routine administration of the rule of law. AI agents acting on behalf of human principals are supposed to lead us into a new age of productivity and convenience. The eloquence of AI-generated text and the narrative of super-human intelligence invite us to trust these systems more than we have trusted any human or algorithm ever before.

It is difficult to know whether a machine is actually intelligent because of problems with construct validity, plagiarism, reproducibility and transferability in AI benchmarks. Most people will either have to personally evaluate the usefulness of AI tools against the benchmark of their own lived experience or be forced to trust an expert.

To explain this conundrum I propose the Intelligent AI Coin Thought Experiment and discuss four objections: the restriction of agents to low-value decisions, making AI decisionmakers open source, adding a human-in-the-loop and the general limits of trust in human agents.

@histodons @politicalscience


Ansible annoyances when writing compliance-as-code checks with it:

* When a value can be set in multiple places
* When you want to check for the required absence of a service
* Dealing with situations when you want to handle multiple files in a single location
* Where one or more options are acceptable
* Handling read-only file systems
* Error handling on shell module

It's a shame, as Ansible is still the best general-purpose fit I've found, but the native modules are frustrating to use: they always assume you want to deploy a given state, rather than simply compare the current state against a set of potentially valid options.
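The usual workaround is to run a state module with `check_mode: true` and `failed_when: result.changed`, but that still phrases every check as a desired state. As a thought experiment, here's a minimal sketch of a compare-only custom module; the module name, option names, and the first-match parsing rule are all invented for illustration:

```python
#!/usr/bin/python
# Hypothetical compare-only Ansible module sketch: it reports compliance
# instead of enforcing state. Name and options are invented for illustration.
from ansible.module_utils.basic import AnsibleModule

def main():
    module = AnsibleModule(
        argument_spec=dict(
            path=dict(type='str', required=True),
            key=dict(type='str', required=True),
            allowed=dict(type='list', elements='str', required=True),
        ),
        supports_check_mode=True,  # the module never changes anything anyway
    )
    path = module.params['path']
    key = module.params['key']
    try:
        with open(path) as f:
            rows = [line.split() for line in f
                    if line.strip() and not line.lstrip().startswith('#')]
    except OSError as exc:
        module.fail_json(msg=f"cannot read {path}: {exc}")
    # First occurrence wins, mirroring how sshd parses sshd_config;
    # other file formats may need a different rule (e.g. last wins).
    values = [row[1] for row in rows if len(row) >= 2 and row[0] == key]
    value = values[0] if values else None
    module.exit_json(changed=False,
                     compliant=value in module.params['allowed'],
                     value=value)

if __name__ == '__main__':
    main()
```

A playbook task would then assert on `result.compliant` instead of fighting a state-enforcing module.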

#compliance, #benchmarks, #hardening

CIS Benchmarks are awful.

Just a bunch of arbitrary commands (not even consistent across checks of the same type) dumped into randomly structured Markdown. Even if the commands themselves are any good (which they may or may not be), they're only usable by humans, because you have to parse the instructions on how to interpret their output by eye.

I'm writing a compliance-as-code framework, and I might as well rewrite the underlying benchmarks /except/ for the fact that they're recognisable, so people demand them over something more useful.

#compliance, #benchmarks, #hardening

Since the #Moonbit #JavaScript backend post (moonbitlang.com/blog/js-suppor) is trending, I thought I'd compare #PureScript backend optimizer (github.com/aristanetworks/pure) output to see how it fares. The results were pretty good!

With basically this PureScript code -
```
run = fromArray                            -- lift the input array into the stream type
>>> flatMapF (fromArray <<< _.members)     -- flatten each record's members array
>>> filterF _.gender                       -- keep members whose gender flag is set
>>> mapF (\x -> min 100 (x.score + 5))     -- add 5 to each score, capped at 100
>>> mapF grade                             -- map the score to a letter grade
>>> filterF (_ == 'A')                     -- keep only the 'A' grades
>>> foldF (\_ x -> x+1) 0                  -- count what remains
```

the benchmark results are as follows. PureScript is roughly 6x faster than plain JS, and roughly 7x slower than the Moonbit output:

```
┌─────────┬──────────────┬─────────────┬────────────────────┬──────────┬─────────┐
│ (index) │ Task Name    │ ops/sec     │ Average Time (ns)  │ Margin   │ Samples │
├─────────┼──────────────┼─────────────┼────────────────────┼──────────┼─────────┤
│ 0       │ 'Moonbit'    │ '3,467,542' │ 288.38869989829305 │ '±0.06%' │ 1733772 │
│ 1       │ 'Plain Js'   │ '74,816'    │ 13365.983827421464 │ '±0.54%' │ 37409   │
│ 2       │ 'Kotlin Js'  │ '190,241'   │ 5256.474017304151  │ '±0.38%' │ 95121   │
│ 3       │ 'PureScript' │ '499,456'   │ 2002.1768597161156 │ '±0.70%' │ 249729  │
└─────────┴──────────────┴─────────────┴────────────────────┴──────────┴─────────┘
```
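For anyone who doesn't read PureScript, here is a rough plain-Python rendering of what the pipeline computes; the record fields are taken from the snippet above, but `grade` and its thresholds are assumptions, not code from the benchmark repo:

```python
# Rough Python equivalent of the PureScript pipeline, only to illustrate
# what the benchmark computes; `grade` is an assumed helper.
def grade(score):
    # Assumed grading scheme, purely for illustration.
    return "A" if score >= 90 else "B"

def run(records):
    count = 0
    for record in records:
        for member in record["members"]:        # flatMapF (fromArray <<< _.members)
            if not member["gender"]:            # filterF _.gender
                continue
            score = min(100, member["score"] + 5)  # mapF: +5, capped at 100
            if grade(score) == "A":             # mapF grade >>> filterF (_ == 'A')
                count += 1                      # foldF: count what remains
    return count
```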


I finally have benchmarks for Mergeable Libraries! Here are the results on my iPhone 14 Pro. I took app startup measurements with 0–100 small frameworks in three configurations: plain old dynamic frameworks, directly merged dynamic frameworks, and frameworks merged through one intermediate framework. See the results for yourself. The second image is a close-up near the origin.

The results are measured from the time the app begins running (the process is created) to just before the UIKit initialization signpost. Process creation time varies wildly, but typically ranges from 100–400 ms.

Blog post here: humancode.us/2024/01/02/measur

EDIT: Added measurement using static frameworks instead of dynamic.

#ios #iPhone #Xcode

Wow, this is nuts. Intel must be suffering far more than I thought.

Check out this asinine marketing from Intel calling out AMD and other rivals. The message is crazy. They are literally using the term "snake oil" and showing sleazy used car salesmen, representing AMD. And this is public - from Intel.

youtube.com/watch?v=xUT4d5IVY0

#Intel #marketing #AMD

Is RISC-V ready for HPC prime-time: evaluating the 64-core Sophon SG2042 RISC-V CPU

The Sophon SG2042 is the world's first commodity 64-core RISC-V CPU for high-performance workloads, and an important question is whether the SG2042 has the potential to encourage the HPC community to embrace RISC-V.

In this paper we undertake a performance exploration…

osnews.com/story/138049/is-ris


::: Linux vs. Windows in 10 games - Linux 17% faster on average :afire:

Times have changed, have they not :thinkergunsunglasses:

With macOS seemingly dropping out of the gaming field altogether and Linux only rising, where might Linux be in a couple of years? :thinkhappy:

=> video.hardlimit.com/w/uZGK12oU

#Linux #vs #Windows #benchmarks #Peertube #performance #gaming @cosmic_happiness

Windows 11 vs. Ubuntu 23.10 performance on the Lenovo ThinkPad P14s Gen 4

Out of 72 benchmarks run in total on both operating systems with the Lenovo ThinkPad P14s Gen 4, Ubuntu 23.10 was the fastest about 64% of the time.

Taking the geometric mean of all the benchmark results, Ubuntu 23.10 comes out about 10% faster than the stock Windows 11 Pro install…

osnews.com/story/137528/window


Looking for a more powerful version of the ‘time’ tool on #Linux. Any suggestions?
I’m doing some simple #Benchmarks that I’m driving from a shell script. The ‘time’ command is a cheap and easy starting point, but I’d love to measure in more detail, including total IO (in bytes, not IO counts like ‘time’ reports) and peak filesystem space usage. I suspect some of this could be figured out from ‘strace’ logs, but that would need a bit more work.
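Not a full answer, but GNU time's verbose mode (`/usr/bin/time -v`) already adds peak RSS and filesystem input/output counters, and on Linux `/proc/<pid>/io` exposes per-process byte counts. A rough wrapper sketch along those lines (it polls, so the final numbers can slightly undercount, and peak filesystem space usage would still need something extra, like polling `du`):

```python
#!/usr/bin/env python3
# Hypothetical wrapper sketch: run a command, then report wall time,
# peak RSS, and total read/write bytes. Linux-only (/proc/<pid>/io).
import resource
import subprocess
import sys
import time

def main():
    start = time.monotonic()
    proc = subprocess.Popen(sys.argv[1:])
    io_stats = {}
    # Poll /proc/<pid>/io while the child runs; the file vanishes when the
    # process exits, so keep the last successful reading (may undercount).
    while proc.poll() is None:
        try:
            with open(f"/proc/{proc.pid}/io") as f:
                io_stats = dict(line.split(": ") for line in f.read().splitlines())
        except OSError:
            pass
        time.sleep(0.05)
    elapsed = time.monotonic() - start
    usage = resource.getrusage(resource.RUSAGE_CHILDREN)
    print(f"wall time:   {elapsed:.3f} s", file=sys.stderr)
    print(f"peak RSS:    {usage.ru_maxrss} KiB", file=sys.stderr)  # KiB on Linux
    print(f"read bytes:  {io_stats.get('read_bytes', '?')}", file=sys.stderr)
    print(f"write bytes: {io_stats.get('write_bytes', '?')}", file=sys.stderr)

if __name__ == "__main__":
    main()
```

Usage would be something like `./measure.py ./my-benchmark --args`, mirroring how ‘time’ wraps a command.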