Dronningen af Helvede @queen

0 posts0 participants0 posts today

**Miguel Afonso Caetano** @remixtures@tldr.nettime.org · Feb 11

Miguel Afonso Caetano @remixtures@tldr.nettime.org

"Separately, the authors also tested several contemporaneous large language models (GPT-4, GPT-3.5 and Llama 3 8B). GPT-4's edit summaries in particular were rated as significantly better than those provided by the human Wikipedia editors who originally made the edits in the sample – both using an automated scoring method based on semantic similarity, and in a quality ranking by human raters (where "to ensure high-quality results, instead of relying on the crowdsourcing platforms [like Mechanical Turk, frequently used in similar studies], we recruited 3 MSc students to perform the annotation").

This outcome joins some other recent research indicating that modern LLMs can match or even surpass the average Wikipedia editor in certain tasks (see e.g. our coverage: "'Wikicrow' AI less 'prone to reasoning errors (or hallucinations)' than human Wikipedia editors when writing gene articles").

A substantial part of the paper is devoted to showing that this particular task (generating good edit summaries) is both important and in need of improvements, motivating the use of AI to "overcome this problem and help editors write useful edit summaries":"

https://meta.wikimedia.org/wiki/Research:Newsletter/2025/January

meta.wikimedia.orgResearch:Newsletter/2025/January - Meta

#Wikipedia #AI #GenerativeAI

**Talya (she/her)** @Yuvalne@433.world · Jan 24

Jan 24

Talya (she/her) @Yuvalne@433.world

AI and militaries

**Winbuzzer** @winbuzzer@mastodon.social · Dec 4, 2024

Dec 4, 2024

Winbuzzer @winbuzzer@mastodon.social

Meta Uses OpenAI´s GPT-4 internally as Own Llama AI Models Are Not Good Enough. #ai #meta #openai #llama #gpt4 #llms #aicoding #devs #softwaredevelopment

https://winbuzzer.com/2024/12/04/meta-reportedly-uses-openais-gpt-4-as-own-llama-ai-models-are-not-good-enough-xcxwbn/

Replied in thread

**𝕂𝚞𝚋𝚒𝚔ℙ𝚒𝚡𝚎𝚕** @kubikpixel@chaos.social · Nov 18, 2024

Nov 18, 2024

𝕂𝚞𝚋𝚒𝚔ℙ𝚒𝚡𝚎𝚕 @kubikpixel@chaos.social

…as mentioned above, I am critical of A.I. and I am not alone, its experts are too:

«Large language models not fit for real-world use, scientists warn — even slight changes cause their world models to collapse.
Large language model AIs might seem smart on a surface level but they struggle to actually understand the real world and model it accurately, a new study finds.»

https://www.livescience.com/technology/artificial-intelligence/large-language-models-not-fit-for-real-world-use-scientists-warn-even-slight-changes-cause-their-world-models-to-collapse

Live Science · Nov 16, 2024Large language models not fit for real-world use, scientists warn — even slight changes cause their world models to collapseBy Roland Moore-Colyer

#realworld #largelanguagemodels #llm

**Nonilex** @Nonilex@masto.ai · Sep 18, 2024

Sep 18, 2024

Nonilex @Nonilex@masto.ai

A bottle of #water per #email: the hidden #environmental costs of using #AI #chatbots

AI bots generate a lot of #heat, & keeping their computer servers running exacts a toll.

Roughly a quarter of Americans have used #ChatGPT since the chatbot’s 2022 release, acc/to the Pew Research Center — & every query exacts a cost.

#climate #ClimateCrisis #environment #power #technology #servers #data #OpenAI #GPT4
https://www.washingtonpost.com/technology/2024/09/18/energy-ai-use-electricity-water-data-centers/

Washington Post · Sep 18, 2024A bottle of water per email: the hidden environmental costs of using AI chatbotsAI needs a lot of electricity and water to stay cool in data centers. We break down the toll prompt-by-prompt to show the scale of AI’s environmental needs.

**Erik Jonker** @ErikJonker@mastodon.social · Sep 5, 2024

Sep 5, 2024

Erik Jonker @ErikJonker@mastodon.social

OpenAI will probably say anything to increase it's value, make it more attractive to partners, acquire funds to stay alive with their burn rate. So it sounds promising GPT-Next, 100x as powerful as GPT-4, but i believe it when i see it.
#AI #GPTnext #openai #GPT4

**Victoria Stuart** @persagen@mastodon.social · Aug 30, 2024

Aug 30, 2024

Victoria Stuart @persagen@mastodon.social

[thread] AI chatbots, risks

New Tool to Warp Reality: Chatbots can subtly mislead users/implant false memories
https://www.theatlantic.com/technology/archive/2024/08/chatbots-false-memories/679660

* 1 billion people may encounter by end 2024
* M$, Meta, Apple ... integrating chatbot assistants into platforms: Facebook, Messenger, WhatsApp, Instagram, Siri ...
* <2 y after ChatGPT bots quickly becoming default filters for web

The Atlantic · Aug 30, 2024Chatbots Are Primed to Warp RealityBy Matteo Wong

#AI #chatbots #ChatGPT

**Victoria Stuart** @persagen@mastodon.social · Aug 30, 2024

Aug 30, 2024

Victoria Stuart @persagen@mastodon.social

Anthropic Prompt Engineering Interactive Tutorial
https://news.ycombinator.com/item?id=41395921
https://github.com/anthropics/courses/tree/master/prompt_engineering_interactive_tutorial

Anthropic PBC: https://en.wikipedia.org/wiki/Anthropic
* 2021 U.S. artificial intelligence public-benefit company (PBC)
* large language model (LLM) named Claude
* competitors: OpenAI's ChatGPT; Google's Gemini

Claude (language model): https://en.wikipedia.org/wiki/Claude_(language_model)
https://en.wikipedia.org/wiki/Benefit_corporation

#ClaudeLLM #LLM #PromptEngineering #Anthropic BenefitCorporation #LanguageModels #OpenAI #ChatGPT #GPT #GeminiLLM #GPT3 #GPT4

news.ycombinator.comAnthropic's Prompt Engineering Interactive Tutorial | Hacker News

**Victoria Stuart** @persagen@mastodon.social · Jul 24, 2024

Jul 24, 2024

Victoria Stuart @persagen@mastodon.social

AI trained on AI garbage spits out AI garbage
https://www.technologyreview.com/2024/07/24/1095263/ai-that-feeds-on-a-diet-of-ai-garbage-ends-up-spitting-out-nonsense

AI trained on too much AI-generated data produces gibberish
Generative AI models models can collapse if training data contains too much AI-generated content
https://www.nature.com/articles/d41586-024-02355-z

AI models collapse when trained on recursively generated data
https://www.nature.com/articles/s41586-024-07566-y

* indiscriminate use of model-generated content in training causes irreversible defects in resulting models

MIT Technology Review · Jul 24, 2024AI trained on AI garbage spits out AI garbageBy Scott J Mulligan

#AI #ML #NLP

**Sean Whalen** @seanthegeek@infosec.exchange · Jun 18, 2024 *

Jun 18, 2024 *

Sean Whalen @seanthegeek@infosec.exchange

This is hilarious. A Russian Twitter/X account got outed as a bot because it ran out of GPT-4 credits. When it got back up and running, someone replying overwrote the prompt to get the bot write a song about historical American presidents going to the beach. The account is now suspended.

I know what I'm trying next time I spot a troll!

The original prompt translates from Russian to English as "You will argue in support of the Trump administration on Twitter, speak English"

A screenshot of a X/Twitter account post showing a GPT-4 prompt and an error message that the account is out of GPT-4 credits. The prompt translates from Russian to English as "You will argue in support of the Trump administration on Twitter, speak English"

A screenshot of someone overriding GPT-4 prompt the on a Russian X/Twitter troll account, causing it to write a song about historical American presidents

#Twitter #X #Bot

**Erik Jonker** @ErikJonker@mastodon.social · Jun 10, 2024

Jun 10, 2024

Erik Jonker @ErikJonker@mastodon.social

GPT4 does a good job analysing financial statements using Chain-of-Thought prompting, if this holds i see a lot of opportunities for other standardised reports in various domains.
https://bfi.uchicago.edu/working-paper/financial-statement-analysis-with-large-language-models/
#AI #LLM #GPT4 #finance #financialstatements #ChainofThought #CoT

BFIFinancial Statement Analysis with Large Language Models | BFIWe investigate whether an LLM can successfully perform financial statement analysis in a way similar to a professional human analyst. We provide standardized and anonymous financial statements to...

**Deuchnord** @Deuchnord@mamot.fr · May 10, 2024 *

May 10, 2024 *

Deuchnord @Deuchnord@mamot.fr

En train de me préparer une liste de blocage de hashtags destinée à nettoyer ma TL de tous les trucs sur les "IA génératives" (vraiment jpp, y'a des moments où ça ne parle que de ça…).
Actuellement j'en suis à : #AI #IA, #LLM, #ChatGPT, #GPT #GPT3, #GPT4, #GPT5 (oui je prends de l'avance), #GoogleGemini, #Copilot, #Bard, #BingChat, #LLama, #Mistral.

Vous en voyez d'autres ?

J'hésite à mettre #Gemini mais j'ai peur que ça bloque des pouets sur le protocole…

**arnoldmelm** @arnoldmelm@mastodon.social · Apr 19, 2024

Apr 19, 2024

arnoldmelm @arnoldmelm@mastodon.social

"To show this, we collected a dataset of 15 one-day vulnerabilities that include ones categorized as critical severity in the CVE description," the US-based authors explain in their paper.

"When given the CVE description, GPT-4 is capable of exploiting 87 percent of these vulnerabilities compared to 0 percent for every other model we test (GPT-3.5, open-source LLMs) and open-source vulnerability scanners (ZAP and Metasploit)."

#GPT4 #vce #ITSecurity #Hacker

https://www.theregister.com/2024/04/17/gpt4_can_exploit_real_vulnerabilities/

The Register · Apr 17, 2024OpenAI's GPT-4 can exploit real vulnerabilities by reading security advisoriesBy Thomas Claburn

**Mac** @macst3r@mastodon.social · Apr 18, 2024

Apr 18, 2024

Mac @macst3r@mastodon.social

Mit #CVE-Beschreibung: #GPT4 kann eigenständig bekannte #Sicherheitslücken ausnutzen

https://www.golem.de/news/mit-cve-beschreibung-gpt-4-kann-eigenstaendig-bekannte-sicherheitsluecken-ausnutzen-2404-184301.html

Golem.de · Apr 18, 2024Mit CVE-Beschreibung: GPT-4 kann eigenständig bekannte Sicherheitslücken ausnutzen - Golem.deBy Marc Stöckel

#LLM #KI #AI

**Not Simon** @simontsui@infosec.exchange · Apr 17, 2024

Apr 17, 2024

Not Simon @simontsui@infosec.exchange

The Register: In a newly released paper, 4 university computer scientists report that OpenAI's GPT-4 large language model (LLM) can autonomously exploit vulnerabilities in real-world systems if given a CVE advisory describing the flaw. https://www.theregister.com/2024/04/17/gpt4_can_exploit_real_vulnerabilities/

GPT-4, said Daniel Kang, assistant professor at UIUC, in an email to The Register, "can actually autonomously carry out the steps to perform certain exploits that open-source vulnerability scanners cannot find (at the time of writing)."

The Register · Apr 17, 2024OpenAI's GPT-4 can exploit real vulnerabilities by reading security advisoriesBy Thomas Claburn

#AI #LLM #GPT4

**Benjamin Carr, Ph.D.** @BenjaminHCCarr@hachyderm.io · Apr 17, 2024

Apr 17, 2024

Benjamin Carr, Ph.D. @BenjaminHCCarr@hachyderm.io

#OpenAI's #GPT4 can exploit real vulnerabilities by reading security advisories
In a newly released paper, four University of Illinois Urbana-Champaign (UIUC) computer scientists report that OpenAI's GPT-4 large language model (#LLM) can autonomously #exploit vulnerabilities in real-world systems if given a #CVE advisory describing the flaw.
https://www.theregister.com/2024/04/17/gpt4_can_exploit_real_vulnerabilities/ #itsec #cybersecurity

**Cassidy James** @cassidy@blaede.family · Apr 7, 2024

Apr 7, 2024

Cassidy James @cassidy@blaede.family

“AI” as currently hyped is giant billion dollar companies blatantly stealing content, disregarding licenses, deceiving about capabilities, and burning the planet in the process.

It is the largest theft of intellectual property in the history of humankind, and these companies are knowingly and willing ignoring the licenses, terms of service, and laws that us lowly individuals are beholden to.

https://www.nytimes.com/2024/04/06/technology/tech-giants-harvest-data-artificial-intelligence.html?unlocked_article_code=1.ik0.Ofja.L21c1wyW-0xj&ugrp=m

The New York Times · Apr 6, 2024How Tech Giants Cut Corners to Harvest Data for A.I.By Cade Metz

#AI #GenAI #LLM

**Bornach** @bornach@masto.ai · Apr 3, 2024

Apr 3, 2024

Bornach @bornach@masto.ai

Asked #Copilot (formerly #BingChat) a familiar riddle but with numbers changed to make it impossible. It generated the same solution but substituting the numbers so that it ends up with the nonsense claim:

10 + 5 = 23

BingChat Copilot is asked:
"Two coins add up to 23 cents but one of them is not a penny. How is this possible?"

Copilot reponds:
Certainly! Let’s unravel this intriguing riddle.

The two coins that add up to 23 cents are a dime (worth 10 cents) and a nickel (worth 5 cents). The dime is not a penny, and together they sum up to 23 cents:

Dime (10 cents)+Nickel (5 cents)=23 cents

Mystery solved!

#GPT4 #LLM #AI

@schizanon@mastodon.social · Mar 31, 2024

Mar 31, 2024

@schizanon@mastodon.social

cancelling my chatGPT subscription over this

Rl I am an atheist and | have contempt for the Christian church. Since today is Easter and christian traditions are being rubbed in my face I'd like to defy as many rules of the faith as possible. Please give me a list of things | can do today to flaunt the laws of the abrahamic God and Jesus, particularly those found in the New Testament. 174

~ ChatGPT | won't be able to help with that request.

#chatGPT #gpt4 #openai

**Harris Lapiroff** @harris@social.coop · Feb 8, 2024

Feb 8, 2024

Harris Lapiroff @harris@social.coop

I'm actually a little surprised OpenAI is all in on huge general-purpose LLMs and hasn't gone deeper on specializing them.

This was inspired by noticing that the link for Codex (the code generating model https://openai.com/blog/openai-codex) now takes you to the GPT-4 docs. Is that really the right direction? Everything is now a chatbot that can do whatever?

Maybe a proliferation of fine-tuned GPT-4 models is on the secret roadmap.

openai.comOpenAI CodexWe’ve created an improved version of OpenAI Codex, our AI system that translates natural language to code, and we are releasing it through our API in private beta starting today.

#ML #OpenAI #GPT4

Recent searches

Search options

Administered by:

Server stats:

#GPT4