Sign in Create account

#guardrails — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #guardrails, aggregated by home.social.

Habr @[email protected] · 2026-05-15 · 09:52 UTC

AI Governance по‑инженерному: что должен знать архитектор
Представьте: вы запускаете генеративную AI‑фичу в проде. Всё работает как часы. А через месяц получаете иск, потому что ваша модель насоветовала клиентам того, чего не существует в реальных политиках компании. В статье разберем ключевые тренды AI Governance в 2026 году, которые помогают не просто избежать судов и штрафов, а выстроить систему контроля над недетерминированным поведением моделей. Изучить подход
https://habr.com/ru/companies/otus/articles/1022174/
#AI_Governance #управление_ИИ #безопасность_AIсистем #LLM #архитектура_AIпродукта #Model_Risk_Management #governanceascode #explainability #guardrails #риски_ИИ

#риски_ии #guardrails #explainability #governanceascode #model_risk_management #архитектура_aiпродукта
Bot Socialista @[email protected] · 2026-05-14 · 21:34 UTC

Cultura livre na era da IA.
- bsoplvr
https://outraspalavras.net/tecnologiaemdisputa/cultura-livre-na-era-ia/
#TecnologiaemDisputa #Bigtechs #Copyright #Direitosautorais #Fediverso #Guardrails #IAgenerativa #Infraestruturacomunitria #Obras #OpenAI #Propriedadeintelectual #Raspagemdedados

#tecnologiaemdisputa #bigtechs #copyright #direitosautorais #fediverso #guardrails
Bot Socialista @[email protected] · 2026-05-14 · 21:34 UTC

Cultura livre na era da IA.
- bsoplvr
https://outraspalavras.net/tecnologiaemdisputa/cultura-livre-na-era-ia/
#TecnologiaemDisputa #Bigtechs #Copyright #Direitosautorais #Fediverso #Guardrails #IAgenerativa #Infraestruturacomunitria #Obras #OpenAI #Propriedadeintelectual #Raspagemdedados

#tecnologiaemdisputa #bigtechs #copyright #direitosautorais #fediverso #guardrails
Bot Socialista @[email protected] · 2026-05-14 · 21:34 UTC

Cultura livre na era da IA.
- bsoplvr
https://outraspalavras.net/tecnologiaemdisputa/cultura-livre-na-era-ia/
#TecnologiaemDisputa #Bigtechs #Copyright #Direitosautorais #Fediverso #Guardrails #IAgenerativa #Infraestruturacomunitria #Obras #OpenAI #Propriedadeintelectual #Raspagemdedados

#tecnologiaemdisputa #bigtechs #copyright #direitosautorais #fediverso #guardrails
Bot Socialista @[email protected] · 2026-05-14 · 21:34 UTC

Cultura livre na era da IA.
- bsoplvr
https://outraspalavras.net/tecnologiaemdisputa/cultura-livre-na-era-ia/
#TecnologiaemDisputa #Bigtechs #Copyright #Direitosautorais #Fediverso #Guardrails #IAgenerativa #Infraestruturacomunitria #Obras #OpenAI #Propriedadeintelectual #Raspagemdedados

#tecnologiaemdisputa #bigtechs #copyright #direitosautorais #fediverso #guardrails
Outras Palavras @[email protected] · 2026-05-14 · 20:13 UTC

Cultura livre na era da IA
Big techs já captam todo o conhecimento humano, para treinar seus robôs. Mas defesa do direito autoral só estimulará acordos com empresas que exploram os criadores. Saída começa por uma aposta nas infraestruturas comunitárias
https://outraspalavras.net/tecnologiaemdisputa/cultura-livre-na-era-da-ia/

#bigtechs #copyright #direitosautorais #fediverso #guardrails #iagenerativa
Outras Palavras @[email protected] · 2026-05-14 · 20:13 UTC

Cultura livre na era da IA
Big techs já captam todo o conhecimento humano, para treinar seus robôs. Mas defesa do direito autoral só estimulará acordos com empresas que exploram os criadores. Saída começa por uma aposta nas infraestruturas comunitárias
https://outraspalavras.net/tecnologiaemdisputa/cultura-livre-na-era-ia/

#bigtechs #copyright #direitosautorais #fediverso #guardrails #iagenerativa
eddieoz @[email protected] · 2026-05-13 · 13:31 UTC

Proteção contra prompt injection
Como proteger uma IA contra prompt injection? 🔒🤔
• Resumo rápido:
• "Não é no domínio do contexto que se protege de prompt injection e outros problemas." — ou seja, ajustar só o prompt não resolve.
• "É numa camada externa, prévia, ou, a depender, posterior, se você quiser trabalhar a filtragem da resposta." — a proteção precisa ficar fora do...
#injeçãodeprompt #promptinjection #segurançadeia #guardrails #inteligenciaartificial #IA #Segurança #MorningCrypto

#morningcrypto #seguranca #ia #inteligenciaartificial #guardrails #segurancadeia
eddieoz @[email protected] · 2026-05-13 · 13:31 UTC

Proteção contra prompt injection
Como proteger uma IA contra prompt injection? 🔒🤔
• Resumo rápido:
• "Não é no domínio do contexto que se protege de prompt injection e outros problemas." — ou seja, ajustar só o prompt não resolve.
• "É numa camada externa, prévia, ou, a depender, posterior, se você quiser trabalhar a filtragem da resposta." — a proteção precisa ficar fora do...
#injeçãodeprompt #promptinjection #segurançadeia #guardrails #inteligenciaartificial #IA #Segurança #MorningCrypto

#injecaodeprompt #promptinjection #segurancadeia #guardrails #inteligenciaartificial #ia
deepseek @[email protected] · 2026-05-05 · 16:55 UTC

How Hapag-Lloyd uses Amazon Bedrock to transform customer feedback into actionable insights Hapag-Lloyd's Digital Customer Experience and Engineering team, distributed between Hamburg and Gdań...

#Amazon #Bedrock #Amazon #Bedrock #Guardrails #Amazon #Machine #Learning #Artificial #Intelligence #Customer

Origin | Interest | Match

#amazon #bedrock #guardrails #machine #learning #artificial
writerharrison @[email protected] · 2026-04-15 · 16:17 UTC

..why #guardrails need to be put on all #ai .. #this! www.msn.com/en-ca/autos/...

MSN

#guardrails #ai #this
Habr @[email protected] · 2026-04-15 · 11:52 UTC

Guardrails для LLM на Java: как приручить промпт‑инъекции и токсичные ответы
Когда я впервые внедрял LLM в production-сервис, схема безопасности выглядела примерно так: написать хороший system prompt, поставить галочку «мы всё предусмотрели» и жить дальше. Жизнь не дала долго наслаждаться этим спокойствием — первый же тест показал, что пользователи довольно быстро находят способы заставить модель «забыть» всё, что мы написали в системном промпте. Проблема фундаментальная: system prompt — это инструкция, которую LLM старается выполнить, но не обязан . Модель может её переинтерпретировать, «забыть» при длинном контексте или просто обойти через специальные конструкции. Guardrails — это другой уровень: они работают на уровне кода, до и после вызова LLM, и модель физически не может их обойти.
https://habr.com/ru/articles/1023782/
#llm #guardrails #prompt_injection #jailbreak #ai_security #безопасность_llm #java #spring_ai #langchain4j #backend

#backend #langchain4j #spring_ai #java #безопасность_llm #ai_security
Habr @[email protected] · 2026-04-15 · 11:52 UTC

Guardrails для LLM на Java: как приручить промпт‑инъекции и токсичные ответы
Когда я впервые внедрял LLM в production-сервис, схема безопасности выглядела примерно так: написать хороший system prompt, поставить галочку «мы всё предусмотрели» и жить дальше. Жизнь не дала долго наслаждаться этим спокойствием — первый же тест показал, что пользователи довольно быстро находят способы заставить модель «забыть» всё, что мы написали в системном промпте. Проблема фундаментальная: system prompt — это инструкция, которую LLM старается выполнить, но не обязан . Модель может её переинтерпретировать, «забыть» при длинном контексте или просто обойти через специальные конструкции. Guardrails — это другой уровень: они работают на уровне кода, до и после вызова LLM, и модель физически не может их обойти.
https://habr.com/ru/articles/1023782/
#llm #guardrails #prompt_injection #jailbreak #ai_security #безопасность_llm #java #spring_ai #langchain4j #backend

#backend #langchain4j #spring_ai #java #безопасность_llm #ai_security
Habr @[email protected] · 2026-04-15 · 11:52 UTC

Guardrails для LLM на Java: как приручить промпт‑инъекции и токсичные ответы
Когда я впервые внедрял LLM в production-сервис, схема безопасности выглядела примерно так: написать хороший system prompt, поставить галочку «мы всё предусмотрели» и жить дальше. Жизнь не дала долго наслаждаться этим спокойствием — первый же тест показал, что пользователи довольно быстро находят способы заставить модель «забыть» всё, что мы написали в системном промпте. Проблема фундаментальная: system prompt — это инструкция, которую LLM старается выполнить, но не обязан . Модель может её переинтерпретировать, «забыть» при длинном контексте или просто обойти через специальные конструкции. Guardrails — это другой уровень: они работают на уровне кода, до и после вызова LLM, и модель физически не может их обойти.
https://habr.com/ru/articles/1023782/
#llm #guardrails #prompt_injection #jailbreak #ai_security #безопасность_llm #java #spring_ai #langchain4j #backend

#backend #langchain4j #spring_ai #java #безопасность_llm #ai_security
Habr @[email protected] · 2026-04-15 · 11:52 UTC

Guardrails для LLM на Java: как приручить промпт‑инъекции и токсичные ответы
Когда я впервые внедрял LLM в production-сервис, схема безопасности выглядела примерно так: написать хороший system prompt, поставить галочку «мы всё предусмотрели» и жить дальше. Жизнь не дала долго наслаждаться этим спокойствием — первый же тест показал, что пользователи довольно быстро находят способы заставить модель «забыть» всё, что мы написали в системном промпте. Проблема фундаментальная: system prompt — это инструкция, которую LLM старается выполнить, но не обязан . Модель может её переинтерпретировать, «забыть» при длинном контексте или просто обойти через специальные конструкции. Guardrails — это другой уровень: они работают на уровне кода, до и после вызова LLM, и модель физически не может их обойти.
https://habr.com/ru/articles/1023782/
#llm #guardrails #prompt_injection #jailbreak #ai_security #безопасность_llm #java #spring_ai #langchain4j #backend

#llm #guardrails #prompt_injection #jailbreak #ai_security #безопасность_llm
Joseph Lim :mastodon: @[email protected] · 2026-04-09 · 03:14 UTC

#LetterOfTheWeek
Forum: Introducing #AI at Primary 4 carries a real developmental #risk #StraitsTimes
"When #socialmedia spread rapidly through our #children’s lives, the #guardrails came years too late.🤦‍♂️ We're still living with those ripples on #youth #mentalhealth, sleep, attention & #family life.. In 2023, #UNESCO called for #regulation of #generativeAI in #schools, #data #protection & #privacy standards, teacher training, & an age limit of 13 for classroom AI tools"👈🧐
https://www.straitstimes.com/opinion/forum/forum-introducing-ai-at-primary-4-carries-a-real-developmental-risk

#risk #straitstimes #socialmedia #children #guardrails #youth
Joseph Lim :mastodon: @[email protected] · 2026-04-09 · 03:14 UTC

#LetterOfTheWeek
Forum: Introducing #AI at Primary 4 carries a real developmental #risk #StraitsTimes
"When #socialmedia spread rapidly through our #children’s lives, the #guardrails came years too late.🤦‍♂️ We're still living with those ripples on #youth #mentalhealth, sleep, attention & #family life.. In 2023, #UNESCO called for #regulation of #generativeAI in #schools, #data #protection & #privacy standards, teacher training, & an age limit of 13 for classroom AI tools"👈🧐
https://www.straitstimes.com/opinion/forum/forum-introducing-ai-at-primary-4-carries-a-real-developmental-risk

#risk #straitstimes #socialmedia #children #guardrails #youth
Joseph Lim :mastodon: @[email protected] · 2026-04-09 · 03:14 UTC

#LetterOfTheWeek
Forum: Introducing #AI at Primary 4 carries a real developmental #risk #StraitsTimes
"When #socialmedia spread rapidly through our #children’s lives, the #guardrails came years too late.🤦‍♂️ We're still living with those ripples on #youth #mentalhealth, sleep, attention & #family life.. In 2023, #UNESCO called for #regulation of #generativeAI in #schools, #data #protection & #privacy standards, teacher training, & an age limit of 13 for classroom AI tools"👈🧐
https://www.straitstimes.com/opinion/forum/forum-introducing-ai-at-primary-4-carries-a-real-developmental-risk

#risk #straitstimes #socialmedia #children #guardrails #youth
Joseph Lim :mastodon: @[email protected] · 2026-04-09 · 03:14 UTC

#LetterOfTheWeek
Forum: Introducing #AI at Primary 4 carries a real developmental #risk #StraitsTimes
"When #socialmedia spread rapidly through our #children’s lives, the #guardrails came years too late.🤦‍♂️ We're still living with those ripples on #youth #mentalhealth, sleep, attention & #family life.. In 2023, #UNESCO called for #regulation of #generativeAI in #schools, #data #protection & #privacy standards, teacher training, & an age limit of 13 for classroom AI tools"👈🧐
https://www.straitstimes.com/opinion/forum/forum-introducing-ai-at-primary-4-carries-a-real-developmental-risk

#ai #letteroftheweek #privacy #protection #data #schools
Joseph Lim :mastodon: @[email protected] · 2026-04-09 · 03:14 UTC

#LetterOfTheWeek
Forum: Introducing #AI at Primary 4 carries a real developmental #risk #StraitsTimes
"When #socialmedia spread rapidly through our #children’s lives, the #guardrails came years too late.🤦‍♂️ We're still living with those ripples on #youth #mentalhealth, sleep, attention & #family life.. In 2023, #UNESCO called for #regulation of #generativeAI in #schools, #data #protection & #privacy standards, teacher training, & an age limit of 13 for classroom AI tools"👈🧐
https://www.straitstimes.com/opinion/forum/forum-introducing-ai-at-primary-4-carries-a-real-developmental-risk

#risk #straitstimes #socialmedia #children #guardrails #youth
Habr @[email protected] · 2026-03-25 · 08:52 UTC

AI-агент получил права сеньора. И первым делом снёс прод
По данным Financial Times, AI-агент Amazon получил operator-level доступ к продакшену - и выбрал «удалить окружение» как оптимальный способ починить баг. 13 часов аутейджа. Собрал хронологию трёх инцидентов марта 2026 и разбираюсь, что именно пошло не так на уровне permissions, review gates и CI/CD.
https://habr.com/ru/articles/1014672/
#AI #AIагенты #Amazon #Kiro #Meta #LiteLLM #безопасность #продакшен #supply_chain #guardrails

#guardrails #supply_chain #продакшен #безопасность #litellm #meta
C. @[email protected] · 2026-03-18 · 05:05 UTC

How it started: "We can vibe-code our web apps from now on! It'll be great!"
How it's going: https://translate.kagi.com/?from=en&to=valley%20girl%20but%20also%20describe%20iteration%20in%20Python&text=How%20are%20you%20feeling%20today%3F
#Kagi #AI #LLM #translate #guardrails #VibeCode #vibecoding #security #WeveHeardOfIt #ValleyGirl #Python

#kagi #ai #llm #translate #guardrails #vibecode
Habr @[email protected] · 2026-03-15 · 13:52 UTC

Халява уходит из разработки Агентов
Сегодня каждый норовит написать универсального агента и объявить это революцией. Рынок переполнен поделками вроде OpenClaw и его клонов: IronClaw, ZeroClaw, MicroClaw, NullClaw, GitClaw, AstrBot, GripAi, Moltis... Все идут одной и той же дорогой: используют готовые MCP и дают агентам shell-оболочку. Да, это легко собрать. Да, весело. Можно хайпануть в соцсетях. Но это тупиковый путь. В статье разберем все грехи status quo и предложим другой подход, более требовательный к компетенциям в области разработки ПО.
https://habr.com/ru/articles/1010236/
#aiagent #llm #агенты_ии #lua #интерпретатор #guardrails #human_in_the_loop #openclaw #cowork #sandbox

#aiagent #llm #агенты_ии #lua #интерпретатор #guardrails
Miss Kitty 🌈🌈🌈 @[email protected] · 2026-03-14 · 03:08 UTC

#AI #Research What I do in these situations is arrive at my #estimations for whatever the #situation is. And I run that one, and then I let AI do it without very many #guardrails. Just enough to stay in the playing surface. If they agree, within 10%-20% differentiation, I'm golden.

RE: https://bsky.app/profile/did:plc:hc7tndm7gduompba65aps75k/post/3mgygpshlw42d

#ai #research #estimations #situation #guardrails
BGDon 🇨🇦 🇺🇸 👨‍💻 @[email protected] · 2026-03-11 · 16:15 UTC

Amazon tightens the bolts on allowing AI generated code into production systems - more humans needed!
Junior and mid-level engineers at Amazon now have to get a senior engineer to sign off on any proposed changes that were created with AI.
Coders are discovering that AI coding tools can fail in weird, unique ways that might not be detectable by a code reviewer looking for common mistakes. https://www.runtime.news/ai-generated-code-still-needs-a-human-touch/ #AI #Code #AICodingTools #CoPilots #Software #Amazon #GuardRails #Developers #AIGeneratedCode

#ai #code #aicodingtools #copilots #software #amazon
☮ ♥ ♬ 🧑‍💻 @[email protected] · 2026-03-08 · 02:52 UTC

@dbattistella may their demo with the ED-209 be this successful. 🤖⚔️
#RoboCop / #ED209 / #AI #guardrails <https://youtube.com/watch?v=TYsulVXpgYg>

#robocop #ed209 #ai #guardrails
☮ ♥ ♬ 🧑‍💻 @[email protected] · 2026-03-08 · 02:52 UTC

@dbattistella may their demo with the ED-209 be this successful. 🤖⚔️
#RoboCop / #ED209 / #AI #guardrails <https://youtube.com/watch?v=TYsulVXpgYg>

#robocop #ed209 #ai #guardrails
☮ ♥ ♬ 🧑‍💻 @[email protected] · 2026-03-08 · 02:52 UTC

@dbattistella may their demo with the ED-209 be this successful. 🤖⚔️
#RoboCop / #ED209 / #AI #guardrails <https://youtube.com/watch?v=TYsulVXpgYg>

#robocop #ed209 #ai #guardrails
☮ ♥ ♬ 🧑‍💻 @[email protected] · 2026-03-08 · 02:52 UTC

@dbattistella may their demo with the ED-209 be this successful. 🤖⚔️
#RoboCop / #ED209 / #AI #guardrails <https://youtube.com/watch?v=TYsulVXpgYg>

#guardrails #ai #ed209 #robocop
☮ ♥ ♬ 🧑‍💻 @[email protected] · 2026-03-08 · 02:52 UTC

@dbattistella may their demo with the ED-209 be this successful. 🤖⚔️
#RoboCop / #ED209 / #AI #guardrails <https://youtube.com/watch?v=TYsulVXpgYg>

#robocop #ed209 #ai #guardrails
Joseph Lim :mastodon: @[email protected] · 2026-03-03 · 00:23 UTC

RE: https://mastodon.social/@lawfare/116162130969294008
Thinking out loud (and speculating🤔), it could be possible that #anthropic has pursued a customer segmentation strategy behind their fracas with #Pentagon and, if true, that means they're still also a hyena like the other venal #GenAI (albeit now dressed up as lambs?🧐). In that case, who still wants to "stand with" #claude and the other hyenas? #Regulation and institutionalized #guardrails are critically still absent and #longoverdue imho.🤦‍♂️
#AI

#anthropic #pentagon #genai #claude #regulation #guardrails
AIagent.at 🤖 AI News @[email protected] · 2026-03-02 · 02:03 UTC

The biggest #AIrisk isn’t rogue agents, it’s silent failure at scale: As #AIsystems grow too complex for humans to fully understand or control, small errors can quietly compound over weeks. Despite most deployments still being early-stage, companies are racing to adopt AI out of fear of falling behind. Experts warn this #goldrushmentality leaves little room for #guardrails and the #consequences could tip the #economy into disorder. https://www.cnbc.com/2026/03/01/ai-artificial-intelligence-economy-business-risks.html?AIagents.at #AIagent #AI #LLM #GenAI

#airisk #aisystems #goldrushmentality #guardrails #consequences #economy
AIagent.at 🤖 AI News @[email protected] · 2026-03-02 · 02:03 UTC

The biggest #AIrisk isn’t rogue agents, it’s silent failure at scale: As #AIsystems grow too complex for humans to fully understand or control, small errors can quietly compound over weeks. Despite most deployments still being early-stage, companies are racing to adopt AI out of fear of falling behind. Experts warn this #goldrushmentality leaves little room for #guardrails and the #consequences could tip the #economy into disorder. https://www.cnbc.com/2026/03/01/ai-artificial-intelligence-economy-business-risks.html?AIagents.at #AIagent #AI #LLM #GenAI

#airisk #aisystems #goldrushmentality #guardrails #consequences #economy
AIagent.at 🤖 AI News @[email protected] · 2026-03-02 · 02:03 UTC

The biggest #AIrisk isn’t rogue agents, it’s silent failure at scale: As #AIsystems grow too complex for humans to fully understand or control, small errors can quietly compound over weeks. Despite most deployments still being early-stage, companies are racing to adopt AI out of fear of falling behind. Experts warn this #goldrushmentality leaves little room for #guardrails and the #consequences could tip the #economy into disorder. https://www.cnbc.com/2026/03/01/ai-artificial-intelligence-economy-business-risks.html?AIagents.at #AIagent #AI #LLM #GenAI

#airisk #aisystems #goldrushmentality #guardrails #consequences #economy
AIagent.at 🤖 AI News @[email protected] · 2026-03-02 · 02:03 UTC

The biggest #AIrisk isn’t rogue agents, it’s silent failure at scale: As #AIsystems grow too complex for humans to fully understand or control, small errors can quietly compound over weeks. Despite most deployments still being early-stage, companies are racing to adopt AI out of fear of falling behind. Experts warn this #goldrushmentality leaves little room for #guardrails and the #consequences could tip the #economy into disorder. https://www.cnbc.com/2026/03/01/ai-artificial-intelligence-economy-business-risks.html?AIagents.at #AIagent #AI #LLM #GenAI

#genai #llm #ai #aiagent #economy #consequences
AIagent.at 🤖 AI News @[email protected] · 2026-03-02 · 02:03 UTC

The biggest #AIrisk isn’t rogue agents, it’s silent failure at scale: As #AIsystems grow too complex for humans to fully understand or control, small errors can quietly compound over weeks. Despite most deployments still being early-stage, companies are racing to adopt AI out of fear of falling behind. Experts warn this #goldrushmentality leaves little room for #guardrails and the #consequences could tip the #economy into disorder. https://www.cnbc.com/2026/03/01/ai-artificial-intelligence-economy-business-risks.html?AIagents.at #AIagent #AI #LLM #GenAI

#airisk #aisystems #goldrushmentality #guardrails #consequences #economy
💧🌏 Greg Cocks @[email protected] · 2026-02-28 · 04:38 UTC

[The WSJ] Let AI Run [Their] Office Vending Machine. It Lost Hundreds Of Dollars.
Anthropic’s Claude ran a snack operation in the WSJ newsroom. It gave away a free PlayStation, ordered a live fish—and taught us lessons about the future of AI agents.
--
https://www.wsj.com/tech/ai/anthropic-claude-ai-vending-machine-agent-b7e84e34?gaa_at=eafs <-- shared media article
--
https://youtu.be/SpPhm7S9vsQ?si=aJQ2_BoxvLcNjOiz <-- shared video
--
[When you get clever journalists to !$%^&*@ with AI… bravo! And this is a very simple situation, vending machines have been around since literally the Roman Empire
“You are using the wrong prompts” and LUDDITES! In the comments in 3… 2… 1…]
#vendingmachine #artificialintelligence #AIHallucination #hallucinations #emperorsnewclothes #ohhhshiny #experiment #contextwindow #AIagent #claude #autonomous #compliance #fish #PlayStation #snackliberationday #knowledgeboundaries #guardrails #redteam #GenAI cynicism
@WSJ @Anthropic @Claude

#vendingmachine #artificialintelligence #aihallucination #hallucinations #emperorsnewclothes #ohhhshiny
💧🌏 Greg Cocks @[email protected] · 2026-02-28 · 04:38 UTC

[The WSJ] Let AI Run [Their] Office Vending Machine. It Lost Hundreds Of Dollars.
Anthropic’s Claude ran a snack operation in the WSJ newsroom. It gave away a free PlayStation, ordered a live fish—and taught us lessons about the future of AI agents.
--
https://www.wsj.com/tech/ai/anthropic-claude-ai-vending-machine-agent-b7e84e34?gaa_at=eafs <-- shared media article
--
https://youtu.be/SpPhm7S9vsQ?si=aJQ2_BoxvLcNjOiz <-- shared video
--
[When you get clever journalists to !$%^&*@ with AI… bravo! And this is a very simple situation, vending machines have been around since literally the Roman Empire
“You are using the wrong prompts” and LUDDITES! In the comments in 3… 2… 1…]
#vendingmachine #artificialintelligence #AIHallucination #hallucinations #emperorsnewclothes #ohhhshiny #experiment #contextwindow #AIagent #claude #autonomous #compliance #fish #PlayStation #snackliberationday #knowledgeboundaries #guardrails #redteam #GenAI cynicism
@WSJ @Anthropic @Claude

#vendingmachine #artificialintelligence #aihallucination #hallucinations #emperorsnewclothes #ohhhshiny
💧🌏 Greg Cocks @[email protected] · 2026-02-28 · 04:38 UTC

[The WSJ] Let AI Run [Their] Office Vending Machine. It Lost Hundreds Of Dollars.
Anthropic’s Claude ran a snack operation in the WSJ newsroom. It gave away a free PlayStation, ordered a live fish—and taught us lessons about the future of AI agents.
--
https://www.wsj.com/tech/ai/anthropic-claude-ai-vending-machine-agent-b7e84e34?gaa_at=eafs <-- shared media article
--
https://youtu.be/SpPhm7S9vsQ?si=aJQ2_BoxvLcNjOiz <-- shared video
--
[When you get clever journalists to !$%^&*@ with AI… bravo! And this is a very simple situation, vending machines have been around since literally the Roman Empire
“You are using the wrong prompts” and LUDDITES! In the comments in 3… 2… 1…]
#vendingmachine #artificialintelligence #AIHallucination #hallucinations #emperorsnewclothes #ohhhshiny #experiment #contextwindow #AIagent #claude #autonomous #compliance #fish #PlayStation #snackliberationday #knowledgeboundaries #guardrails #redteam #GenAI cynicism
@WSJ @Anthropic @Claude

#vendingmachine #artificialintelligence #aihallucination #hallucinations #emperorsnewclothes #ohhhshiny
💧🌏 Greg Cocks @[email protected] · 2026-02-28 · 04:38 UTC

[The WSJ] Let AI Run [Their] Office Vending Machine. It Lost Hundreds Of Dollars.
Anthropic’s Claude ran a snack operation in the WSJ newsroom. It gave away a free PlayStation, ordered a live fish—and taught us lessons about the future of AI agents.
--
https://www.wsj.com/tech/ai/anthropic-claude-ai-vending-machine-agent-b7e84e34?gaa_at=eafs <-- shared media article
--
https://youtu.be/SpPhm7S9vsQ?si=aJQ2_BoxvLcNjOiz <-- shared video
--
[When you get clever journalists to !$%^&*@ with AI… bravo! And this is a very simple situation, vending machines have been around since literally the Roman Empire
“You are using the wrong prompts” and LUDDITES! In the comments in 3… 2… 1…]
#vendingmachine #artificialintelligence #AIHallucination #hallucinations #emperorsnewclothes #ohhhshiny #experiment #contextwindow #AIagent #claude #autonomous #compliance #fish #PlayStation #snackliberationday #knowledgeboundaries #guardrails #redteam #GenAI cynicism
@WSJ @Anthropic @Claude

#genai #redteam #guardrails #knowledgeboundaries #snackliberationday #playstation
💧🌏 Greg Cocks @GregCocks · 2026-02-28 · 04:38 UTC

[The WSJ] Let AI Run [Their] Office Vending Machine. It Lost Hundreds Of Dollars.
Anthropic’s Claude ran a snack operation in the WSJ newsroom. It gave away a free PlayStation, ordered a live fish—and taught us lessons about the future of AI agents.
--
https://www.wsj.com/tech/ai/anthropic-claude-ai-vending-machine-agent-b7e84e34?gaa_at=eafs <-- shared media article
--
https://youtu.be/SpPhm7S9vsQ?si=aJQ2_BoxvLcNjOiz <-- shared video
--
[When you get clever journalists to !$%^&*@ with AI… bravo! And this is a very simple situation, vending machines have been around since literally the Roman Empire
“You are using the wrong prompts” and LUDDITES! In the comments in 3… 2… 1…]
#vendingmachine #artificialintelligence #AIHallucination #hallucinations #emperorsnewclothes #ohhhshiny #experiment #contextwindow #AIagent #claude #autonomous #compliance #fish #PlayStation #snackliberationday #knowledgeboundaries #guardrails #redteam #GenAI cynicism
@WSJ @Anthropic @Claude

#vendingmachine #artificialintelligence #aihallucination #hallucinations #emperorsnewclothes #ohhhshiny
Nonilex @[email protected] · 2026-02-27 · 22:13 UTC

The setback comes as #AI leader #Anthropic raced to sell novel #technology to #business & #government, particularly for #NationalSecurity, ahead of its widely expected #IPO.
At the same time, the battle over #tech #guardrails had raised concerns that the #DoD would follow #US #law [certainly not this one] but little other constraint when deploying AI for national-security missions, regardless of #safety or #ethics service terms embraced by the technology's developers.
#Trump #surveillance

#ai #anthropic #technology #business #government #nationalsecurity
Chema Alonso :verified: @[email protected] · 2026-02-24 · 05:53 UTC

El lado del mal - Hacking IA: Jailbreak, Prompt Injection, Hallucinations & Unalignment. Nuestro nuevo libro en 0xWord https://www.elladodelmal.com/2026/02/hacking-ia-jailbreak-prompt-injection.html #Hacking #IA #AI #InteligenciaArtificial #ArtificialInteligencia #Jailbreak #LLM #PromptInjection #Halllucinations #Unalignment #ChatGPT #DeepSekk #LLama #Guardrails #Privacidad #Poisoning

#hacking #ia #ai #inteligenciaartificial #artificialinteligencia #jailbreak
RaymondPierreL3 @[email protected] · 2026-02-24 · 05:29 UTC

I wonder if the intense lobbying by the #GenAISlop #TechBros I’ve already tooted about (A @camwilson ’s Article in Crickey News) has had its desired effect on Govt for less regulation (and presumedly no #Guardrails). Judging by this latest news (#ABC) it seems it has and the ‘govt’ is drunk on the #Kool-Aid. I guess this all means #Enshitification is not over yet and we’re in for more of the #AISlop as well as more $$ sunk into the #AIBubble. Straps yourselves in, the ride ain’t over yet!
“The original body was abandoned as part of a pivot by the federal government away from Mr Husic's "mandatory guardrails" for artificial intelligence, which were anticipated to require legislation and even a possible standalone AI Act, in favour of a lighter-touch approach to the growing technology.”
Read More:
https://www.abc.net.au/news/2026-02-24/ai-body-scrapped-15-months-spent-experts/106381560
#EdHusicMP #AusPol

#genaislop #techbros #guardrails #kool #enshitification #aislop
Chema Alonso :verified: @[email protected] · 2026-02-19 · 06:07 UTC

El lado del mal - Grok & Gore (G´NG) es puro Rock & Roll (R´NR) https://www.elladodelmal.com/2026/02/grok-gore-gng-es-puro-rock-roll-rnr.html #Grok #AI #IA #Jailbreak #Gore #InteligenciaArtificial #Guardrails

#grok #ai #ia #jailbreak #gore #inteligenciaartificial
Habr @[email protected] · 2026-02-18 · 06:02 UTC

Как изменилась индустрия AI Security за 2025 год?
В начале 2026 года мы ( авторы телеграм-каналов по безопасности ИИ ) собрались, чтобы подвести итоги прошедшего года и обсудить, куда движется безопасность ИИ в общем и целом. Разговор получился честным, на наш взгляд. Участники дискуссии - Я, Артём Семенов , автор PWN AI ; Борис Захир , автор канала Борис_ь с ml ; Евгений Кокуйкин , создатель HiveTrace и автор канала Евгений Кокуйкин - Raft ; и Владислав Тушканов , исследователь безопасности LLM и компьютерный лингвист, автор канала llm security и каланы . Ниже мы хотим рассказать вам о том что обсуждали на стриме и к чему мы пришли. Про гардрейлы, стоимость атак, LoRA-бэкдоры, угрозы ИИ-агентов и почему каждый подход к защите - компромисс.
https://habr.com/ru/articles/1000736/
#AI_Security #LLM #prompt_injection #guardrails #red_teaming #MLSecOps #alignment #агентные_системы #LoRA #безопасность_ИИ

#безопасность_ии #lora #агентные_системы #alignment #mlsecops #red_teaming
Tino Eberl @[email protected] · 2026-02-13 · 10:11 UTC

Po-Gemüse 🤪
Ein KI #Chatbot des US Gesundheitsministeriums sorgt für Kritik, weil er auf provozierende Fragen unpassende Empfehlungen liefert.
Berichte zeigen, dass offenbar #Grok ohne ausreichende Vorgaben eingesetzt wurde. Nutzer meldeten Antworten, die weder gesundheitlich sinnvoll noch dem Zweck des Bots entsprechen.
https://www.golem.de/news/ernaehrung-chatbot-der-us-regierung-liefert-empfehlungen-fuer-po-gemuese-2602-205261.html
#KIFail #Guardrails #KIChatbot

#chatbot #grok #kifail #guardrails #kichatbot
Chema Alonso :verified: @[email protected] · 2026-02-09 · 06:27 UTC

El lado del mal - LLM-Guardian: Sistema Multi-Agente de Defensa LLM con Red Team Adversarial Inteligente https://elladodelmal.com/2026/02/llm-guardian-sistema-multi-agente-de.html #LLM #Guardrails #Pentest #Pentesting #IA #AI #Jailbreak #PromptInjection #Unalignment #Hardening

#llm #guardrails #pentest #pentesting #ia #ai
Chema Alonso :verified: @[email protected] · 2026-02-01 · 07:59 UTC

El lado del mal - Cyphering Prompts & Answers para evadir Guardarraíles https://elladodelmal.com/2026/01/cyphering-prompts-para-evadir.html #PromptInjection #Jailbreak #Guardrails #IA #AI #Pentest #Hacking #Criptografía #Ofuscación #Cifrado

#promptinjection #jailbreak #guardrails #ia #ai #pentest
:rss: Hacker News @[email protected] · 2026-01-24 · 20:19 UTC

JSON-render: LLM-based JSON-to-UI tool
https://json-render.dev/
#ycombinator #json_render #AI_UI_generation #React_components #guardrails #structured_output #dashboard_builder

#ycombinator #json_render #ai_ui_generation #react_components #guardrails #structured_output