AI — Annonces des labos d'IA - Page 31

/

Introducing Mistral Small 4

16 mars 2026

Systematic debugging for AI agents: Introducing the AgentRx framework

As AI agents transition from simple chatbots to autonomous systems capable of managing cloud incidents, navigating complex web interfaces, and executing multi-step API workflows, a new challenge has emerged: transparency. When a human makes a mistake, we can usually trace the logic. But when an AI agent fails, perhaps by hallucinating a tool output or […] The post Systematic debugging for AI agents: Introducing the AgentRx framework appeared first on Microsoft Research.

12 mars 2026

Microsoft Research

Lire

Anthropic invests $100 million into the Claude Partner Network

12 mars 2026

Anthropic News

Lire

Designing AI agents to resist prompt injection

How ChatGPT defends against prompt injection and social engineering by constraining risky actions and protecting sensitive data in agent workflows.

11 mars 2026

OpenAI

Lire

From model to agent: Equipping the Responses API with a computer environment

How OpenAI built an agent runtime using the Responses API, shell tool, and hosted containers to run secure, scalable agents with files, tools, and state.

11 mars 2026

OpenAI

Lire

Wayfair boosts catalog accuracy and support speed with OpenAI

Wayfair uses OpenAI models to improve ecommerce support and product catalog accuracy, automating ticket triage and enhancing millions of product attributes at scale.

11 mars 2026

OpenAI

Lire

Rakuten fixes issues twice as fast with Codex

11 mars 2026

OpenAI

Lire

Introducing The Anthropic Institute

11 mars 2026

Anthropic News

Lire

Rails testing on autopilot: Building an agent that writes what developers won't

11 mars 2026

Mistral AI

Lire

Improving instruction hierarchy in frontier LLMs

IH-Challenge trains models to prioritize trusted instructions, improving instruction hierarchy, safety steerability, and resistance to prompt injection attacks.

10 mars 2026

OpenAI

Lire