Toutes les catégories

IA

688 articles

Spotting and Avoiding ROT in Your Agentic AI

Spotting and Avoiding ROT in Your Agentic AI

The following article originally appeared on Q McCallum’s blog and is being republished here with the author’s permission. Generative AI agents and rogue traders pose similar insider threats to their employers. Specifically, we can expect companies to deploy agentic AI with broad reach and insufficient oversight. That creates the conditions for a particular flavor of […]

25 mars 2026

O'Reilly Radar — AI/ML

Auto mode for Claude Code

IA Programmation

Auto mode for Claude Code

Auto mode for Claude Code Really interesting new development in Claude Code today as an alternative to --dangerously-skip-permissions: Today, we're introducing auto mode, a new permissions mode in Claude Code where Claude makes permission decisions on your behalf, with safeguards monitoring actions before they run. Those safeguards appear to be implemented using Claude Sonnet 4.6, as described in the documentation: Before each action runs, a separate classifier model reviews the conversation…

24 mars 2026

Simon Willison's Weblog

How to Build an OS for Your AI Workforce?

How to Build an OS for Your AI Workforce?

An OS layer to manage a fleet of AI agents!

24 mars 2026

Daily Dose of Data Science

Package Managers Need to Cool Down

IA Programmation

Package Managers Need to Cool Down

Package Managers Need to Cool Down Today's LiteLLM supply chain attack inspired me to revisit the idea of dependency cooldowns, the practice of only installing updated dependencies once they've been out in the wild for a few days to give the community a chance to spot if they've been subverted in some way. This recent piece (March 4th) piece by Andrew Nesbitt reviews the current state of dependency cooldown mechanisms across different packaging tools. It's surprisingly well supported! There's…

24 mars 2026

Simon Willison's Weblog

Quoting Christopher Mims

IA Programmation

Quoting Christopher Mims

I really think "give AI total control of my computer and therefore my entire life" is going to look so foolish in retrospect that everyone who went for this is going to look as dumb as Jimmy Fallon holding up a picture of his Bored Ape — Christopher Mims, Technology columnist at The Wall Street Journal Tags: ai, security

24 mars 2026

Simon Willison's Weblog

Malicious litellm_init.pth in litellm 1.82.8 — credential stealer

IA Programmation

Malicious litellm_init.pth in litellm 1.82.8 — credential stealer

Malicious litellm_init.pth in litellm 1.82.8 — credential stealer The LiteLLM v1.82.8 package published to PyPI was compromised with a particularly nasty credential stealer hidden in base64 in a litellm_init.pth file, which means installing the package is enough to trigger it even without running import litellm. (1.82.7 had the exploit as well but it was in the proxy/proxy_server.py file so the package had to be imported for it to take effect.) This issue has a very detailed description of what…

24 mars 2026

Simon Willison's Weblog

Book Review: Open Socrates (Part 2)

Book Review: Open Socrates (Part 2)

Yesterday I posted Part 1. Read that first. This is Part 2 of 2.

24 mars 2026

Don't Worry About the Vase

How to Build a General-Purpose AI Agent in 131 Lines of Python

How to Build a General-Purpose AI Agent in 131 Lines of Python

The following article originally appeared on Hugo Bowne-Anderson’s newsletter, Vanishing Gradients, and is being republished here with the author’s permission. In this post, we’ll build two AI agents from scratch in Python. One will be a coding agent, the other a search agent. Why have I called this post “How to Build a General-Purpose AI […]

24 mars 2026

O'Reilly Radar — AI/ML

Streaming experts

IA Programmation

Streaming experts

I wrote about Dan Woods' experiments with streaming experts the other day, the trick where you run larger Mixture-of-Experts models on hardware that doesn't have enough RAM to fit the entire model by instead streaming the necessary expert weights from SSD for each token that you process. Five days ago Dan was running Qwen3.5-397B-A17B in 48GB of RAM. Today @seikixtc reported running the colossal Kimi K2.5 - a 1 trillion parameter model with 32B active weights at any one time, in 96GB of RAM on…

24 mars 2026

Simon Willison's Weblog

Quoting Neurotica

IA Programmation

Quoting Neurotica

slop is something that takes more human effort to consume than it took to produce. When my coworker sends me raw Gemini output he’s not expressing his freedom to create, he’s disrespecting the value of my time — Neurotica, @schwarzgerat.bsky.social Tags: ai-ethics, slop, generative-ai, ai, llms

23 mars 2026

Simon Willison's Weblog

datasette-files 0.1a2

IA Programmation

datasette-files 0.1a2

Release: datasette-files 0.1a2 The most interesting alpha of datasette-files yet, a new plugin which adds the ability to upload files directly into a Datasette instance. Here are the release notes in full: Columns are now configured using the new column_types system from Datasette 1.0a26. #8 New file_actions plugin hook, plus ability to import an uploaded CSV/TSV file to a table. #10 UI for uploading multiple files at once via the new documented JSON upload API. #11 Thumbnails are now generated…

23 mars 2026

Simon Willison's Weblog

Anatomy of the .claude/ Folder

Anatomy of the .claude/ Folder

A complete guide to CLAUDE.md, custom commands, skills, agents, and permissions, and how to set them up properly.

23 mars 2026

Daily Dose of Data Science

Quoting David Abram

IA Programmation

Quoting David Abram

I have been doing this for years, and the hardest parts of the job were never about typing out code. I have always struggled most with understanding systems, debugging things that made no sense, designing architectures that wouldn't collapse under heavy load, and making decisions that would save months of pain later. None of these problems can be solved LLMs. They can suggest code, help with boilerplate, sometimes can act as a sounding board. But they don't understand the system, they don't…

23 mars 2026

Simon Willison's Weblog

Book Review: Open Socrates (Part 1)

Book Review: Open Socrates (Part 1)

These are all important, in their own way, call it a treasure hunt and collect them all…

23 mars 2026

Don't Worry About the Vase

Import AI 450: China's electronic warfare model; traumatized LLMs; and a scaling law for cyberattacks

Import AI 450: China's electronic warfare model; traumatized LLMs; and a scaling law for cyberattacks

How will timeless minds value time?

23 mars 2026