Hugging Face

115 articles, RSS feed

AI

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

14 May 2026

Unlocking asynchronicity in continuous batching

14 May 2026

Building Blocks for Foundation Model Training and Inference on AWS

11 May 2026

MachinaCheck: Building a Multi-Agent CNC Manufacturability System on AMD MI300X

10 May 2026

"OncoAgent: A Dual-Tier Multi-Agent Framework for Privacy-Preserving Oncology Clinical Decision Support"

9 May 2026

CyberSecQwen-4B: Why Defensive Cyber Needs Small, Specialized, Locally-Runnable Models

8 May 2026

EMO: Pretraining mixture of experts for emergent modularity

8 May 2026

vLLM V0 to V1: Correctness Before Corrections in RL

6 May 2026

Adding Benchmaxxer Repellant to the Open ASR Leaderboard

6 May 2026

AI evals are becoming the new compute bottleneck

29 April 2026

Granite 4.1 LLMs: How They’re Built

29 April 2026

DeepInfra on Hugging Face Inference Providers 🔥

29 April 2026

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

28 April 2026

How to build scalable web apps with OpenAI's Privacy Filter

27 April 2026

DeepSeek-V4: a million-token context that agents can actually use

24 April 2026