Topic: Agent Security

27 articles in this topic.

This topic page curates research-focused writing on Agent Security, with an emphasis on practical security implications, reproducible observations, and implementation-aware takeaways. Instead of isolated summaries, the collection is organized to help you connect attack techniques, defensive controls, and evaluation criteria across multiple papers and project write-ups.

Across 27 articles, this cluster highlights how Agent Security appears in real workflows and where teams commonly miss risk boundaries. The coverage includes news digest, trend report, paper review, research paper and connects this theme with adjacent areas such as LLM Security, Adversarial ML, AI Safety, so you can move from conceptual understanding to deployable engineering decisions.

This page is maintained as a high-signal index for Agent Security. Use it to follow newer articles first, then branch into adjacent topics and defensive patterns that repeatedly appear across projects and paper reviews.

What You Will Find Here

Related directions: LLM Security, Adversarial ML, AI Safety.
Start with: AI Security Digest — May 31, 2026 and This Week in AI Security — May 31, 2026.
Use this page as a hub for internal links when publishing future posts in the same area.

AI Security Digest — May 31, 2026

A digest covering the first in-the-wild LLM agent attacks, focusing on RAG pipeline injection and multi-agent system jailbreaks.

2026-05-31·News Digest·4 min readLLM SecurityRAG SecurityAgent SecurityAdversarial ML

This Week in AI Security — May 31, 2026

A weekly roundup of AI security research focusing on the shift from static defenses to dynamic runtime containment for autonomous agents.

2026-05-31·Trend Report·5 min readLLM SecurityRAG SecurityAgent SecurityAdversarial ML

AI Security Digest — May 30, 2026

This digest covers major advancements in AI safety, including OpenAI's biodefense efforts and Arm's defensive automation. It also details new research on memory poisoning and prompt fragility in LLMs.

2026-05-30·News Digest·5 min readLLM SecurityAgent SecurityAI SafetyAdversarial ML

AI Security Digest — May 27, 2026

The speed of AI exploitation is accelerating, demanding a shift to real-time verification. This digest covers malware poisoning, semantic validation of PE tools, and agentic AI attack vectors.

2026-05-27·News Digest·5 min readLLM SecurityAgent SecurityData PoisoningAdversarial ML

This Week in AI Security — May 24, 2026

This week, the AI security research community signaled a decisive pivot from static, prompt-response safety paradigms to the volatile, high-stakes realm of agentic autonomy and complex system integrat

2026-05-24·Trend Report·12 min readLLM SecurityRAG SecurityAgent SecurityData PoisoningAdversarial ML

AI Security Digest — May 18, 2026

The security boundary of generative AI has definitively shifted from stateless prompt-engineering vulnerabilities to structural and temporal exploits within multi-agent orchestration architectures. Th

2026-05-18·News Digest·11 min readLLM SecurityAgent SecurityAI SafetyAdversarial ML

AI Security Digest — May 07, 2026

The rapid paradigm shift from stateless, single-turn Large Language Model (LLM) prompts to stateful, multi-step autonomous agentic workflows has rendered traditional boundary-based and per-turn securi

2026-05-07·News Digest·9 min readLLM SecurityAgent SecurityAdversarial ML

AI Agent Traps: When the Environment Becomes the Attacker

Franklin et al. (DeepMind, SSRN 2026) introduce a taxonomy of 'AI agent traps'—adversarial content embedded in digital environments to misdirect, deceive, or exploit autonomous agents. We walk through six classes of traps spanning perception, reasoning, memory, action, multi-agent dynamics, and human oversight.

2026-05-04·Paper Review·11 min readLLM SecurityAgent SecurityAdversarial ML

AI Security Digest — April 22, 2026

The unifying theme of this week's AI security landscape is the critical transition from superficial, syntax-level filtering to deep, state-aware behavioral defenses across both agentic workflows and s

2026-04-22·News Digest·11 min readLLM SecurityRAG SecurityAgent SecurityData PoisoningAI SafetyCode Security

AI Security Digest — April 21, 2026

The dominant security theme today is the structural breakdown of boundaries between reasoning engines and executive environments, transitioning the primary threat vector from semantic prompt manipulat

2026-04-21·News Digest·10 min readLLM SecurityRAG SecurityAgent SecurityAI SafetyPrivacyCode SecurityWatermarkingDeepfakes & Biometrics

AI Security Digest — April 20, 2026

The systematic scaling of automated, AI-driven vulnerability discovery has triggered a structural crisis in legacy patch-management frameworks, as evidenced by the 263% surge in CVEs forcing an overha

2026-04-20·News Digest·6 min readLLM SecurityAgent SecurityAI SafetyPrivacyCode SecurityInfrastructure Security

This Week in AI Security — April 19, 2026

The dominant theme this week is the decisive transition from isolated 'model-centric' security toward systemic, hardware-software co-designed infrastructure integrity. As enterprise AI deployments sca

2026-04-19·Trend Report·8 min readLLM SecurityAgent SecurityAI SafetyAdversarial MLWatermarkingInfrastructure Security

AI Security Digest — April 18, 2026

As autonomous agentic systems and multi-modal models increasingly bypass static guardrails, the core paradigm of AI security is shifting from superficial post-hoc input/output filtering to deep, execu

2026-04-18·News Digest·12 min readLLM SecurityAgent SecurityData PoisoningAdversarial MLWatermarkingInfrastructure Security

Security of Autonomous AI Agents: Trust Boundary-Based Attack Surface Analysis and Trends

A trust-boundary framework for autonomous AI agent security: six attack surfaces, the shift from output safety to behavioral safety, and the open research agenda.

2026-04-15·Research Paper·13 min readLLM SecurityAgent Security

AI Security Digest — April 12, 2026

The dominant theme this week is the collapse of static, text-centric alignment barriers as multimodal models and autonomous agents merge to create highly dynamic execution-level security risks. As dem

2026-04-12·News Digest·6 min readAgent SecurityAI SafetyAdversarial ML

This Week in AI Security — April 12, 2026

This week’s threat landscape signals a structural shift from transient text-based 'jailbreaks' toward the systematic exploitation of autonomous agent execution layers, specifically targeting Model Con

2026-04-12·Trend Report·8 min readLLM SecurityAgent SecurityAdversarial ML

AI Security Digest — April 11, 2026

The single dominant theme in this week’s landscape is the systemic collapse of static, input-boundary defense paradigms as adversarial exploits pivot to dynamic, multi-agent cascading injections and v

2026-04-11·News Digest·13 min readLLM SecurityRAG SecurityAgent SecurityAdversarial ML

AI Security Digest — April 10, 2026

Today’s intelligence briefing highlights a critical inflection point in AI security: the formal invalidation of boundary-based sanitization as systems transition to active, kinetic physical execution.

2026-04-10·News Digest·11 min readLLM SecurityAgent SecurityAI SafetyAdversarial MLInfrastructure Security

AI Security Digest — April 07, 2026

The current AI security landscape is defined by a critical architectural shift: as autonomous agent ecosystems transition from stateless chat interfaces to persistent, multi-tool environments, the tra

2026-04-07·News Digest·8 min readLLM SecurityRAG SecurityAgent SecurityData PoisoningInfrastructure Security

AI Security Digest — April 05, 2026

The transition of Large Language Models (LLMs) from static chat interfaces to autonomous, multi-agent frameworks has transformed the AI threat landscape, rendering standard input-filtering guardrails

2026-04-05·News Digest·9 min readLLM SecurityRAG SecurityAgent SecurityAdversarial MLInfrastructure Security

This Week in AI Security — April 05, 2026

The primary security trajectory this week marks a decisive transition away from localized prompt injection toward systemic, stateful exploitation of autonomous, multi-agent architectures. As artificia

2026-04-05·Trend Report·9 min readLLM SecurityAgent SecurityData PoisoningAI SafetyInfrastructure Security

AI Security Digest — April 03, 2026

The enterprise security landscape is undergoing a critical transition as defensive architectures pivot from token-level static guardrails to countering complex, goal-directed agentic exploits. Emergin

2026-04-03·News Digest·11 min readLLM SecurityAgent SecurityAI SafetyAdversarial ML

AI Security Digest — April 01, 2026

The dominant theme this week is the structural vulnerability of agentic integrations that decouple security policies from real-time execution state, leaving enterprise pipelines highly vulnerable to c

2026-04-01·News Digest·14 min readLLM SecurityAgent SecurityAI SafetyAdversarial ML

AI Security Digest — March 31, 2026

The AI security landscape has reached a critical inflection point, shifting from reactive output filtering to deep-stack defense across intermediate reasoning layers (Chain-of-Thought) and physical ex

2026-03-31·News Digest·12 min readLLM SecurityAgent SecurityAI SafetyInfrastructure Security

AI Security Digest — March 29, 2026

The dominant theme in AI security is the operational crisis emerging from the rapid transition of large language models (LLMs) from passive information-retrieval engines to active, high-privileged age

2026-03-29·News Digest·5 min readLLM SecurityAgent SecurityData PoisoningCode SecurityInfrastructure Security

Bridging Models and Agents: Protocol Architectures and Security in MCP & A2A

We analyze the architectures and security models of Model Context Protocol (MCP) and Agent-to-Agent (A2A) protocol, uncovering attack vectors and proposing mitigations for secure multi-agent AI systems.

2026-03-18·Research Paper·9 min readLLM SecurityAgent Security

AgentFuzz: Automatic Detection of Taint-Style Vulnerabilities in LLM-based Agents

An analysis of AgentFuzz, a novel fuzzing framework that automatically detects taint-style vulnerabilities in LLM-based agents through LLM-assisted seed generation, feedback-driven scheduling, and sink-guided mutation.

2025-09-11·Paper Review·11 min readAgent SecurityCode Security

Related Topics