Topic: Privacy

6 articles in this topic.

Privacy risks in modern AI systems are not limited to obvious data leaks. They also appear through indirect channels such as membership inference, memorization extraction, and retrieval traces that expose sensitive context.

These articles analyze how privacy leakage happens in practice and what engineering controls reduce exposure. You will find discussions on threat modeling, evaluation methods, and lightweight safeguards that can be integrated into existing model and RAG deployments.

This page is maintained as a high-signal index for Privacy. Use it to follow newer articles first, then branch into adjacent topics and defensive patterns that repeatedly appear across projects and paper reviews.

Related Topics

LLM Security AI Safety RAG Security

What You Will Find Here

Related directions: LLM Security, AI Safety, RAG Security.
Start with: AI Security Digest — April 21, 2026 and AI Security Digest — April 20, 2026.
Use this page as a hub for internal links when publishing future posts in the same area.

AI Security Digest — April 21, 2026

The dominant security theme today is the structural breakdown of boundaries between reasoning engines and executive environments, transitioning the primary threat vector from semantic prompt manipulat

2026-04-21·News Digest·10 min readLLM SecurityRAG SecurityAgent SecurityAI SafetyPrivacyCode SecurityWatermarkingDeepfakes & Biometrics

AI Security Digest — April 20, 2026

The systematic scaling of automated, AI-driven vulnerability discovery has triggered a structural crisis in legacy patch-management frameworks, as evidenced by the 263% surge in CVEs forcing an overha

2026-04-20·News Digest·6 min readLLM SecurityAgent SecurityAI SafetyPrivacyCode SecurityInfrastructure Security

An Information Theoretic Approach to Machine Unlearning

A novel zero-shot machine unlearning method using information theory and curvature analysis, enabling efficient removal of data influence without requiring access to the retain set.

2025-07-23·Paper Review·9 min readLLM SecurityAI SafetyPrivacy

Machine Unlearning for LLMs: Foundations and the AltPO Approach

An introduction to machine unlearning in Large Language Models, covering the TOFU benchmark, various unlearning methods (GradDiff, NPO, IdkPO, AltPO), and the challenges of maintaining model utility while forgetting specific knowledge.

2025-04-09·Paper Review·7 min readLLM SecurityAI SafetyPrivacy

Membership Inference Attacks on Retrieval-Augmented Generation: A Comprehensive Survey

A comprehensive analysis of membership inference attacks against RAG systems, examining three state-of-the-art approaches: RAG-MIA, S²MIA, and MBA, along with their defenses and limitations.

2025-03-09·Paper Review·7 min readLLM SecurityRAG SecurityPrivacy

Teach LLMs to Phish: Stealing Private Information from Language Models

An analysis of neural phishing attacks that teach LLMs to memorize and leak private information by inserting benign-appearing poison data during pretraining, achieving up to 90% secret extraction rates.

2024-11-20·Paper Review·9 min readLLM SecurityData PoisoningPrivacy