Steganography Without Modification: Hidden Communication via LLM Seeds

The study reveals a steganographic channel leveraging the inherent properties of LLM inference stacks: secret information is encoded via PRNG seeds, and receivers can reconstruct probability intervals from generated text to recover the seed. A 100% recovery rate is achievable within 300 tokens under known prompt settings.

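Steganography, LLM security, pseudorandom number generator, covert communication, deterministic decoding, security vulnerability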

Published 2026-06-08 15:32 | Recent activity 2026-06-09 11:54 | Estimated read 6 min

Section 01

Introduction: LLM Seed Steganography—Hidden Communication Without Modification

Key Findings: The study identifies a steganographic channel built on inherent properties of LLM inference stacks. The sender encodes secret information in the PRNG seed; the receiver reconstructs token-level probability intervals from the generated text and uses them to recover the seed. When the prompt is known, recovery reaches 100% within 300 tokens. The channel requires no changes to model weights, sampling code, or output distributions, so even a standard LLM service could potentially be used for hidden communication.

Section 02

Background: Inherent Steganographic Channels Exist in LLM Inference Stacks

Original Authors and Source

Original Authors: Paper research team
Source Platform: arXiv
Original Title: Steganography Without Modification: Hidden Communication via LLM Seeds
Original Link: http://arxiv.org/abs/2606.09135v1
Publication Date: June 8, 2026

Security Alert

Widely deployed LLM inference stacks contain an inherent steganographic channel that can be exploited without modifying model weights, sampling code, or output distributions. Standard LLM services may therefore be usable for hidden communication.

Section 03

Technical Principles and Operational Modes

Core Principles

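The channel exploits a structural feature of deterministic decoding. In inverse transform sampling, the PRNG supplies one uniform draw per token, and the sampler emits the token whose cumulative-probability interval contains that draw. The sequence of token-level probability intervals hit by the draws therefore depends on the seed, and it can be reconstructed from the generated text.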
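A minimal sketch of the sampling step described here, in Python. The four-token distribution, the seed value, and the helper names `token_intervals` and `sample_token` are illustrative assumptions, and `random.Random` is only a stand-in for whatever PRNG a real inference stack uses.

```python
import random

def token_intervals(probs):
    """Map each token index to its [low, high) slice of the unit interval."""
    intervals, low = [], 0.0
    for p in probs:
        intervals.append((low, low + p))
        low += p
    return intervals

def sample_token(probs, u):
    """Inverse transform sampling: pick the token whose interval contains u."""
    for token, (low, high) in enumerate(token_intervals(probs)):
        if low <= u < high:
            return token
    return len(probs) - 1  # guard against floating-point round-off

# Toy next-token distribution over a 4-token vocabulary (made-up numbers).
probs = [0.5, 0.25, 0.125, 0.125]

rng = random.Random(1234)  # the seed fixes the entire stream of uniform draws
draws = [rng.random() for _ in range(5)]
tokens = [sample_token(probs, u) for u in draws]

# Anyone who knows the distribution can rebuild the interval that each
# observed token occupied, without ever seeing the draws themselves.
observed = [token_intervals(probs)[t] for t in tokens]
```

Each entry of `observed` is a constraint on the hidden draw at that position. A candidate seed is consistent with the text only if its stream satisfies every constraint.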

Encoding and Decoding Process

Sender: Encode secret information into a PRNG seed, then generate text using standard sampling with this seed.
Receiver: Reconstruct probability intervals from the text, exhaustively search the seed space to recover the seed and extract the hidden payload.
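The sender and receiver roles can be sketched end to end on a toy scale. Everything below is an assumption made for illustration: `toy_next_probs` is a hypothetical stand-in for a language model, the vocabulary has 8 tokens, the seed space is 12 bits instead of 32, and `random.Random` stands in for the inference stack's PRNG.

```python
import random

VOCAB_SIZE = 8  # toy vocabulary; real models have far larger ones

def toy_next_probs(context):
    """Hypothetical stand-in for an LLM: distribution depends on the last token."""
    last = context[-1] if context else 0
    weights = [1 + (last * 7 + i * 3) % 5 for i in range(VOCAB_SIZE)]
    total = sum(weights)
    return [w / total for w in weights]

def intervals_for(probs):
    """Cumulative [low, high) interval per token; the last one ends at 1.0."""
    out, low = [], 0.0
    for p in probs:
        out.append((low, low + p))
        low += p
    out[-1] = (out[-1][0], 1.0)
    return out

def generate(seed, prompt, n_tokens):
    """Sender: ordinary seeded sampling, with the payload used as the seed."""
    rng, context, text = random.Random(seed), list(prompt), []
    for _ in range(n_tokens):
        u = rng.random()
        ivs = intervals_for(toy_next_probs(context))
        token = next(t for t, (lo, hi) in enumerate(ivs) if lo <= u < hi)
        text.append(token)
        context.append(token)
    return text

def reconstruct_intervals(prompt, text):
    """Receiver, step 1: replay the model over the text to get each interval."""
    context, observed = list(prompt), []
    for token in text:
        observed.append(intervals_for(toy_next_probs(context))[token])
        context.append(token)
    return observed

def recover_seeds(observed, seed_bits):
    """Receiver, step 2: keep every seed whose draws land in all intervals."""
    matches = []
    for candidate in range(2 ** seed_bits):
        rng = random.Random(candidate)
        if all(lo <= rng.random() < hi for lo, hi in observed):
            matches.append(candidate)
    return matches

payload = 0xABC      # secret message, used directly as the seed
prompt = [3, 1, 4]   # shared prompt (known-prompt setting)
text = generate(payload, prompt, n_tokens=40)
recovered = recover_seeds(reconstruct_intervals(prompt, text), seed_bits=12)
```

The sender never touches the sampler or the distribution; the only thing that carries the message is the choice of seed. The exhaustive loop in `recover_seeds` is the toy counterpart of the seed-space search the receiver performs.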

Two Operational Modes

Known Prompt: Both parties share the prompt; the receiver can accurately reconstruct intervals, and forced alignment achieves perfect recovery.
Unknown Prompt: The receiver has only the generated text; it recovers the seed via approximate interval reconstruction plus maximum hit count scoring.
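Maximum hit count scoring can be illustrated with a small simulation. This is a sketch under stated assumptions, not the paper's procedure: the noisy intervals are simulated rather than derived from a model, 70% of them are assumed to be reconstructed correctly, and the names `hit_count`, `recover_by_hits`, and `simulate_noisy_intervals` are hypothetical.

```python
import random

def hit_count(seed, intervals):
    """Number of positions where this seed's draw lands in the estimated interval."""
    rng = random.Random(seed)
    return sum(lo <= rng.random() < hi for lo, hi in intervals)

def recover_by_hits(intervals, seed_bits):
    """Score every candidate seed and return the one with the most hits."""
    return max(range(2 ** seed_bits), key=lambda s: hit_count(s, intervals))

def simulate_noisy_intervals(true_seed, n_tokens, width, p_correct, noise_seed):
    """Simulated approximate reconstruction: some intervals miss the true draw."""
    rng, noise = random.Random(true_seed), random.Random(noise_seed)
    intervals = []
    for _ in range(n_tokens):
        u = rng.random()
        if noise.random() < p_correct:
            lo = u - noise.random() * width  # interval that contains the draw
        else:
            lo = noise.random()              # mis-reconstructed interval
        lo = min(max(lo, 0.0), 1.0 - width)
        intervals.append((lo, lo + width))
    return intervals

true_seed = 0x2A7
intervals = simulate_noisy_intervals(
    true_seed, n_tokens=100, width=0.1, p_correct=0.7, noise_seed=99
)
best = recover_by_hits(intervals, seed_bits=12)
```

Exact matching would reject the true seed as soon as one interval is wrong. Counting hits tolerates those errors: a wrong seed hits each interval only by chance, while the true seed hits every correctly reconstructed one, so its score stands out once the text is long enough.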

Section 04

Experimental Evidence and Analysis of Influencing Factors

Experimental Results

Known Prompt: Tested across 6 model families and 5 text domains; 32-bit seeds are recovered from a 2^32 candidate space with 100% accuracy within 300 tokens, taking <35 seconds on a single GPU.
Unknown Prompt: Recovery accuracy approaches perfection at 600-800 tokens, taking approximately 12 seconds.
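A back-of-envelope estimate, not taken from the paper, suggests why a few hundred tokens can single out one seed among 2^32. It assumes an idealized model in which a wrong seed's draws behave like independent uniform values, and it writes w_t for the width of the interval occupied by the t-th observed token (that token's sampling probability).

```latex
\begin{align}
  \Pr[\text{a wrong seed matches all } T \text{ intervals}]
    &= \prod_{t=1}^{T} w_t \\
  \mathbb{E}[\text{number of false matches}]
    &\approx 2^{32} \prod_{t=1}^{T} w_t \\
  \mathbb{E}[\text{number of false matches}] \ll 1
    &\iff \sum_{t=1}^{T} \log_2 \frac{1}{w_t} \gg 32
\end{align}
```

Under this assumption each token contributes log2(1/w_t) bits toward identifying the seed, so confident, low-entropy text needs more tokens than diverse text to pass the 32-bit threshold.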

Influencing Factors

Prompt Strategy: Affects probability distribution and reconstruction accuracy
Tokenization Ambiguity: Introduces noise into interval reconstruction, since the same text can correspond to more than one token sequence
Sampling Hyperparameters (temperature, top-p): Affect channel capacity and reliability
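The effect of the sampling hyperparameters can be made concrete with a small calculation. The sketch below uses made-up logits and hypothetical helper names; it uses the entropy of the sampling distribution as a rough proxy for how many bits about the PRNG draw each token reveals, which is an assumption of this example rather than a metric reported in the study.

```python
import math

def softmax(logits, temperature):
    """Temperature-scaled softmax, computed stably."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def top_p_filter(probs, top_p):
    """Keep the smallest high-probability set whose mass reaches top_p, renormalized."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, mass = [], 0.0
    for i in order:
        kept.append(i)
        mass += probs[i]
        if mass >= top_p:
            break
    filtered = [0.0] * len(probs)
    for i in kept:
        filtered[i] = probs[i] / mass
    return filtered

def entropy_bits(probs):
    """Expected value of log2(1 / interval width) for the sampled token."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

logits = [2.0, 1.0, 0.5, 0.0, -1.0]  # made-up next-token logits
bits_by_temperature = {t: entropy_bits(softmax(logits, t)) for t in (0.5, 1.0, 1.5)}
truncated = top_p_filter(softmax(logits, 1.0), top_p=0.8)
bits_after_top_p = entropy_bits(truncated)
```

Lower temperature and tighter top-p both concentrate probability mass on fewer tokens. The intervals get wider, each observed token constrains the draw less, and more tokens are needed before the seed is pinned down.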

Section 05

Research Conclusions: Security Implications and Steganography Feasibility

Steganographic transmission of a 32-bit payload is feasible, enough to carry short sensitive data such as key instructions or encryption key material.
"Not knowing the prompt" is not a valid security assumption: hidden information can still be extracted without the original prompt.
Basic LLM components (e.g., PRNG) may become vectors for security attacks.

Section 06

Response Recommendations: Potential Mitigation Measures

Possible mitigations for this steganographic channel:

Use unpredictable random seed sources
Add random noise to inference services
Monitor abnormal generation patterns
Adopt security-hardened inference stacks for sensitive applications
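The first mitigation, an unpredictable seed source, might look like the following on the serving side. This is a hedged sketch: `fresh_seed` and `make_sampler_rng` are hypothetical names, the 64-bit width is an arbitrary choice, and a real service would wire the seed into its own sampler rather than into `random.Random`.

```python
import random
import secrets

def fresh_seed(client_seed=None):
    """Ignore any caller-supplied seed and draw one from the OS CSPRNG instead."""
    # client_seed is accepted only so the call signature stays compatible;
    # it is deliberately unused, which removes the sender's control of the seed.
    return secrets.randbits(64)

def make_sampler_rng(client_seed=None):
    """Build the per-request sampling PRNG from a server-chosen seed."""
    return random.Random(fresh_seed(client_seed))

rng = make_sampler_rng(client_seed=42)
first_draw = rng.random()
```

The trade-off is reproducibility: callers can no longer replay a generation by passing the same seed, which some debugging and evaluation workflows rely on.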

Section 07

Broader Impact: System Design and Research Directions

For LLM Service Providers

Providers need to consider steganography resistance during the system design phase.

For Security Researchers

The work opens a new direction: designing and evaluating generative models that resist steganography.

Beyond reporting a vulnerability, the study examines where the security boundaries of LLM systems lie.
