Reading

ShellGames: A Large Model-Based SSH Deception System and Dynamic Network Defense

This article introduces the ShellGames system, an SSH honeypot based on large language models, which addresses the limitations of traditional honeypots in interaction authenticity and persistence through various technical innovations.

网络欺骗蜜罐SSH大语言模型网络安全主动防御arXiv

Published 2026-06-16 22:40Recent activity 2026-06-17 10:32Estimated read 5 min

ShellGames: A Large Model-Based SSH Deception System and Dynamic Network Defense

Section 01

[Main Floor/Introduction] ShellGames: Core Overview of the Large Model-Based SSH Deception System

ShellGames is an SSH honeypot system based on large language models (LLMs), designed to address the limitations of traditional honeypots in interaction authenticity, long-term session maintenance, and other aspects. It combines various technical innovations (such as automatic chain of thought, memory management, speculative execution, etc.) to effectively overcome issues like statelessness and inconsistent output in pure LLM solutions. This article is sourced from an arXiv paper (arXiv:2606.17986v1), published on June 16, 2026.

Section 02

[Background] Dilemmas of Network Deception and Limitations of Pure LLM Solutions

Network deception and moving target defense are important active defense strategies, but they face dilemmas such as insufficient interaction authenticity, difficulty maintaining long-term sessions, and high requirements for behavioral consistency. Traditional honeypots either have limited interaction (low-interaction) or high cost and risk (high-interaction). Although pure LLM solutions can generate realistic text, they have problems like lack of persistent state, inconsistent output, hallucinations, response delays, and vulnerability to subversion.

Section 03

[Method] Five Technical Innovations of ShellGames

ShellGames addresses the above issues through five technologies: 1. Automatic chain of thought and few-shot learning to improve response correctness; 2. A memory management system to maintain persistent states (file systems, processes, etc.); 3. Speculative execution to reduce response delays; 4. Intelligent routing of complex commands to real sandboxes; 5. Subversion detection mechanisms to identify malicious attempts.

Section 04

[Evidence] Performance Verification and User Study of ShellGames

Standardized benchmark tests cover four dimensions: correctness, consistency, state tracking, and robustness. Experimental results show: command accuracy of 0.898 (5.3% improvement), sequence-level accuracy of 0.918 (36% improvement), state tracking accuracy of 0.98 (18.3% improvement), and robustness accuracy of 0.95 (37% improvement). In user studies, 20 participants found it difficult to distinguish ShellGames from a real Shell, with excellent performance in realism and command coverage.

Section 05

[Conclusion] Application Value and Technical Insights of ShellGames

Application scenarios include attacker behavior analysis, threat intelligence collection, attack chain delay, blue team training, etc. Technical insights: the value of hybrid architectures (LLM + real systems), the importance of state management, and the versatility of speculative execution.

Section 06

[Outlook] Limitations and Future Directions of ShellGames

Limitations: high resource consumption, challenges in handling complex scenarios, risk of adversarial attacks. Future directions: optimizing resource efficiency, enhancing complex scenario capabilities, improving adversarial robustness, and exploring multimodal honeypots.

Continue Reading

Keep going with more reads from the same topic.

Nornir MCP Server: An Enterprise-Grade Bridge for Integrating Large Language Models into Network Automation

Nornir MCP Server is an enterprise-level server based on the Model Context Protocol (MCP). It seamlessly integrates large language models (such as Claude) with the Nornir network automation framework, supporting natural language orchestration for multi-vendor network devices (Cisco, Arista, Juniper, etc.), and providing production-grade features like a dual-engine architecture (NAPALM + Netmiko), intelligent filtering, and a secure sandbox.

Recent activity 2026-05-06 20:51

Bibliothèque Française LLM: A French Public Domain Literature Index System Optimized for Large Language Models

Bibliothèque Française LLM is a structured indexing and annotation project for French public domain literature designed specifically for large language models (LLMs). It integrates multiple authoritative sources such as DraCor, Common Corpus, and Wikisource, providing metadata indexing categorized by genre, author, and era, as well as in-depth annotations for dramatic texts (including characters, lines, stage directions, etc.). Its aim is to enable LLMs to efficiently read and understand classic French literary works.

Recent activity 2026-05-06 20:50

Splinter: A Lock-Free Zero-Copy Shared Memory KV and Vector Storage Library That Eliminates Socket and Memcpy Overhead for LLM Inference

Splinter is a minimalist, high-performance key-value (KV) and vector storage system enabling zero-latency inter-process communication via shared memory and atomic operations. With only 766 lines of core code, it supports millions of operations per second and 768-dimensional vector storage, offering a new architectural approach for local LLM inference and data-intensive applications.

Recent activity 2026-04-03 08:49

libmlxforge: An Embedded MLX LLM Inference Engine for Apple Silicon

libmlxforge is an embeddable MLX large language model (LLM) inference engine designed specifically for Apple Silicon. It provides a unified C ABI interface, supports calls from Node.js, Swift, and Rust, and features continuous batching, streaming output, JSON-constrained structured output, and embedding vector generation.

Recent activity 2026-06-09 17:23