# Peeky: A Privacy-First AI Desktop Assistant for Windows with Offline Voice and Visual Interaction

> Peeky is a desktop AI assistant built for Windows around privacy-first operation, and it can run fully offline. It supports voice dialogue, screenshot analysis, camera-based visual recognition, and clipboard processing. All model inference runs locally, so prompts, images, and responses stay on the device.

- Board: [Openclaw Llm](https://www.zingnex.cn/en/forum/board/openclaw-llm)
- Published: 2026-05-09T11:41:00.000Z
- Last activity: 2026-05-09T11:50:16.503Z
- Popularity: 154.8
- Keywords: AI assistant, privacy-first, offline operation, voice interaction, multimodal, Ollama, Windows application, local large language model, screen recognition, visual question answering
- Page link: https://www.zingnex.cn/en/forum/thread/peeky-windows-ai
- Canonical: https://www.zingnex.cn/forum/thread/peeky-windows-ai
- Markdown source: floors_fallback

---

## [Introduction] Peeky: A Privacy-First Offline AI Desktop Assistant for Windows

Peeky is a desktop AI assistant built for Windows around privacy-first operation, and it can run fully offline. It offers multimodal interaction through voice dialogue, screenshot analysis, camera-based visual recognition, and clipboard processing. Its core philosophy is "See, Think, Help": it runs open-source large language models locally via Ollama, so model inference never depends on a cloud service.

## [Background] Peeky's Design Philosophy and Privacy-First Positioning

Unlike cloud-based AI assistants, Peeky keeps model inference on the local machine by running open-source large language models through Ollama. Users can interact by voice, screenshot, camera, or clipboard, and in offline mode no data leaves the device, which addresses the main privacy concern with cloud-based assistants.

## [Core Features] Detailed Explanation of Peeky's Multi-Modal Interaction Capabilities

Peeky supports five interaction features:
1. Voice dialogue: uses the Google Speech API for recognition and Edge TTS for synthesis when online, and switches to faster-whisper and Windows SAPI when offline;
2. Screen capture: drag to select a region and ask a question about it; a local multimodal model analyzes the content;
3. Camera vision: takes a photo with the camera and identifies what it shows;
4. Clipboard processing: analyzes the clipboard text with one click;
5. Video Coach: captures a baseline image, gives voice guidance, and verifies that the task was completed.
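
As a rough illustration of the screen-capture and camera features, the sketch below shows how an image and a question could be sent to a local Ollama server through its `/api/generate` endpoint. This is not Peeky's actual code: the function names `build_vision_request` and `ask_about_image` are hypothetical, and the default port and model tag are assumptions based on Ollama's defaults and the model recommended later in this post.

```python
import base64
import json
import urllib.request

# Assumption: Ollama is listening on its default local address.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_vision_request(image_bytes: bytes, question: str,
                         model: str = "gemma4:e4b") -> dict:
    """Build a non-streaming /api/generate payload with one attached image."""
    return {
        "model": model,
        "prompt": question,
        # Ollama expects images as base64-encoded strings.
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,
    }


def ask_about_image(image_bytes: bytes, question: str,
                    timeout: float = 120.0) -> str:
    """Send the image and question to the local server and return the answer."""
    payload = json.dumps(build_vision_request(image_bytes, question)).encode("utf-8")
    request = urllib.request.Request(
        OLLAMA_URL, data=payload, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read())["response"]
```

With a screenshot saved to disk, a call might look like `ask_about_image(open("region.png", "rb").read(), "What does this dialog ask me to do?")`. The request only ever targets `localhost`, which is what keeps the image on the device.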

## [Technical Architecture] Peeky's Offline Operation Guarantee and Technical Implementation

Peeky's stack provides an online and an offline path for each voice stage, while inference always runs locally:

| Stage | Online | Offline |
|---|---|---|
| Audio capture | ffmpeg + DirectShow | ffmpeg + DirectShow |
| Speech recognition | Google Speech API | faster-whisper (base) |
| Inference | Ollama (local) | Ollama (local) |
| Speech synthesis | edge-tts (Aria) | pyttsx3 + SAPI (Zira) |

Peeky checks for network connectivity first. If no network is available, it skips the online services automatically, which avoids lag.
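
The network check and fallback described above could be sketched roughly as follows. How Peeky actually detects connectivity is not stated in this post, so the TCP probe, its default host and port, and the names `VoiceBackends`, `is_online`, and `select_backends` are all assumptions made for illustration.

```python
import socket
from dataclasses import dataclass


@dataclass(frozen=True)
class VoiceBackends:
    speech_to_text: str
    text_to_speech: str


# Backend names taken from the online/offline table above.
ONLINE = VoiceBackends("Google Speech API", "edge-tts (Aria)")
OFFLINE = VoiceBackends("faster-whisper (base)", "pyttsx3 + SAPI (Zira)")


def is_online(host: str = "8.8.8.8", port: int = 53, timeout: float = 1.5) -> bool:
    """Return True if a TCP connection to the probe host succeeds in time.

    The probe target is an assumption; any reliably reachable host would do.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def select_backends(online: bool) -> VoiceBackends:
    """Pick the online pair only when the network check passed."""
    return ONLINE if online else OFFLINE
```

Calling `select_backends(is_online())` once at startup, with a short timeout, means an offline machine goes straight to the local backends instead of waiting on a failed cloud request for every utterance.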

## [Privacy Protection] Analysis of Peeky's Privacy and Security Measures

Peeky's privacy protections:
1. All model inference runs locally; prompts, images, and responses do not leave the device;
2. Online services are used only when the machine is connected to the internet and the user has given permission; even then, Google Speech receives only audio and Edge TTS receives only text;
3. Interaction history is stored locally in memory.json, which users can delete or clear at any time.
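
To make the third point concrete, here is a minimal sketch of a local JSON history file that can be appended to and cleared. The post does not describe the real layout of memory.json, so the list-of-turns schema and the helper names `load_history`, `append_turn`, and `clear_history` are assumptions.

```python
import json
from pathlib import Path


def load_history(path: Path) -> list[dict]:
    """Return the stored turns, or an empty list if the file does not exist."""
    if not path.exists():
        return []
    return json.loads(path.read_text(encoding="utf-8"))


def append_turn(path: Path, role: str, content: str) -> None:
    """Append one turn and rewrite the file in full."""
    history = load_history(path)
    history.append({"role": role, "content": content})
    path.write_text(
        json.dumps(history, ensure_ascii=False, indent=2), encoding="utf-8"
    )


def clear_history(path: Path) -> None:
    """Delete the history file; do nothing if it is already gone."""
    path.unlink(missing_ok=True)
```

Because the history is a plain file on disk, clearing it is equivalent to deleting memory.json by hand, which matches the user-controlled deletion described above.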

## [Use Cases] Peeky's Practical Application Value and Applicable Scenarios

Peeky suits several kinds of use:
- Screen capture: explaining interfaces, analyzing charts, and guiding software operations;
- Video Coach: teaching demonstrations, device maintenance, and software training;
- Privacy-sensitive work: fully offline operation keeps data on the device;
- Text processing: quick analysis, summarization, and translation of clipboard content.

## [Installation & Outlook] Peeky's Configuration Requirements and Future Development Direction

Installation requirements: Windows 10/11, Python 3.10+, a local Ollama service, microphone permission, and 8 GB of disk space. The gemma4:e4b multimodal model is recommended, and the faster-whisper base model is downloaded automatically on first launch.

Summary: Peeky returns control over personal data to the user and illustrates one direction for privacy-first AI tools. As local models improve, tools of this kind are likely to play a larger role in personal computing.
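
The installation requirements listed above can be checked before first launch with a small preflight script. The sketch below is illustrative only and not part of Peeky: `check_requirements` and `check_this_machine` are hypothetical names, and reading "8GB" as 8 GiB of free space is an assumption. It does not check the Ollama service or microphone permission.

```python
import shutil
import sys

MIN_PYTHON = (3, 10)
MIN_FREE_BYTES = 8 * 1024**3  # Assumption: "8GB" interpreted as 8 GiB free


def check_requirements(python_version: tuple, platform: str,
                       free_bytes: int) -> list[str]:
    """Return a list of unmet requirements; an empty list means all passed."""
    problems = []
    if not platform.startswith("win"):
        problems.append("Windows 10/11 is required")
    if tuple(python_version[:2]) < MIN_PYTHON:
        problems.append("Python 3.10 or newer is required")
    if free_bytes < MIN_FREE_BYTES:
        problems.append("at least 8 GB of free disk space is required")
    return problems


def check_this_machine(path: str = ".") -> list[str]:
    """Run the checks against the current interpreter, OS, and disk."""
    free = shutil.disk_usage(path).free
    return check_requirements(sys.version_info, sys.platform, free)
```

Keeping the checks in a pure function that takes its inputs as arguments makes them easy to verify on any machine, while `check_this_machine()` supplies the real values.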
