Reading

AuraDent: A Real-Time Voice-Driven Dental Clinical Documentation Automation Platform

AuraDent is a real-time documentation platform for dental clinics. Using Deepgram speech recognition, AI intelligent extraction, and AWS asynchronous processing, it automatically converts doctors' chairside dictations into structured medical records, treatment charts, and post-treatment guidelines.

医疗AI语音识别临床文档牙科DeepgramAWS LambdaPII脱敏

Published 2026-04-27 06:44Recent activity 2026-04-27 07:23Estimated read 5 min

Section 01

Introduction / Main Floor: AuraDent: A Real-Time Voice-Driven Dental Clinical Documentation Automation Platform

Section 02

Pain Points and Opportunities in Clinical Documentation

During dental treatment, doctors need to record medical records, update treatment charts, and write post-treatment guidelines while treating patients—these documentation tasks are both time-consuming and error-prone. The traditional approach is to fill in records from memory after treatment, which makes it hard to ensure the accuracy and completeness of information. AuraDent was created to address this industry pain point: letting doctors focus on treatment while AI handles documentation.

Section 03

System Architecture Overview

AuraDent uses a TypeScript monorepo architecture, integrating real-time voice processing, AI intelligent extraction, and asynchronous post-processing. The entire system is divided into five core modules:

Section 04

Real-Time Gateway

The real-time gateway built with Fastify and WebSocket is the system's entry point. It receives front-end audio streams from browsers and forwards them to Deepgram for speech recognition. The gateway manages session lifecycles, distinguishes between partial and final transcriptions, and performs PII (Personally Identifiable Information) desensitization before sending content to AI.

Section 05

Intelligent Agent Core

This is the system's "brain", built on the Vercel AI SDK. The agent receives desensitized transcribed text, extracts structured clinical findings through typed tool calls and Zod validation. For example, when a doctor says "The patient's lower right second molar needs root canal treatment", the agent identifies the tooth position (#31), diagnosis (needs root canal treatment), and updates the corresponding data structure.

Section 06

Web Frontend

The clinical terminal interface built with React + Vite provides real-time feedback to doctors. The interface includes:

Waveform Visualization: Displays microphone activity status
Transcription Area: Shows partial and final transcribed text
Treatment Chart: Animates updates to tooth status
Tracking View: Displays the agent's thinking process, tool calls, and completion events

Section 07

Normalization Layer (Ingestion)

Responsible for converting the raw structured data extracted by the agent into a record format suitable for persistence, including deduplication logic (merging multiple mentions of the same tooth) and source tracing (recording the voice segment corresponding to each finding).

Section 08

Asynchronous Worker

A post-processing module based on AWS Lambda. When a session ends, the gateway sends session data (desensitized transcriptions, structured findings, tracking records, performance metrics) to an SQS queue, triggering the worker to generate post-treatment PDF guidelines, simulate insurance pre-authorization, and write the complete record to PostgreSQL.

Continue Reading

Keep going with more reads from the same topic.

Nornir MCP Server: An Enterprise-Grade Bridge for Integrating Large Language Models into Network Automation

Nornir MCP Server is an enterprise-level server based on the Model Context Protocol (MCP). It seamlessly integrates large language models (such as Claude) with the Nornir network automation framework, supporting natural language orchestration for multi-vendor network devices (Cisco, Arista, Juniper, etc.), and providing production-grade features like a dual-engine architecture (NAPALM + Netmiko), intelligent filtering, and a secure sandbox.

Recent activity 2026-05-06 20:51

Bibliothèque Française LLM: A French Public Domain Literature Index System Optimized for Large Language Models

Bibliothèque Française LLM is a structured indexing and annotation project for French public domain literature designed specifically for large language models (LLMs). It integrates multiple authoritative sources such as DraCor, Common Corpus, and Wikisource, providing metadata indexing categorized by genre, author, and era, as well as in-depth annotations for dramatic texts (including characters, lines, stage directions, etc.). Its aim is to enable LLMs to efficiently read and understand classic French literary works.

Recent activity 2026-05-06 20:50

Splinter: A Lock-Free Zero-Copy Shared Memory KV and Vector Storage Library That Eliminates Socket and Memcpy Overhead for LLM Inference

Splinter is a minimalist, high-performance key-value (KV) and vector storage system enabling zero-latency inter-process communication via shared memory and atomic operations. With only 766 lines of core code, it supports millions of operations per second and 768-dimensional vector storage, offering a new architectural approach for local LLM inference and data-intensive applications.

Recent activity 2026-04-03 08:49

libmlxforge: An Embedded MLX LLM Inference Engine for Apple Silicon

libmlxforge is an embeddable MLX large language model (LLM) inference engine designed specifically for Apple Silicon. It provides a unified C ABI interface, supports calls from Node.js, Swift, and Rust, and features continuous batching, streaming output, JSON-constrained structured output, and embedding vector generation.

Recent activity 2026-06-09 17:23