Reading

Federated Learning vs. Centralized Learning: A Complete Benchmark Suite

Using the Flower.ai framework to compare the performance of three mainstream machine learning models under federated learning and centralized learning, supporting round-by-round accuracy tracking and CSV export.

联邦学习Flower.ai机器学习隐私计算基准测试Random ForestXGBoost神经网络

Published 2026-05-19 22:43Recent activity 2026-05-19 22:48Estimated read 6 min

Federated Learning vs. Centralized Learning: A Complete Benchmark Suite

Section 01

Guide to the Federated Learning vs. Centralized Learning Benchmark Suite

This article introduces an open-source benchmark project based on the Flower.ai framework, which systematically compares the performance differences between federated learning (FL) and traditional centralized learning across three mainstream models: Random Forest, Neural Network (MLP), and XGBoost. The project supports dual-mode training comparison, Non-IID data simulation, round-by-round accuracy tracking, and CSV result export, aiming to help researchers and engineers understand the performance differences between FL and centralized learning and provide data support for technical decisions.

Section 02

Project Background and Motivation

Traditional centralized learning requires collecting all data on a server for training, which is efficient but faces data privacy challenges (e.g., sensitive data in healthcare and finance cannot leave local devices). Federated learning allows multi-party collaborative training without sharing raw data, but can FL achieve the performance of centralized training? How do different models perform in an FL environment? These questions gave birth to this benchmark project.

Section 03

Project Overview and Core Features

The project is built on the Flower.ai framework and supports three models: Random Forest, Neural Network (MLP), and XGBoost. Core features include:

Dual-mode comparison: Supports both centralized and federated training for direct comparison;
Non-IID data: Simulates real-world scenarios where each client has a different data category distribution;
Round-by-round accuracy tracking: Records model performance after each round of FL communication;
Structured export: Automatically generates comparison reports in CSV format.

Section 04

Technical Implementation Details

Data Distribution: Non-IID partitioning is used by default, with 3 clients each having a different category distribution; FL Process: Each round includes client local training → local evaluation → server metric aggregation → statistical calculation → next round; Aggregation Strategy: Neural networks use the FedAvg algorithm to aggregate parameters; tree models (Random Forest, XGBoost) aggregate prediction results after independent training; Configurable Parameters: Adjust the number of clients (default 3, max ~6), number of communication rounds (default 10), number of local training rounds, task type (classification/regression), etc., via config/__init__.py.

Section 05

Experimental Results and Analysis

The project was tested on the Iris dataset, with results as follows: Centralized Learning Results

Model	Accuracy
Random Forest	90.0%
Neural Network	76.7%
XGBoost	93.3%

Federated Learning Results (After 10 Rounds)

Model	Average Accuracy	Minimum Accuracy	Maximum Accuracy
Random Forest	92.2%	86.7%	96.7%
Neural Network	84.4%	80.0%	90.0%
XGBoost	88.9%	80.0%	96.7%

Analysis: 1. Random Forest performs slightly better in FL than in centralized mode (due to ensemble learning effects); 2. Neural Network performance improves from 76.7% to 84.4% in FL, but Non-IID data has a significant impact; 3. XGBoost maintains high performance in both modes.

Section 06

Application Value and Future Directions

Application Value:

Technical selection reference: Helps teams choose suitable models for FL environments;
Performance expectation setting: Understand the possible range of accuracy loss in FL;
Rapid prototype verification: Quickly verify the feasibility of FL on specific datasets using existing code.

Future Directions:

Support more UCI datasets;
Improve FL aggregation strategies for tree models;
Introduce advanced aggregation algorithms like FedProx and FedOptimize;
Add differential privacy support;
Optimize client sampling and communication efficiency.

Continue Reading

Keep going with more reads from the same topic.

SignalCut: An Intelligent Tool for Turning AI Search Visibility Gaps into Video Marketing Campaigns

SignalCut is an innovative web application that analyzes brands' visibility gaps in AI search, automatically generates evidence-based marketing strategies, and creates Hera video materials, helping early-stage brands gain a competitive edge in the AI answer engine era.

Recent activity 2026-04-26 11:27

AWS Open-Sources AI Search Citation Analysis System: Track Brand Exposure in AI Search Engines

An open-source project officially released by AWS, built on Amazon Bedrock, Step Functions, and React to form a complete serverless citation analysis system. It helps enterprises monitor their brand's citation status and competitive landscape in AI searches like ChatGPT, Perplexity, Gemini, and Claude.

Recent activity 2026-03-31 20:49

Next.js Application SEO and GEO Integrated Optimization Solution: Comprehensive Visibility from Search Engines to AI Assistants

This article delves into the stevewerme/seo-geo-nextjs project, an open-source tool designed specifically for Next.js applications to simultaneously optimize traditional search engine rankings (SEO) and generative engine visibility (GEO). It analyzes the project's core architecture, implementation mechanisms, practical application scenarios, and its strategic significance for developers and content creators.

Recent activity 2026-04-03 14:48

Baiyuan GEO Platform Technical White Paper: SaaS Engineering Practice for Generative Engine Optimization (GEO)

This article deeply analyzes the GEO Platform technical white paper developed by Baiyuan Technology, covering the seven-dimensional AI citation rate scoring algorithm, AXP shadow document delivery mechanism, Schema.org three-layer entity knowledge graph, and the hallucination automatic detection and repair closed-loop system, providing an engineering solution for brands to gain visibility in generative AI such as ChatGPT and Claude.

Recent activity 2026-04-18 22:54