HERB+: Evolving an Industrial-Strength Privacy-Preserving Machine Learning Framework

Qianying Liao, Alexandre Cortez Santos, Bruno Cabral, João Paulo Fernandes, Nuno Lourenco

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Supervised machine learning does not hold without data. However, the needed data can be distributed in different locations and are non-shareable under privacy constraints. Methods to circumvent disclosure restrictions in collaborative machine learning are in strong demand. Thus, we propose HERB+ (Homomorphic Encryption for Random forest and gradient Boosting plus), a confidential learning framework for tree-based models under the scenario of vertically dispersed data. While previous related work focused on a specific algorithm, this work presents a wide variety of privacy-preserved and distributed tree-based algorithms (i.e., Decision Tree, Random Forest, and Gradient Boosting Decision Trees for both classification and regression tasks). HERB+ provides the most detailed and general discussions on using Fully Homomorphic Encryption for computing distributed tree-based algorithms during the training process. Our experiments show that although the learning protocols' efficiencies are not optimal, the predictive performance and privacy are preserved. The results imply that practitioners can overcome the barrier of data sharing and produce tree-based models for data-heavy domains with strict privacy requirements, such as Health Prediction, Fraud Detection, and Risk Evaluation.

Original languageEnglish (US)
Title of host publicationProceedings - 2022 IEEE 27th Pacific Rim International Symposium on Dependable Computing, PRDC 2022
PublisherIEEE Computer Society
Pages212-223
Number of pages12
ISBN (Electronic)9781665485555
DOIs
StatePublished - 2022
Event27th IEEE Pacific Rim International Symposium on Dependable Computing, PRDC 2022 - Virtual, Online, China
Duration: Nov 28 2022Dec 1 2022

Publication series

NameProceedings of IEEE Pacific Rim International Symposium on Dependable Computing, PRDC
Volume2022-November
ISSN (Print)1541-0110

Conference

Conference27th IEEE Pacific Rim International Symposium on Dependable Computing, PRDC 2022
Country/TerritoryChina
CityVirtual, Online
Period11/28/2212/1/22

Keywords

  • BFV
  • BGV
  • CART
  • CKKS
  • decision tree
  • fully homomorphic encryption
  • gradient boosting
  • privacy-preserving machine learning
  • random forest
  • vertical distribution

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Computer Science Applications
  • Hardware and Architecture
  • Software

Fingerprint

Dive into the research topics of 'HERB+: Evolving an Industrial-Strength Privacy-Preserving Machine Learning Framework'. Together they form a unique fingerprint.

Cite this