Privacy-preserving logistic regression
Kamalika Chaudhuri
Information Theory and Applications
University of California, San Diego
kamalika@soe.ucsd.edu
Claire Monteleoni∗
Center for Computational Learning Systems
Columbia University
cmontel@ccls.columbia.edu
Abstract
This paper addresses the important tradeoff between privacy and learnability,
when designing algorithms for learning from private databases. We focus on
privacy-preserving logistic regression. First we apply an idea of Dwork et al. [6]
to design a privacy-preserving logistic regression algorithm. This involves bound-
ing the sensitivity of regularized logistic regression, and perturbing the learned
classifier with noise proportional to the sensitivity.
We then provide a privacy-preserving regularized logistic regression algorithm
based on a new privacy-preserving technique: solving a perturbed optimization
problem. We prove that our algorithm preserves privacy in the model due to [6].
We provide learning guarantees for both algorithms, which are tighter for our new
algorithm, in cases in which one would typically apply logistic regression. Ex-
periments demonstrate improved learning performance of our method, versus the
sensitivity method. Our privacy-preserving technique does not depend on the sen-
sitivity of the function, and extends easily to a class of convex loss functions. Our
work also reveals an interesting connection between regularization and privacy.
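To make the new technique concrete before the technical development, the following is a minimal sketch of the perturbed-optimization idea for regularized logistic regression: a random linear term is added to the objective before minimizing, rather than adding noise to the learned classifier. The function name, the use of numpy/scipy, and the exact noise scaling are illustrative assumptions; the precise construction and its privacy guarantee are given in the body of the paper.

```python
import numpy as np
from scipy.optimize import minimize

def objective_perturbed_lr(X, y, lam, eps, rng=None):
    """Illustrative sketch: regularized logistic regression with a perturbed objective.

    X   : (n, d) array of examples, assumed scaled so that ||x_i|| <= 1
    y   : (n,) array of labels in {-1, +1}
    lam : regularization parameter
    eps : privacy parameter
    """
    rng = np.random.default_rng() if rng is None else rng
    n, d = X.shape

    # Draw a random vector b with density proportional to exp(-eps * ||b|| / 2):
    # uniform direction, Gamma(d, 2/eps)-distributed norm (illustrative construction).
    direction = rng.normal(size=d)
    direction /= np.linalg.norm(direction)
    b = rng.gamma(shape=d, scale=2.0 / eps) * direction

    def objective(w):
        # average logistic loss + L2 regularizer + random linear perturbation
        return (np.mean(np.logaddexp(0.0, -y * (X @ w)))
                + 0.5 * lam * (w @ w)
                + (b @ w) / n)

    return minimize(objective, np.zeros(d), method="L-BFGS-B").x
```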
1 Introduction
Privacy-preserving machine learning is an emerging problem, due in part to the increased reliance on
the internet for day-to-day tasks such as banking, shopping, and social networking. Moreover, pri-
vate data such as medical and financial records are increasingly being digitized, stored, and managed
by independent companies. In the cryptography and information security literature, formal definitions
of data privacy have been proposed; however, designing machine learning algorithms that adhere to
them has not been well explored. On the other hand, the data-mining community has introduced
algorithms that aim to respect other, less formally justified notions of privacy.
Our goal is to bridge the gap between approaches in the cryptography and information security com-
munity, and those in the data-mining community. This is necessary, as there is a tradeoff between
the privacy of a protocol and the learnability of functions that respect the protocol. In addition to
the specific contributions of our paper, we hope to encourage the machine learning community to
embrace the goals of privacy-preserving machine learning, as it is still a fledgling endeavor.
In this work, we provide algorithms for learning in a privacy model introduced by Dwork et al. [6].
The ε-differential privacy model limits how much information an adversary can gain about a par-
ticular private value, by observing a function learned from a database containing that value, even if
she knows every other value in the database. An initial positive result [6] in this setting depends on
the sensitivity of the function to be learned, which is the maximum amount the function value can
change due to an arbitrary change in one input. Using this method requires bounding the sensitivity
of the function class to be learned and then adding noise proportional to that sensitivity; such bounds
can be difficult to obtain for some functions that are important in machine learning.
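Concretely, a randomized mechanism M is ε-differentially private if, for any two databases D and D' that differ in a single record and any set S of outputs, P[M(D) ∈ S] ≤ e^ε · P[M(D') ∈ S]. The sketch below illustrates the sensitivity method for regularized logistic regression: fit the classifier as usual, then add random noise whose magnitude is proportional to the sensitivity. This is a sketch under stated assumptions: the function name, the use of numpy/scipy, and the sensitivity bound 2/(nλ) used for the noise scale are illustrative; the actual bound and privacy argument appear later in the paper.

```python
import numpy as np
from scipy.optimize import minimize

def sensitivity_method_lr(X, y, lam, eps, rng=None):
    """Illustrative sketch of the sensitivity method for regularized logistic
    regression: train as usual, then add noise scaled to the sensitivity.

    X   : (n, d) array of examples, assumed scaled so that ||x_i|| <= 1
    y   : (n,) array of labels in {-1, +1}
    lam : regularization parameter
    eps : privacy parameter
    """
    rng = np.random.default_rng() if rng is None else rng
    n, d = X.shape

    # Step 1: ordinary L2-regularized logistic regression.
    def objective(w):
        return np.mean(np.logaddexp(0.0, -y * (X @ w))) + 0.5 * lam * (w @ w)

    w_hat = minimize(objective, np.zeros(d), method="L-BFGS-B").x

    # Step 2: perturb the learned classifier. The sensitivity of the minimizer is
    # taken here to be 2 / (n * lam); a bound of this form is derived in the paper.
    sensitivity = 2.0 / (n * lam)
    direction = rng.normal(size=d)
    direction /= np.linalg.norm(direction)
    noise = rng.gamma(shape=d, scale=sensitivity / eps) * direction
    return w_hat + noise
```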
∗ The majority of this work was done while at UC San Diego.