CDTO Campus 2024: Shaping the Digital Future
Georgetown Universityjj1088@georgetown.edu
Thursday, 8 August 2024
💻 | 🌏 | |
---|---|---|
MS in Computer Science, Stanford University | → | PhD in International Relations, Columbia University |
Assistant Teaching Professor, Data Science and Analytics | → | Courtesy Teaching Professor, McCourt School of Public Policy |
🇵🇸 | 🇺🇦 |
---|---|
2015-2023: Taught Mobile App Development in West Bank+Gaza | So when I learned of дія, I thought some lessons/skills could apply! |
For \(N = 2\), simple mechanism for envy-free division:
We consider the well-studied cake cutting problem in which the goal is to find an envy-free allocation based on queries from \(n\) agents. The problem has received attention in computer science, mathematics, and economics. It has been a major open problem whether there exists a discrete bounded envy-free protocol. We resolve the problem with a discrete bounded envy-free protocol for any number of agents. The maximum number of queries required by the protocol is \(n^{n^{n^{n^{n^n}}}}\). (Aziz and Mackenzie 2016)
Jeff teaches Data Ethics at Georgetown University. Jeff teaches Data Ethics at Georgetown University. Jeff teaches Data Ethics at Georgetown University. Jeff teaches Data Ethics at Georgetown University. Jeff teaches Data Ethics a...
Джеф викладає етику даних у Джорджтаунському університеті. Джеф викладає етику даних у Джорджтаунському університеті. Джеф викладає етику даних у Джорджтаунському університеті. Джеф викладає етику дан...
\[ \begin{align*} \max_{c \mkern1.0mu \in \mkern1.0mu \text{Choices}} \; & f(c) = \text{Goodness of }c \\ \text{subject to } & g(c) = \text{Constraints on }c \end{align*} \]
\[ \begin{align*} \max_{\omega \mkern1.0mu \in \mkern1.0mu [0,1]^N} \; & f(\omega) = \textstyle\sum_{i=0}^{N}\omega_i u_i \\ \text{subject to } &\textstyle\sum_{i=0}^{N}\omega_i = 1 \end{align*} \]
We inject “noise” into the data by swapping records for certain households with those from households with similar characteristics in a nearby area. The Census does not release information about its specific methods for swapping. While this confidentiality around swapping techniques is important to protect against disclosure, it means that the practice is not transparent to data users
If traditional disclosure avoidance techniques were applied to the 2020 Census data, the amount of noise required to protect against new attacks would make census data unfit for most uses
\(\implies\) We need a framework for quantifying “acceptable” privacy loss (tradeoff!)
The answer: Differencing Attacks 😰
громада | True Population | Noise | Noisy Counts | Post-Processed | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
<18 | 18+ | Total | <18 | 18+ | Total | <18 | 18+ | Total | <18 | 18+ | Total | |
Зелена Поляна | 20 | 57 | 77 | 0 | -4 | +2 | 20 | 53 | (≠)79 | 23 (+3) | 54 (+1) | 77 (-2) |
Рихли | 30 | 82 | 112 | -3 | +2 | +3 | 27 | 84 | 115 | 28 (+1) | 85 (+1) | 113 (-2) |
Великий Ліс | 20 | 86 | 106 | -2 | +1 | +1 | 18 | 87 | 107 | 17 (-1) | 87 | 104 (-3) |
Розльоти | 150 | 155 | 305 | 0 | +2 | 0 | 150 | 157 | 305 | 150 | 156 (-1) | 306 (+1) |
Total | 220 | 380 | 600 | -5 | +1 | +6 | 215 | 381 | 606 | 218 | 382 | 600 |
\[ M(D) \triangleq \text{Lap}(x; \varepsilon) = \frac{1}{2\varepsilon}\exp\mkern-2.5mu\left[-\frac{|x|}{\varepsilon}\right] \]
\[ \Pr\left[M(D) \in S\right] \leq e^\varepsilon \times \Pr\mkern-2.5mu\left[M(D') \in S\right] \]
\[ \mathcal{L}(\gamma) = \ln\left[\frac{\Pr[M(D) = \gamma]}{\Pr[M (D') = \gamma]}\right] \]
Consider a real-valued function \(f\). The (worst-case, or global) sensitivity of \(f\) is the maximum absolute value by which the addition or deletion of a single database row can change the value of \(f\):
\[ \Delta f = \max_{D, D'}|f(D) - f(D')| \]
Queries of the form “How many people in the database are over six feet tall?” have sensitivity \(\Delta f = 1\), since the presence or absence of any individual in \(D\) can affect the true answer by at most 1. Thus, the Laplace mechanism returns the true count perturbed by a random draw from \(\text{Lap}(x; 1/\varepsilon)\). (Dwork 2014)
Shaping the Digital Future: Data Ethics