Week 12: Final Projects, Causality and Racecraft
DSAN 5450: Data Ethics and Policy
Spring 2024, Georgetown University
Making Projects Less Scary
How Do I Pick A Topic?
- I know that “whatever is interesting to you” can be way overly-vague!
- So, one approach is: imagine yourself in a job interview for your dream job, and they bring up DSAN 5450: “Interesting, what did you do in that class?”
- [Insert final project elevator pitch] “Wow, that’s such a cool project, we really want someone who can [say] take a data-driven approach to a policy question like that. You’re hired!”
- (Jeff gets a commission: 10% of your salary)
The Rating System
Discussing Fairness
Below Expectations |
"This algorithm is unfair" |
Meets Expectations |
"This algorithm violates the Predictive Parity criterion of fairness when run on this dataset" |
Exceeds Expectations |
"This algorithm violates the Predictive Parity criterion when run on this dataset, but that’s because [other mitigating factor]. It still satisfies Within-\(\varepsilon\) Predictive Parity, for \(\varepsilon = 0.02\)" |
Doing Too Much |
All of the above, plus I developed a new better algorithm that is more fair |
Evaluating Policy
Below Expectations |
"This policy is bad" |
Meets Expectations |
"This policy is bad bc it's biased towards [group 1], and doesn't take sufficient account of the welfare of [group 2]" |
Exceeds Expectations |
"This policy is bad bc it's biased towards [group 1], and doesn't take sufficient account of the welfare of [group 2], which violates the Rawlsian notion of what would be chosen by rational agents behind a 'veil of ignorance'" |
Doing Too Much |
"This policy is bad bc the inferred welfare weights \(\omega_i\) are \(0.1632\) off from the optimal welfare weights \(\omega_i^*\)" |
Frequently Awesome Questions
- How much data-processing/R/Python is required? None!
- You can think of it like: In every other DSAN class, we plop a dataset in front of you and say “do the thing”
- Here you can (and should) first ask, “is [data sci technique] an effective way to address this issue?”
- Example: archive of missing datasets
- Sometimes the problem is not “We haven’t trained a neural net on this yet!” but instead “nobody with power cares enough about this to make a dataset in the first place [or they purposefully prevent it]”
Getting From Here to There
- Minimum Viable Product (MVP)
- \(\leadsto\) Final Product (but… Hofstadter’s Law)
Causality and Identity Formation
- Race as a Noun vs. Race as a Verb (“Racecraft”)
- Race as a static property vs. race as a social practice
But First… Phenomenology
- “Objective” account: Roquentin sits down on a bus seat; “Subjective” account:
I lean my hand on the seat but pull it back hurriedly: it exists. This thing I’m sitting on, leaning my hand on, is called a seat. They made it purposely for people to sit on, they took leather, springs and cloth, they went to work with the idea of making a seat and when they finished, that was what they had made. They carried it here, into this car and the car is now rolling and jolting with its rattling windows, carrying this red thing in its bosom. I murmur: “It’s a seat,” a little like an exorcism. But the word stays on my lips: it refuses to go and put itself on the thing. It stays what it is, with its red plush, thousands of little red paws in the air, all still, little dead paws…
W.E.B. Du Bois and the Epistemological One-Way Mirror
Black people in America are […] born with a veil […] in this American world—a world which yields him no true self-consciousness, but only lets him see himself through the revelation of the other world. It is a peculiar sensation, this double-consciousness, this sense of always looking at oneself through the eyes of others, of measuring one’s soul by the tape of a world that looks on in amused contempt and pity. One ever feels his two-ness—an American, a Negro; two souls, two thoughts, two unreconciled strivings; two warring ideals in one dark body. (Du Bois 1903)
- The veil: the world is seen and experienced differently on either side of the color line
- One-way mirror: Whites project their constructions of Blacks onto the veil and see their projections reflected on it \(\Rightarrow\) the power to define themselves and others
- The projections of whites onto the veil become realities that Black subjects have to contend with in their self-formation.
- Twoness: in process of self-formation, the racialized subject must account for the views of two different social worlds—the Black world, constructed behind the veil, and the white world, which dehumanizes via lack of recognition of their humanity.
\(\textsf{Race}_{\textsf{Variable}}\) vs. \(\textsf{Race}_{\textsf{Construct}}\)
- Careful scientific, causal studies measure the effect that changing \(X\) (\(do(X)\)) has on \(Y\), controlling for \(C\) (via, at least under the hood, “Do-Calculus”)
- But, even the most careful, controlled (and thus informative!) experiments must, at some level, partition variables into “race” and “not race”
- Keep in back of your mind as we look at example of how (measured by thorough, statistically-principled experiment), race can have direct, measurable, causal impacts on important aspects of our everyday lives
Racial Discrimination
- Marianne Bertrand and Sendhil Mullainathan. 2004. “Are Emily and Greg More Employable Than Lakisha and Jamal? A Field Experiment on Labor Market Discrimination.” American Economic Review. (Bertrand and Mullainathan 2004)
We study race in the labor market by sending fictitious resumes to help-wanted ads in Boston and Chicago newspapers. To manipulate perceived race, resumes are randomly assigned African-American- or White-sounding names. White names receive 50 percent more callbacks for interviews. Callbacks are also more responsive to resume quality for White names than for African-American ones. The racial gap is uniform across occupation, industry, and employer size. We also find little evidence that employers are inferring social class from the names. Differential treatment by race still appears to still be prominent in the U.S. labor market.
“Controlling for” Everything Besides Race
Age Discrimination?
- Based on Lily Hu, What is ‘Race’ in Algorithmic Discrimination on the Basis of Race? - IPAM at UCLA (YouTube)
“Cool Theory, I Guess…”
- “Good luck measuring ideas inside of people’s heads… I’ll be over here measuring real things and doing real data science!” -My Opps
“Cool Theory, I Guess…”
Opening A Big Can Of Worms
- Social interactions among \(t^e_0\), \(t^e_1\), \(t^e_2\)…
Opening A Big Can Of Worms
- Social interactions among \(t^e_0\), \(t^e_1\), \(t^e_2\)…
- Mediated by external things \(o^e_3\) to \(o^e_8\) (giving rise to patterns of interaction)…
Opening A Big Can Of Worms
- Social interactions among \(t^e_0\), \(t^e_1\), \(t^e_2\)…
- Mediated by external things \(o^e_3\) to \(o^e_8\) (giving rise to patterns of interaction)…
- Each person \(x\) forming their own internal representations \(\widetilde{t^x_0}\), \(\widetilde{t^x_1}\), \(\widetilde{t^x_2}\) of one another based on patterns of interaction, then
- Generalizing to an internal representation of a “type of person” \(\widetilde{t^x_9}\)…
Opening A Big Can Of Worms
- Social interactions among \(t^e_0\), \(t^e_1\), \(t^e_2\)
- Mediated by external things \(o^e_3\) to \(o^e_8\) (giving rise to patterns of interaction)
- Each person \(x\) forming their own internal representations \(\widetilde{t^x_0}\), \(\widetilde{t^x_1}\), \(\widetilde{t^x_2}\) based on patterns of interaction, then
- Generalizing to an internal representation of a “type of person” \(\widetilde{t^x_9}\)
- Which they then externalize as \(t^x_9\).
- \(t^0_9\), \(t^1_9\), \(t^2_9\) “congeal” into a shared external representation \(t_9^e\) via social mechanism (discussion, media, culture, propaganda, parenting, religion, education, …) \(\Rightarrow t^e_9\) “reified” (causal effects on \(t_0\), \(t_1\), \(t_2\))