TPDP 2024 Talk Notes

This page provides notes for my TPDP 2024 talk called "Data and Privacy in Data Privacy".

Training Data Attribution

Short Summary

Differential privacy, membership inference, and training data attribution are all research areas that care a lot about "counterfactual worlds" where certain examples are included or excluded. Right now, there is not much overlap between these areas (both in the people working on them and the techniques), and my claim here is that there should be.

Reading List

Open Questions/Directions

Data Curation

Short Summary

Curating data has become very important to training state of the art models. However, there is limited investigation of the implications and opportunities of data curation for privacy.

Reading List

Open Questions/Directions

Privacy Semantics

Short Summary

The ML privacy literature has begun to consider different "privacy semantics". Often dealing more with access control-type approaches rather than differential privacy, the DP community's experience thinking about privacy may be helpful here.

Reading List

Open Questions/Directions