Highlights
PUEs turn data learnability into a certified quantity, giving data owners a formal tool for reasoning about whether protected data can still be used to train accurate models.
Abstract
We introduce the theory of certified learnability. Certified $(q,\eta)$-Learnability measures how learnable a dataset is by computing a probabilistic upper bound on the test performance of classifiers trained on this dataset, as long as those classifiers fall within a certified parameter set.
Using certified $(q,\eta)$-Learnability, we measure the effectiveness and robustness of unlearnable examples and propose Provably Unlearnable Examples (PUEs), which can reduce the learnability of protected training data.
Motivation
Challenge
Unlearnable examples are designed to prevent unauthorized model training, but their protection strength needs to be measured beyond empirical attack-and-defense benchmarks.
Solution
PUE connects data protection to certified learnability, giving a principled measurement for how learnable a protected dataset can be under certified training conditions.
BibTeX
@inproceedings{wang2025provably,
title={Provably Unlearnable Data Examples},
author={Wang, Derui and Xue, Minhui and Li, Bo and Camtepe, Seyit and Zhu, Liming},
booktitle={Network and Distributed System Security Symposium},
year={2025}
}