Research
Papers
Open research from the KlusAI privacy program. Each paper ships a working artifact — a benchmark, dataset, or model — not just a writeup.
KlusAI Technical Report
EuroPriv-Bench · Pan-European de-identification benchmark · 7 languages · re-identification-risk metric
EuroPriv-Bench: A Unified Pan-European De-identification Benchmark with Re-identification Risk Metrics
Detection F1 does not predict privacy on decode-bearing national identifiers: the weakest PII detector leaks the fewest Romanian national IDs (1.4%), while the strongest detectors leak 26–35%. Aggregate F1 can stay high while a model misses the rare, high-stakes tokens that carry the re-identification risk; national IDs are the clearest provable case. EuroPriv-Bench is a unified, openly licensed pan-European de-identification benchmark that scores re-identification risk, not just detection F1.
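The gap between aggregate F1 and leakage can be illustrated with a toy calculation. The sketch below is not the EuroPriv-Bench metric or its data: the counts are synthetic, the names `Counts`, `micro_f1`, and `leak_rate` are hypothetical, and "leak rate" is assumed here to mean simply the fraction of gold spans of one type that the detector misses.

```python
from dataclasses import dataclass


@dataclass
class Counts:
    tp: int  # gold spans the detector found
    fp: int  # spurious spans the detector flagged
    fn: int  # gold spans the detector missed (these survive in the output)


def micro_f1(per_type: dict[str, Counts]) -> float:
    """Micro-averaged F1 over all entity types (pooled counts)."""
    tp = sum(c.tp for c in per_type.values())
    fp = sum(c.fp for c in per_type.values())
    fn = sum(c.fn for c in per_type.values())
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def leak_rate(c: Counts) -> float:
    """Assumed stand-in for leakage: share of gold spans left undetected."""
    total = c.tp + c.fn
    return c.fn / total if total else 0.0


# Synthetic counts for two hypothetical detectors (not benchmark results).
# Names dominate the corpus; national IDs are rare.
detector_a = {
    "NAME": Counts(tp=950, fp=20, fn=50),
    "NATIONAL_ID": Counts(tp=14, fp=0, fn=6),
}
detector_b = {
    "NAME": Counts(tp=800, fp=150, fn=200),
    "NATIONAL_ID": Counts(tp=20, fp=5, fn=0),
}

for name, det in [("A", detector_a), ("B", detector_b)]:
    print(
        f"detector {name}: micro-F1={micro_f1(det):.3f}, "
        f"national-ID leak rate={leak_rate(det['NATIONAL_ID']):.2f}"
    )
```

In this toy setup detector A has the higher micro-F1 yet misses 6 of 20 national IDs, while detector B scores lower overall and misses none, because the rare type contributes almost nothing to the pooled counts.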