Abstract
The debate on the use of personal data in language resources usually focuses — and rightfully so — on anonymisation. However, this very same debate usually ends quickly with the conclusion that proper anonymisation would necessarily cause loss of linguistically valuable information. This paper discusses an alternative approach — pseudonymisation. While pseudonymisation does not solve all the problems (inasmuch as pseudonymised data are still to be regarded as personal data and therefore their processing should still comply with the GDPR principles), it does provide a significant relief, especially — but not only — for those who process personal data for research purposes. This paper describes pseudonymisation as a measure to safeguard rights and interests of data subjects under the GDPR (with a special focus on the right to be informed). It also provides a concrete example of pseudonymisation carried out within a research project at the Institute of Information Technology and Communications of the Otto von Guericke University Magdeburg.- Anthology ID:
- 2022.legal-1.4
- Volume:
- Proceedings of the Workshop on Ethical and Legal Issues in Human Language Technologies and Multilingual De-Identification of Sensitive Data In Language Resources within the 13th Language Resources and Evaluation Conference
- Month:
- June
- Year:
- 2022
- Address:
- Marseille, France
- Editors:
- Ingo Siegert, Mickael Rigault, Victoria Arranz
- Venue:
- LEGAL
- SIG:
- Publisher:
- European Language Resources Association
- Note:
- Pages:
- 17–21
- Language:
- URL:
- https://aclanthology.org/2022.legal-1.4
- DOI:
- Cite (ACL):
- Pawel Kamocki and Ingo Siegert. 2022. Pseudonymisation of Speech Data as an Alternative Approach to GDPR Compliance. In Proceedings of the Workshop on Ethical and Legal Issues in Human Language Technologies and Multilingual De-Identification of Sensitive Data In Language Resources within the 13th Language Resources and Evaluation Conference, pages 17–21, Marseille, France. European Language Resources Association.
- Cite (Informal):
- Pseudonymisation of Speech Data as an Alternative Approach to GDPR Compliance (Kamocki & Siegert, LEGAL 2022)
- PDF:
- https://preview.aclanthology.org/naacl24-info/2022.legal-1.4.pdf