MasKIT
Pseudonymization of Czech legal texts
Introduction
MasKIT is an online tool and REST API service for the anonymization (pseudonymization) of Czech legal texts. Given plain text as input (e.g., a letter sent by a legal authority to a citizen), it calls external services for dependency parsing and named entity recognition, then identifies and replaces personal information in the text.
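To illustrate the final "identify and replace" step, here is a minimal sketch in Python. It is not MasKIT's actual algorithm: the `Entity` type, the `pseudonymize` function, the class labels and the numbered placeholder format are all hypothetical, and the entity spans are assumed to have been found already by an upstream recognizer.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Entity:
    """A recognized span of personal information (hypothetical structure)."""
    start: int   # character offset where the span begins
    end: int     # character offset just past the span
    label: str   # illustrative class name, e.g. "PERSON"


def pseudonymize(text: str, entities: list[Entity]) -> str:
    """Replace each non-overlapping span with a numbered placeholder.

    The same surface string with the same label always maps to the same
    placeholder, so repeated mentions stay linked in the output.
    """
    placeholders: dict[tuple[str, str], str] = {}
    counters: dict[str, int] = {}
    pieces: list[str] = []
    pos = 0
    for ent in sorted(entities, key=lambda e: e.start):
        surface = text[ent.start:ent.end]
        key = (ent.label, surface)
        if key not in placeholders:
            counters[ent.label] = counters.get(ent.label, 0) + 1
            placeholders[key] = f"[{ent.label}_{counters[ent.label]}]"
        pieces.append(text[pos:ent.start])   # untouched text before the span
        pieces.append(placeholders[key])     # the replacement itself
        pos = ent.end
    pieces.append(text[pos:])                # untouched tail
    return "".join(pieces)
```

Mapping identical mentions to the same placeholder keeps the text readable after replacement, which is the usual reason to prefer pseudonymization over simply blanking out the spans.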
The software is available under the Creative Commons CC BY-NC-SA licence.
Copyright 2023-2024 by Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University, Czech Republic.
The available methods are described in the API Documentation.
Online Web Application and Web Service
The MasKIT web application is available at http://quest.ms.mff.cuni.cz/maskit/.
The MasKIT REST web service is also available; its API documentation is at http://quest.ms.mff.cuni.cz/maskit/api-reference.php.
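As a rough sketch of how a client might call the REST service from Python, the example below builds a form-encoded POST request using only the standard library. The endpoint path `api/process` and the parameter names `text`, `input` and `output` are assumptions made for illustration, as is the JSON response format; the API reference is the authoritative source for all of them.

```python
from __future__ import annotations

import json
import urllib.parse
import urllib.request

# Assumed endpoint; verify the real path in the API reference.
MASKIT_API = "http://quest.ms.mff.cuni.cz/maskit/api/process"


def build_request(text: str, api_url: str = MASKIT_API) -> urllib.request.Request:
    """Build (but do not send) a POST request carrying the input text."""
    # Parameter names below are assumptions, not confirmed by this page.
    payload = urllib.parse.urlencode(
        {"text": text, "input": "txt", "output": "txt"}
    ).encode("utf-8")
    return urllib.request.Request(api_url, data=payload, method="POST")


def call_maskit(text: str, api_url: str = MASKIT_API, timeout: float = 30.0) -> dict:
    """Send the request and decode the response, assuming it is JSON."""
    request = build_request(text, api_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Separating `build_request` from `call_maskit` lets the request be inspected or tested without network access; calling `call_maskit("...")` performs the actual round trip.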
MasKIT User's Manual
The MasKIT User's Manual is available on a separate page.
MasKIT API Reference
The MasKIT API Reference is available on a separate page.
Contact
Authors:
Acknowledgements
The development of MasKIT was financed by the TAČR SIGMA project TQ01000526: PONK - Asistent přístupné úřední komunikace.
MasKIT relies on the following external services:
- UDPipe (https://lindat.mff.cuni.cz/services/udpipe/)
- NameTag (http://lindat.mff.cuni.cz/services/nametag/)
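For readers who want to query these two services directly, the following Python sketch builds request URLs for them and parses CoNLL-U output of the kind a dependency parser returns. The `api/process` and `api/recognize` method names and the `tokenizer`, `tagger`, `parser` and `data` parameters follow the services' public REST documentation as understood here and should be verified there; how MasKIT itself calls the services is not described on this page, and the helper names are hypothetical.

```python
from __future__ import annotations

import urllib.parse

UDPIPE_PROCESS = "https://lindat.mff.cuni.cz/services/udpipe/api/process"
NAMETAG_RECOGNIZE = "http://lindat.mff.cuni.cz/services/nametag/api/recognize"


def udpipe_url(text: str) -> str:
    """URL asking UDPipe to tokenize, tag and parse the given text."""
    query = urllib.parse.urlencode(
        {"tokenizer": "", "tagger": "", "parser": "", "data": text}
    )
    return f"{UDPIPE_PROCESS}?{query}"


def nametag_url(text: str) -> str:
    """URL asking NameTag to recognize named entities in the given text."""
    return f"{NAMETAG_RECOGNIZE}?{urllib.parse.urlencode({'data': text})}"


def parse_conllu(result: str) -> list[dict]:
    """Extract basic token fields from a CoNLL-U string.

    Comment lines, blank lines, multiword ranges (e.g. "1-2") and
    empty nodes (e.g. "1.1") are skipped.
    """
    tokens = []
    for line in result.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        tokens.append({
            "id": int(cols[0]),
            "form": cols[1],
            "lemma": cols[2],
            "upos": cols[3],
            # HEAD is "_" when no parse is available
            "head": int(cols[6]) if cols[6].isdigit() else None,
            "deprel": cols[7],
        })
    return tokens
```

Both services are expected to return JSON whose `result` field holds the processed text; for UDPipe that field can be passed straight to `parse_conllu`.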
This work has used language resources developed, stored or distributed by the LINDAT/CLARIAH-CZ project of the Ministry of Education of the Czech Republic (project LM2023062).