Korektor is a statistical spellchecker and (occasional) grammar checker released under 2-Clause BSD license and versioned using Semantic Versioning.
Korektor started with Michal Richter's diploma thesis Advanced Czech Spellchecker, but it is being developed further. There are two versions: a command line utility (tested on Linux, Windows and OS X) and a REST service with publicly available API and HTML front end.
The original OS X SpellServer providing System Service integrating Korektor with native OS X GUI applications is no longer developed, but do not hesitate to contact us if you are interested in it.
Copyright 2015 by Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University in Prague, Czech Republic.
Korektor Spellchecker is a browser plugin which allows using the Korektor Spellchecker for most editable input fields. The plugin allows either directly correcting the content, or showing a dialog with suggested corrections.
Note that the dialog with suggestions is injected directly into the original page, so there can be various problems on untested sites. However, the plugin seem to work fine on many sites.
Although the sources of Korektor service and this plugin are available under BSD-3-Clause license, please respect the CC BY-SA-NC licence of the spellchecking models.
The plugin is available for the following browers:
Korektor Web Application is available at https://lindat.mff.cuni.cz/services/korektor/ using LINDAT/CLARIN infrastructure.
Korektor REST Web Service is also available, with the API documentation available at https://lindat.mff.cuni.cz/services/korektor/api-reference.php.
Korektor releases are available on GitHub, either as a pre-compiled binary package, or source code packages only.
To use Korektor, a spellchecker model is needed. The language models are available from LINDAT/CLARIN infrastructure and described further in the Korektor User's Manual. Currently the following language models are available:
The original Michal Richter's version can be downloaded here.
Korektor is an open-source project and is freely available for non-commercial purposes. The library is distributed under 2-Clause BSD license and the associated models and data under CC BY-NC-SA, although for some models the original data used to create the model may impose additional licensing conditions.
If you use this tool for scientific work, please give credit to us by referencing Korektor website and Richter et al. 2012.
Korektor Installation on separate page.
Korektor User's Manual on separate page.
Korektor Model Creation on separate page.
Current Authors:
Original Author:
This work has been using language resources developed and/or stored and/or distributed by the LINDAT/CLARIN project of the Ministry of Education of the Czech Republic (project LM2010013).
Acknowledgements for individual language models are listed in Korektor User's Manual page.
@InProceedings{richter12, booktitle = {Proceedings of the 24th International Conference on Computational Linguistics (Coling 2012)}, title = {Korektor--A System for Contextual Spell-checking and Diacritics Completion}, editor = {Martin Kay and Christian Boitet}, author = {Michal Richter and Pavel Stra{\v{n}}{\'{a}}k and Alexandr Rosen}, year = {2012}, publisher = {Coling 2012 Organizing Committee}, organization = {{IIT} Bombay}, address = {Mumbai, India}, venue = {{IIT} Bombay, {VMCC}}, pages = {1--12} }
If you prefer to reference Korektor by a persistent identifier (PID),
you can use http://hdl.handle.net/11234/1-1469
.