Most isolated letters (e.g. A-konto) are handled as abbreviations. Only if they do not form part of a name they are lemmatized as _^(označení_pomocí_písmene)
: zápas skupiny B.
The following is a prototype of lemmas, their numbers and AddInfos for an isolated letter. There should be such lemmas for all letters of the Czech alphabet. Note that numbering a lemma by zero is not used anywhere else and might be deprecated in future. Anyway, no program should ever rely that the numbers will be as indicated. Lemma numbers serve to distinguish between homonymous lemmas but they are not meant to bear any semantic information.
K-0_:B_;Y
- given namesK-4_:B_;K
- names of institutionsK-5_:B_;G
- geographical namesK-6_:B_;R
- names of productsK-7_:B_;m
- other names (sporting events etc.)K-9_:B_;S
- surnamesk-8_:B_^(ost._zkratka)
- other abbreviations (not names) - should not be used if the annotator knows the abbreviated word - then the word_:B
lemma should be used insteadk-3_^(označení_pomocí_písmene)
- other isolated letters (not abbreviations, not in names)Table 4.3. Examples of isolated letters
Expression |
Annotation of the letter |
---|---|
A-mužstvo |
|
§ 27 odst. 1 písm. d |
|
16 A |
|
A-konto |
|
ABC, a.s. |
|
na s. 128 |
|