Federica Gamba
Projects
PhD topic: Dealing with Latin variability in parsing (supervisor: Daniel Zeman)
GAUK (104924, 2024-2026): Adapting Uniform Meaning Representation (UMR) for the Italic/Romance languages.
Curriculum Vitae
- since March 2022: PhD student in Computational Linguistics at ÚFAL MFF UK.
- 2021-2022: Research assistant at the Institute for Computational Linguistics (ILC-CNR), Pisa, Italy.
- 2018-2021: Graduate degree in Humanities at IUSS Pavia (University School for Advanced Studies), Italy. Final thesis: 'More data and new tools. Advances in parsing the Index Thomisticus Treebank'.
- 2018-2020: Master's degree in Theoretical and Applied Linguistics at University of Pavia, Italy. Final thesis: 'Including a new textual resource into the LiLa Knowledge Base. Lemmatization, PoS tagging and linking of Querolus'.
- 2015-2019: Undergraduate degree in Humanities at IUSS Pavia (University School for Advanced Studies), Italy.
- 2015-2018: Bachelor's degree in Classics at University of Pavia, Italy.
Selected Bibliography
- Google Scholar
- ORCID: 0000-0003-3632-0594
- Scopus ID: 58024445100
- Researcher ID: HPE-7554-2023
- Predicate Sense Disambiguation for UMR Annotation of Latin: Challenges and Insights. In: Proceedings of the 1st Workshop on Machine Learning for Ancient Languages, pp. 19-29, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 979-8-89176-144-5
- Publishing the Dictionary of Medieval Latin in the Czech Lands as Linked Data in the LiLa Knowledge Base. In: Italian Journal of Computational Linguistics, ISSN 2499-4553, vol. 10, no. 1, pp. 95-116
- Universal Feature-based Morphological Trees. In: Proceedings of the LREC-COLING 2024 Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD 2024), pp. 125-137, European Language Resources Association (ELRA), Torino, Italy, ISBN 978-2-493814-20-3
- Towards a Conversion of the Prague Dependency Treebank Data to the Uniform Meaning Representation. In: Proceedings of the 24th Conference Information Technologies – Applications and Theory (ITAT 2024), pp. 62-76, CEUR-WS.org, Košice, Slovakia
- ÚFAL LatinPipe at EvaLatin 2024: Morphosyntactic Analysis of Latin. In: Proceedings of the Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA) @ LREC-COLING-2024, pp. 207-214, ELRA and ICCL, Torino, Italy, ISBN 978-2-493814-46-3
- Linking the Dictionary of Medieval Latin in the Czech Lands to the LiLa Knowledge Base. In: Proceedings of the Ninth Italian Conference on Computational Linguistics, pp. 1-8, CEUR Workshop Proceedings, Venice, Italy
- Latin Morphology through the Centuries: Ensuring Consistency for Better Language Processing. In: Proceedings of the Ancient Language Processing Workshop, pp. 59-67, INCOMA, Varna, Bulgaria, ISBN 978-954-452-087-8
- Universalising Latin Universal Dependencies: a harmonisation of Latin treebanks in UD. In: Proceedings of the Sixth Workshop on Universal Dependencies (UDW, GURT/SyntaxFest 2023), pp. 7-16, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-34-0
- Language Technologies for the Creation of Multilingual Terminologies. Lessons Learned from the SSHOC Project. In: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pp. 154-163, European Language Resources Association, Marseille, France, ISBN 979-10-95546-72-6
- D3.9 Report on Ontology and Vocabulary Collection and Publication (technical report).
- More Data and New Tools. Advances in Parsing the Index Thomisticus Treebank. In: Proceedings of the Conference on Computational Humanities Research 2021, pp. 108-122, CEUR Workshop Proceedings (CEUR-WS.org)