Jan Hajič jr.
Main Research Interests
My research interests are in the field of Music Information Retrieval, perhaps better described as Computational Music Processing, and Digital Musicology. Besides my own research and reasearch leadership, I love supervising students on these theses!
Specifically, at UFAL, I work on Optical Music Recognition -- currently as the Principal Investigator for OmniOMR, a 5-year applied research project of the Czech Ministry of Culture, together with a team from the Moravian Library in Brno.
My second active topic is digital analysis of Gregorian Chant: especially questions of what is an appropriate music theory for chant and its eight modes, and whether the development of its melodies can be traced via methods from bioinformatics. Formaly, I am the principal investigator of the Genome of Melody project, funded by the John Templeton Foundation through the Cultural Evolution Society, but that one is housed at the Masaryk Institute and Archives of the Czech Academy of Sciences, of which I am also an employee. At UFAL, I pursue this axis of research through leading student theses, which I support through my role as the Co-Investigator and head of Chant Analytics team of the Digital Analysis of Chant Transmission project of the Canadian Social Sciences and Humanities Research Council.
The third topic that I am active in is music generation, mostly because of the huge interest in theses on this topic. I am especially interested in connecting music generation capabilities to specific worthwhile artistic goals, and in support of the artistic process and professional life of musicians -- not least because I play the harpsichord.
Older outputs:
My dissertation on OMR. Within my PhD, I published the MUSCIMA++ dataset.
Ribosomal RNA secondary structure prediction: PI of the GAUK project that created the rPredictor database, based on the ideas of Josef Pánek from the Institute of Microbiology of the Czech Academy of Sciences. (2014-2015) This experience with bioinformatics came in very handy: nine years later, it led to the Genome of Melody project!
Projects
Ongoing:
OmniOMR - optical music recognition using machine learning for digital libraries.
2023-2027, Principal Investigator
NAKI III programme grant no. DH23P08OVV008, supported by the Ministry of Culture of the Czech RepublicDigital Analysis of Chant Transmission
2023-2029. Co-Investigator, leader of the Chant Analytics team.
SSHRC Partnership Grant (Canada).Finished:
Multimodal Optical Music Recognition (GAUK 1444217), 2017 - 2019 (PI).
A summarizing technical report on the project (which basically re-uses text from my dissertation, whenever relevant, and adds things that were not part of my thesis but were done on the project) is here: GAUK1444217_techreport.pdfConvolutional Neural Networks for Optical Music Recognition (GAUK 170217), 2017 - 2018 (Co-investigator)
rRNA Secondary Structure Prediction (GAUK 550214), 2015 - 2016 (PI).
Teaching
Selected Bibliography
- Google Scholar
- ORCID: 0000-0002-9207-567X
- Scopus ID: 57193414196
- Researcher ID: P-4278-2017
Calvo-Zaragoza, J.; Hajič jr., J.; Pacha, A. Understanding Optical Music Recognition. ACM Computing Surveys 53:4, September 2020. (IF: 7.99)
Hajič jr., J.; Ballén, G.; Mühlová, K.H., Vlhová-Wörner, H. Towards building a phylogeny of Gregorian Chant melodies. Proceedings of the 24th International Society for Music Information Retrieval Conference, Milan, Italy, 2023, 571-578.
Lanz, V.; Hajič jr., J. Text boundaries do not provide a better segmentation of Gregorian antiphons. Proceedings of the 10th International Conference on Digital Libraries for Musicology (DLfM '23). Association for Computing Machinery, New York, NY, USA, 72–76.
Jelínek, J.; Hoksza, D.; Hajič jr. J.; Pešek, J.; Drozen, J.; Hladík, T.; Klimpera, M.; Vohradský, J.& Pánek, J. rPredictorDB: a predictive database of individual secondary structures of RNAs and their formatted plots. Database: Vol.2019, 2019, 1-9. (IF: 4.46)
Pacha, Alexander, Jorge Calvo-Zaragoza, and Jan Hajic Jr. Learning Notation Graph Construction for Full-Pipeline Optical Music Recognition. 20th International Society for Music Information Retrieval Conference, 2019, 75-82
Hajič jr., J.; Dorfer, M.; Widmer, G. & Pecina, P. Towards Full-Pipeline Handwritten OMR with Musical Symbol Detection by U-Nets. 19th International Society for Music Information Retrieval Conference, 2018, 225-232.
Pacha, A.; Hajič jr., J. & Calvo-Zaragoza, J. A Baseline for General Music Object Detection with Deep Learning. Applied Sciences, 2018, 8, 1488-1508. (IF: 1.61)
Dorfer, M.; Hajič jr., J.; Arzt, A.; Frostel, H. & Widmer, G. Learning Audio-Sheet Music Correspondences for Cross-Modal Retrieval and Piece Identification. Transactions of the International Society for Music Information Retrieval, 2018, 1, 22-33.
Hajič jr., J.; Kolárová, M., Pacha, A., & Calvo-Zaragoza, J. How current optical music recognition systems are becoming useful for digital libraries. In Proceedings of the 5th International Conference on Digital Libraries for Musicology, 2018, 57-61.
Dorfer, M., Hajič jr., J. & Widmer, G. Attention as a Perspective for Learning Tempo-invariant Audio Queries. The 2018 Joint Workshop on Machine Learning for Music, Proceedings of the 35th International Conference on Machine Learning, Stockholm, Sweden, PMLR 80, 2018. *
Matthias Dorfer, Jan Hajič Jr, Andreas Arzt, Harald Frostel, Gerhard Widmer: Learning Audio–Sheet Music Correspondences for Cross-Modal Retrieval and Piece Identification. Transactions of the International Society for Music Information Retrieval, 1 (1), 2018. [html]
Jan Hajič jr., Pavel Pecina: The MUSCIMA++ Dataset for Handwritten Optical Music Recognition. Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, Osaka Prefecture University, Kyoto, Japan, November 2017. [pdf]
Hajič jr., J.; Novotný, J.; Pecina, P. & Pokorný, J.: Further Steps towards a Standard Testbed for Optical Music Recognition. Proceedings of the 17th International Society for Music Information Retrieval Conference, New York University, 2016, 157-163 [pdf]
(For a full overview, see Google Scholar.)
Students
I am open to topics especially concerning music technology, song lyrics and poetry (and potentially any machine learning and/or NLP topic).
Here is a nice guide about how to write a thesis, by Martin Koutecký from IUUK.
Aside from that, here is a different guide about how to read research papers (and possibly how and why to write them),
Current students:
Adam Štefunko (PhD): Generating harmonic accompaniments (in style!)
Reut Tal (Bc.): A game interface for collecting natural human judgments on emotional content of music.
Šimon Libřický (Bc.): Implemented a plugin factory for MuseScore.
Jan Borecký (Mgr.): Using sound and music for orienting visually impaired players in open-world games.
Defended theses:
(Feel free to ask those people for references!)
Anna Dvořáková (Bc.): Mapping tools and network analysis for transmission of Gregorian Chant.
Emre Rasimgil (Bc.): Low-resource "infinite radio" music generation.
Patrik Backo (Bc.): Generating audio of novel drum samples.
Vojtěch Lanz defended his Mgr. thesis that experimented with nonparametric Bayesian methods to find meaningful segmentations of Gregorian chant melodies. His work was published at the Digital Libraries for Musicology conference in 2023!
Kristina Szabová defended her Bc. thesis that created software for the analysis of Gregorian chant, i.a. borrowing Multiple Sequence Alignment from bioinformatics in 2021. Her software is live at chantlab.mua.cas.cz.
Jiří Balhar defended his Bc. thesis on melody extraction from recordings of polyphonic music. Won the Dean's Award in the Bc. thesis category, published the work at a satellite event of the ISMIR 2019 conference (Late-Breaking Demo), achieved state-of-the-art results.
Marek Židek defended his Mgr. thesis on generating music a bit more cleverly using musical form derived in an unsupervised manner. Previously defended Bc. thesis on generating music with LSTMs. Both theses include significant effort in evaluation. A composition generated by his models was performed at the Microsoft DOTS conference in 2017.
Jan Výkruta (defended Bc. thesis on automatically expressively reading poetry)
Vladan Glončák (defended Mgr. thesis on exploiting syntax to identify specifically what people have an opinion about in evaluative text such as restaurant reviews.
(Story in progress.)
In 2023, I completed a Masters in harpsichord performance, and came fully back to Academia with two successful project applications: for applied research in Optical Music Recognition, and for basic research on cultural evolution of Gregorian Chant (specifically, its melodies).
In 2020-2022, I worked as a digital musicology postdoc for the project Old Myths, New Facts at the Masaryk Institute and Archives of the Czech Academy of Sciences. In the meantime, I was studying the harpsichord - finishing a Bachelors in combined performance and musicology in 2021, moving on to a Masters in performance.
In 2019, I finished PhD studies at ÚFAL, writing my thesis under RNDr. Pavel Pecina on the topic of Optical Music Recognition. I am generally interested in music informatics: if you are a student and have interest in music, especially machine learning for musical applications, I will be happy to hear about it! For instance, we did some music generation (interview in Czech).
My Mgr. thesis, also under RNDr. Pecina, was on the topic of automatically selecting images for news articles. This work was also done for the CEMI project. My thesis is available here: Matching Images to Texts
I have previously worked on the SEANCE project on Sentiment Analysis, with my Bc. thesis and in the following years.