User profiles for Dominika Kov�r�kov�

Dominika Kovarikova

Univerzita Karlova
Verified email at korpus.cz
Cited by 1011

SYN2015: Representative corpus of contemporary written Czech

…, L Chlumsk�, T Jel�nek, D Kov�ř�kov�…�- Proceedings of the�…, 2016 - aclanthology.org
The paper concentrates on the design, composition and annotation of SYN2015, a new 100-million
representative corpus of contemporary written Czech. SYN2015 is a sequel of the …

The structuralist tradition meets empirical data: Corpus data enhancing the Czech Internet Language Reference Book

D Kov�ř�kov�, M Beneš, K Smejkalov�…�- Word�…, 2023 - euppublishing.com
This paper demonstrates how the corpus grammar tool GramatiKat can be used to improve
and refine morphological information in the Internet Language Reference Book (ILRB), which …

Automatic Identification of Academic Phrases for Czech

D Kov�ř�kov�, O Kov�ř�k�- …�Europhras 2019, Malaga, Spain, September 25�…, 2019 - Springer
The aim of this study is to automatically extract academic phrases in Czech using data-mining
techniques as a first step towards creating a dictionary of academic words and phrases …

Lexicographer's lacunas or how to deal with missing representative dictionary forms on the example of Czech

D Kov�ř�kov�, M Škrabal, V Cvrček…�- International Journal�…, 2020 - academic.oup.com
When compiling a list of headwords, every lexicographer comes across words with an
unattested representative dictionary form in the data. This study focuses on how to distinguish …

Sharing data through specialized corpus-based tools: The case of GramatiKat

D Kov�ř�kov��- Jazykovedn� časopis, 2021 - ceeol.com
This paper presents a specialized corpus tool GramatiKat in the context of Open Science
principles, namely data sharing, which offers opportunities for original research and facilitates …

[PDF][PDF] What belongs in a dictionary? The Example of Negation in Czech

D Kovarikova, L Chlumska, V Cvrcek�- Proceedings of the 15th Euralex�…, 2012 - euralex.org
In this paper, the authors try to answer the basic lexicographical question: how do we know
whether a particular word is a mere word form, or a new lexeme that should thus be assigned …

Machine Learning in Terminology Extraction from Czech and English Texts

D Kov�ř�kov��- Linguistic Frontiers, 2021 - sciendo.com
The method of automatic term recognition based on machine learning is focused primarily
on the most important quantitative term attributes. It is able to successfully identify terms and …

[PDF][PDF] THE DICTIONARY OF CZECH CORE ACADEMIC VOCABULARY

D Kov�ř�kov�, M Škrabal�- Dictionaries and Society, 2022 - ids-pub.bsz-bw.de
Over the past two decades, several lists of academic words and academic phrases emerged,
often focused on L2 teaching or undergraduate students of academic writing classes (…

[PDF][PDF] How to Rank Word Senses According to Frequency

D Kov�ř�kov�, V Cvrček�- EURALEX XIX, 2021 - novaresearch.unl.pt
Introduction: Language corpora are linked to dictionaries from the very beginning of their
existence. The extent of language material in corpora led to a great simplification and …

Možnosti a meze korpusov� lingvistiky

V Cvrček, D Kovař�kov��- Naše řeč (Our Speech), 2011 - infona.pl
This paper addresses two most common comments on corpus linguistics: 1) a corpus is
merely a card file index in electronic form and 2) corpus linguistics covers only corpora …