User profiles for Dominika Kováříková
Dominika Kovarikova, Univerzita Karlova. Verified email at korpus.cz. Cited by 1011
SYN2015: Representative corpus of contemporary written Czech
…, L Chlumská, T Jelínek, D Kováříková… - Proceedings of the …, 2016 - aclanthology.org
The paper concentrates on the design, composition and annotation of SYN2015, a new 100-million
representative corpus of contemporary written Czech. SYN2015 is a sequel of the …
The structuralist tradition meets empirical data: Corpus data enhancing the Czech Internet Language Reference Book
This paper demonstrates how the corpus grammar tool GramatiKat can be used to improve
and refine morphological information in the Internet Language Reference Book (ILRB), which …
Automatic Identification of Academic Phrases for Czech
D Kováříková, O Kovářík - … Europhras 2019, Malaga, Spain, September 25 …, 2019 - Springer
The aim of this study is to automatically extract academic phrases in Czech using data-mining
techniques as a first step towards creating a dictionary of academic words and phrases …
Lexicographer's lacunas or how to deal with missing representative dictionary forms on the example of Czech
D Kováříková, M Škrabal, V Cvrček… - International Journal …, 2020 - academic.oup.com
When compiling a list of headwords, every lexicographer comes across words with an
unattested representative dictionary form in the data. This study focuses on how to distinguish …
Sharing data through specialized corpus-based tools: The case of GramatiKat
D Kováříková - Jazykovedný časopis, 2021 - ceeol.com
This paper presents a specialized corpus tool GramatiKat in the context of Open Science
principles, namely data sharing, which offers opportunities for original research and facilitates …
What belongs in a dictionary? The Example of Negation in Czech
D Kovarikova, L Chlumska, V Cvrcek - Proceedings of the 15th Euralex …, 2012 - euralex.org
In this paper, the authors try to answer the basic lexicographical question: how do we know
whether a particular word is a mere word form, or a new lexeme that should thus be assigned …
Machine Learning in Terminology Extraction from Czech and English Texts
D Kováříková - Linguistic Frontiers, 2021 - sciendo.com
The method of automatic term recognition based on machine learning is focused primarily
on the most important quantitative term attributes. It is able to successfully identify terms and …
THE DICTIONARY OF CZECH CORE ACADEMIC VOCABULARY
D Kováříková, M Škrabal - Dictionaries and Society, 2022 - ids-pub.bsz-bw.de
Over the past two decades, several lists of academic words and academic phrases emerged,
often focused on L2 teaching or undergraduate students of academic writing classes (…
How to Rank Word Senses According to Frequency
D Kováříková, V Cvrček - EURALEX XIX, 2021 - novaresearch.unl.pt
Introduction: Language corpora are linked to dictionaries from the very beginning of their
existence. The extent of language material in corpora led to a great simplification and …
Možnosti a meze korpusové lingvistiky (Possibilities and Limits of Corpus Linguistics)
V Cvrček, D Kovaříková - Naše řeč (Our Speech), 2011 - infona.pl
This paper addresses the two most common comments on corpus linguistics: 1) a corpus is
merely a card file index in electronic form and 2) corpus linguistics covers only corpora …