CLARIN-PL
504 research outputs found
Guidelines for annotating consumer reviews with basic emotions
Guidelines for annotating Polish consumer reviews with basic emotions
DiaBiz ASR benchmark
An evaluation report, with accompanying datasets, benchmarking the performance of commercially available ASR services for Polish on the DiaBiz corpus
The subset of Polish pluralia tantum
The subset of pluralia tantum extracted from the set of inter-lingual hyponyms between plWordNet and Princeton WordNet. The subset is tagged with the types of gaps and mismatches occurring between Polish and English and with the equivalence types wherever possible
Street name changes in Poznań, Słubice and Zbąszyń, Poland 1916-2018
The corpus presents a historical overview of street and place (park, bridge, square) name changes in the years 1916-2018 for three Polish cities: Poznań, Słubice and Zbąszyń. Included are the data for 2,582 streets in Poznań, 139 streets in Słubice and 105 streets in Zbąszyń, marked for the year of the introduction of a street name, the year when the name was changed or translated (if applicable), and the year when the name was removed (if applicable)
DiaBiz.Kom sample 1.0
DiaBiz.Kom sample is a sample of the DiaBiz.Kom corpus, a dialogue corpus comprising transcriptions of phone-based customer-agent interactions in several key business domains, annotated with dialogue acts.
Citation:
Oleksy, M., Wieczorek, J., Drużyłowska, D., Klyus, J., Domogała, A., Hwaszcz, K., ... & Wróż, A. (2022, October). DiaBiz.Kom - towards a Polish Dialogue Act Corpus Based on ISO 24617-2 Standard. In Proceedings of the 29th International Conference on Computational Linguistics (pp. 3631-3638).
(https://aclanthology.org/2022.coling-1.320)
DiaBiz.Kom is based on the DiaBiz corpus: http://docs.pelcra.pl/doku.php?id=diabi
plWordNet 4.2 (CLARIN-BIZ-START)
plWordNet (Słowosieć) from July 2020, used as the main resource for word sense disambiguation tasks in 2020-2022; the database also includes the mapping to Princeton WordNet 3.1 and the PWN database
PoLitBert_v32k_cos1_5_50k - Polish RoBERTa model
Polish RoBERTa model trained on Polish Wikipedia, Polish literature and Oscar
Wizerunek Andreja Babiša i Mateusza Morawieckiego w kontekście sytuacji kryzysowej 2020 / The image of Andrej Babiš and Mateusz Morawiecki in the context of the 2020 crisis situation
A collection of articles from the Czech press about Mateusz Morawiecki (iDnes) and from the Polish press about Andrej Babiš (Rzeczpospolita