RussianSuperGLUE: A Russian Language Understanding Evaluation Benchmark

Unreasonable Effectiveness of Rule-Based Heuristics in Solving Russian SuperGLUE Tasks

May 3, 2021


Tatyana Iazykova, Denis Kapelyushnik, ... , Andrey Kutuzov

Computation and Language

Leader-boards like SuperGLUE are seen as important incentives for active development of NLP, since they provide standard benchmarks for fair comparison of modern language models. They have driven the world's best engineering teams as well as their resources to collaborate and solve a set of tasks for general language understanding. Their performance scores are often claimed to be close to or even higher than the human performance. These results encouraged more thorough analys...


Russian SuperGLUE 1.1: Revising the Lessons not Learned by Russian NLP models

February 15, 2022


Alena Fenogenova, Maria Tikhonova, Vladislav Mikhailov, Tatiana Shavrina, Anton Emelyanov, Denis Shevelev, Alexandr Kukushkin, ... , Ekaterina Artemova

Computation and Language

Artificial Intelligence

In the last year, new neural architectures and multilingual pre-trained models have been released for Russian, which led to performance evaluation problems across a range of language understanding tasks. This paper presents Russian SuperGLUE 1.1, an updated benchmark styled after GLUE for Russian NLP models. The new version includes a number of technical, user experience and methodological improvements, including fixes of the benchmark vulnerabilities unresolved in the prev...


A Family of Pretrained Transformer Language Models for Russian

September 19, 2023


Dmitry Zmitrovich, Alexander Abramov, Andrey Kalmykov, Maria Tikhonova, Ekaterina Taktasheva, Danil Astafurov, Mark Baushenko, Artem Snegirev, Tatiana Shavrina, Sergey Markov, ... , Alena Fenogenova

Computation and Language

Nowadays, Transformer language models (LMs) represent a fundamental component of the NLP research methodologies and applications. However, the development of such models specifically for the Russian language has received little attention. This paper presents a collection of 13 Russian Transformer LMs based on the encoder (ruBERT, ruRoBERTa, ruELECTRA), decoder (ruGPT-3), and encoder-decoder (ruT5, FRED-T5) models in multiple sizes. Access to these models is readily available ...


RuMedBench: A Russian Medical Language Understanding Benchmark

January 17, 2022


Pavel Blinov, Arina Reshetnikova, Aleksandr Nesterov, ... , Vladimir Kokh

Computation and Language

Artificial Intelligence

Machine Learning

The paper describes the open Russian medical language understanding benchmark covering several task types (classification, question answering, natural language inference, named entity recognition) on a number of novel text sets. Given the sensitive nature of the data in healthcare, such a benchmark partially closes the problem of Russian medical dataset absence. We prepare the unified format labeling, data split, and evaluation metrics for new tasks. The remaining tasks are f...


TAPE: Assessing Few-shot Russian Language Understanding

October 23, 2022


Ekaterina Taktasheva, Tatiana Shavrina, Alena Fenogenova, Denis Shevelev, Nadezhda Katricheva, Maria Tikhonova, Albina Akhmetgareeva, Oleg Zinkevich, Anastasiia Bashmakova, Svetlana Iordanskaia, Alena Spiridonova, Valentina Kurenshchikova, ... , Vladislav Mikhailov

Computation and Language

Recent advances in zero-shot and few-shot learning have shown promise for a scope of research and practical purposes. However, this fast-growing area lacks standardized evaluation suites for non-English languages, hindering progress outside the Anglo-centric paradigm. To address this line of research, we propose TAPE (Text Attack and Perturbation Evaluation), a novel benchmark that includes six more complex NLU tasks for Russian, covering multi-hop reasoning, ethical concepts...


GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding

April 20, 2018


Alex Wang, Amanpreet Singh, Julian Michael, Felix Hill, ... , Samuel R. Bowman

Computation and Language

For natural language understanding (NLU) technology to be maximally useful, both practically and as a scientific object of study, it must be general: it must be able to process language in a way that is not exclusively tailored to any one specific task or dataset. In pursuit of this objective, we introduce the General Language Understanding Evaluation benchmark (GLUE), a tool for evaluating and analyzing the performance of models across a diverse range of existing NLU tasks. ...


bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark

June 4, 2023


Momchil Hardalov, Pepa Atanasova, Todor Mihaylov, Galia Angelova, Kiril Simov, Petya Osenova, Ves Stoyanov, Ivan Koychev, ... , Dragomir Radev

Computation and Language

Information Retrieval

Machine Learning

We present bgGLUE (Bulgarian General Language Understanding Evaluation), a benchmark for evaluating language models on Natural Language Understanding (NLU) tasks in Bulgarian. Our benchmark includes NLU tasks targeting a variety of NLP problems (e.g., natural language inference, fact-checking, named entity recognition, sentiment analysis, question answering, etc.) and machine learning tasks (sequence labeling, document-level classification, and regression). We run the first sy...


RuSentEval: Linguistic Source, Encoder Force!

February 28, 2021


Vladislav Mikhailov, Ekaterina Taktasheva, ... , Ekaterina Artemova

Computation and Language

The success of pre-trained transformer language models has brought a great deal of interest on how these models work, and what they learn about language. However, prior research in the field is mainly devoted to English, and little is known regarding other languages. To this end, we introduce RuSentEval, an enhanced set of 14 probing tasks for Russian, including ones that have not been explored yet. We apply a combination of complementary probing methods to explore the distri...


Adaptation of Deep Bidirectional Multilingual Transformers for Russian Language

May 17, 2019


Yuri Kuratov, Mikhail Arkhipov

Computation and Language

The paper introduces methods of adaptation of multilingual masked language models for a specific language. Pre-trained bidirectional language models show state-of-the-art performance on a wide range of tasks including reading comprehension, natural language inference, and sentiment analysis. At the moment there are two alternative approaches to train such models: monolingual and multilingual. While language specific models show superior performance, multilingual models allow ...


The Russian-focused embedders' exploration: ruMTEB benchmark and Russian embedding model design

August 22, 2024


Artem Snegirev, Maria Tikhonova, Anna Maksimova, ... , Alexander Abramov

Computation and Language

Artificial Intelligence

Embedding models play a crucial role in Natural Language Processing (NLP) by creating text embeddings used in various tasks such as information retrieval and assessing semantic text similarity. This paper focuses on research related to embedding models in the Russian language. It introduces a new Russian-focused embedding model called ru-en-RoSBERTa and the ruMTEB benchmark, the Russian version extending the Massive Text Embedding Benchmark (MTEB). Our benchmark includes seve...

