Anomaly detection for short texts: Identifying whether your chatbot should switch from goal-oriented conversation to chit-chatting

Goal-oriented conversational agents are systems able converse with humans using natural language to help them reach a certain goal. The number of goals (or domains) about which an agent could converse is limited, and one of the issues is to identify whether a user talks about the unknown domain (in order to report a misunderstanding or switch to chit-chatting mode). We argue that this issue could be resolved if we consider it as an anomaly detection task which is in a field of machine learning. The scientific community developed a broad range of methods for resolving this task, and their applicability to the short text data was never investigated before. The aim of this work is to compare performance of 6 different anomaly detection methods on Russian and English short texts modeling conversational utterances, proposing the first evaluation framework for this task. As a result of the study, we find out that a simple threshold for cosine similarity works better than other methods for both of the considered languages. © Springer Nature Switzerland AG 2018.

Authors

Bakarov A.^1, ² , Yadrintsev V. ^2, ⁴ , Sochenkov I. ^2, ³

Journal

Communications in Computer and Information Science

Publisher

Springer Verlag

Language

English

Pages

289-298

Status

Published

DOI

10.1007/978-3-030-02846-6_23

Volume

859

Year

2018

Organizations

¹ The National Research University Higher School of Economics, Moscow, Russian Federation
² Federal Research Center ‘Computer Science and Control’ of Russian Academy of Sciences, Moscow, Russian Federation
³ Skolkovo Institute of Science and Technology, Moscow, Russian Federation
⁴ Peoples’ Friendship University of Russia (RUDN University), Moscow, Russian Federation

Keywords

Anomaly detection; Chatbot; Conversational agent; Distributional semantics; Novelty detection; Word embeddings

Cite

ГОСТ MLA RIS BibTex

HUMAN UMBILICAL CORD TISSUE CRYOPRESERVATION: PROSPECTS FOR CLINICAL APPLICATION

Article

Strokova S.O., Arutyunyan I.V., Mullabaeva S.M., Fatkhudinov T.K.

Akusherstvo i Ginekologiya. ООО «Бионика Медиа». Vol. 2018. 2018. P. 5-10

ASSOCIATION OF OBESITY IN SHIFT WORKERS WITH THE MINOR ALLELE OF A SINGLE-NUCLEOTIDE POLYMORPHISM (RS4851377) IN THE LARGEST CIRCADIAN CLOCK GENE (NPAS2)

Article

Dorokhov V.B., Puchkova A.N., Arsen’ev G.N., Slominsky P.A., Dementienko V.V., Sveshnikov D.S., Putilov A.A.

Biological Rhythm Research. Taylor and Francis Ltd.. 2018.

Anomaly detection for short texts: Identifying whether your chatbot should switch from goal-oriented conversation to chit-chatting

Other records

HUMAN UMBILICAL CORD TISSUE CRYOPRESERVATION: PROSPECTS FOR CLINICAL APPLICATION

ASSOCIATION OF OBESITY IN SHIFT WORKERS WITH THE MINOR ALLELE OF A SINGLE-NUCLEOTIDE POLYMORPHISM (RS4851377) IN THE LARGEST CIRCADIAN CLOCK GENE (NPAS2)

Cite