다중 레이블 다중 클래스 의도의 공동 추출 및 감지를 위한 포인터 네트워크 기반 접근법

초록

작업 지향 대화 시스템에서 의도 감지는 사용자 쿼리를 해석하고 적절한 응답을 제공하는 데 중요합니다. 기존 연구는 주로 단일 의도를 갖는 간단한 쿼리에 대해 다루며, 여러 의도를 갖는 복잡한 쿼리를 처리하고 다양한 의도 구간을 추출하는 효과적인 시스템이 부족합니다. 또한, 다국어, 다중 의도 데이터셋이 부족한 것이 두드러집니다. 본 연구는 세 가지 중요한 작업을 다룹니다: 쿼리로부터 여러 의도 구간 추출, 다중 의도 감지, 그리고 다국어 다중 레이블 의도 데이터셋 개발. 우리는 기존 벤치마크 데이터셋에서 선별한 새로운 다중 레이블 다중 클래스 의도 감지 데이터셋(MLMCID-dataset)을 소개합니다. 또한, 섹스투플 형식의 코어스 및 파인 그레인 레이블로 다중 의도를 추출하고 감지하는 포인터 네트워크 기반 아키텍처(MLMCID)를 제안합니다. 포인터 네트워크 기반 시스템이 다양한 데이터셋에서 정확도와 F1 점수 측면에서 기존 방법보다 우월함을 포괄적으로 분석한 결과를 통해 입증합니다.

English

In task-oriented dialogue systems, intent detection is crucial for interpreting user queries and providing appropriate responses. Existing research primarily addresses simple queries with a single intent, lacking effective systems for handling complex queries with multiple intents and extracting different intent spans. Additionally, there is a notable absence of multilingual, multi-intent datasets. This study addresses three critical tasks: extracting multiple intent spans from queries, detecting multiple intents, and developing a multi-lingual multi-label intent dataset. We introduce a novel multi-label multi-class intent detection dataset (MLMCID-dataset) curated from existing benchmark datasets. We also propose a pointer network-based architecture (MLMCID) to extract intent spans and detect multiple intents with coarse and fine-grained labels in the form of sextuplets. Comprehensive analysis demonstrates the superiority of our pointer network-based system over baseline approaches in terms of accuracy and F1-score across various datasets.

다중 레이블 다중 클래스 의도의 공동 추출 및 감지를 위한 포인터 네트워크 기반 접근법

A Pointer Network-based Approach for Joint Extraction and Detection of Multi-Label Multi-Class Intents

초록

Support