DE3TC: Detecting Events with Effective Event Type Information and Context (pdf)

Article PDF cannot be displayed. You can download it here:

https://link.springer.com/content/pdf/10.1007/s11063-024-11570-8.pdf

DE3TC: Detecting Events with Effective Event Type Information and Context

Neural Processing Letters (2024) 56:89 https://doi.org/10.1007/s11063-024-11570-8 DE3TC: Detecting Events with Effective Event Type Information and Context Boyang Liu1 · Guozheng Rao1,3 · Xin Wang1,3 · Li Zhang2 · Qing Cong1 Accepted: 11 February 2024 / Published online: 6 March 2024 © The Author(s) 2024 Abstract Event Detection (ED) is a crucial information extraction task that aims to identify the event triggers and classify them into predefined event types. However, most existing methods did not perform well when processing events with implicit triggers. And most methods considered ED as a sentence-level task, lacking effective context for event semantics. Moreover, how to maintain good performance under low resource conditions still needs further study. To address these problems, we propose a novel end-to-end ED model called DE3TC, which Detects Events with Effective Event Type Information and Context. We construct an event type-specific Clue to capture the interaction between event type name and trigger words, providing event type information for implicit triggers. For accessing the effective context of event semantics for sentence-level ED, we consider the correlations between types and select similar types’ descriptions as context. With contextualized representation from a contextual encoder, DE3TC learns the event type information for all events including implicit ones. And it performs sentence-level ED efficiently with effective contexts. The empirical results on ACE 2005 and MAVEN datasets show that: (i) DE3TC obtains state-of-the-art performance compared with previous methods. (ii) DE3TC is also excelled under low-resource conditions. B. Liu and L. Zhang have contributed equally to this work B Xin Wang Boyang Liu Guozheng Rao Li Zhang Qing Cong 1 College of Intelligence and Computing, Tianjin University, Tianjin 300350, China 2 School of Economics and Management, Tianjin University of Science and Technology, Tianjin 300457, China 3 Tianjin Key Laboratory of Cognitive Computing and Applications, Tianjin 300350, China 123 89 Page 2 of 20 B. Liu et al. Keywords Event detection · Event type information · Effective context · Low-resource learning 1 Introduction Event Detection (ED) is an essential yet challenging information extraction task in the field of Natural Language Processing. An event is identified by a word or a phrase called event trigger which most represents that event. Given an input text, ED aims to identify the event triggers and classify them into predefined event types. For instance, in the input sentence “Stewart’s 1979 marriage to Alana Hamilton lasted five years and produced two children.”, an ED model needs to recognize the word “marriage” as an event trigger and predict its event type as “Life.Marry”. Early ED methods explored statistical information in the training sets and used patternbased methods [1–3]. Due to the excellent performance of neural network in the field of natural language processing, many ED models used various neural networks to extract contextual semantic features of events [4–6], such as Convolutional Neural Networks (CNN) [4], Recurrent Neural Network (RNN) [5], and Graph Convolutional Network (GCN) [6]. Recently, with the development of the Pre-trained Language Models [7–10] based on the transformer [11] architecture, many powerful ED models have emerged, which better understand the semantics of events in the context and have made significant improvements [12–14]. Some of them regarded ED tasks as question answering (QA)/machine reading comprehension tasks and detected events by finding answers to pre-defined questions [14–16]. For instance, Du et al. [14] formulated ED as a QA task and designed several queries for event triggers. Some generation-based methods manually defined templates or output formats to accomplish event detection [17–19]. They utilized event type information and achieved good performance. Lu et al. [18] designed a sequence-to-structure network and generated different structures for different event types. Hsu et al. [19] proposed a generation-based method and manually designed a prompt containing event type descriptions to guide the process of event detection. According to Liu et al. [20], to better leverage the capabilities of pre-trained language models (LM), various methods reformulate downstream tasks, making them more akin to those solved during the original LM training with the help of a textual prompt. The prompt is derived from the original input using a designed template and serves as the input for the LM. Prompt-based methods reformulate the target task into a generative task, attempting to learn a LM to perform the original task, reducing or obviating the need for large supervised datasets. This process requires additional labor for both task reformulation and template design. There are still three problems as follows that have not been solved: (1) How to detect event with implicit trigger simply but efficiently? In real-life situations, events within the text may often be implicit. In these cases, trigger words do not explicitly convey the semantics of events. For example, as shown in Fig. 1 (Example 1), we can easily tell that the word “marriage” is the trigger for the event type “Life.Marry”. However, in Example 2, it is challenging to determine that “deployed” is the trigger word for the event type “Movement.Transport”. Therefore, we need event type information to identify such trigger words. Most existing methods were unable to handle this situation satisfactorily. The interaction between event types and trigger words provides event type information, which greatly aids in identifying implicit trigger words. Some excellent methods based on QA or machine 123 DE3TC: Detecting Events with Effective Event Type… Page 3 of 20 89 Fig. 1 Two examples of sentence-level event detection from ACE 2005 dataset reading comprehension only provided a keyword as a query for the event type, for example, “[EVENT]”. However, these methods failed to obtain event type information. Some promptbased methods designed different templates for each event type. They may capture the event type information to some extent. But they required a significant amount of manual effort, which is not feasible in real-life applications. (2) How to access effective context of event semantic for sentence-level ED? Context is essential for semantic understanding. The sentence-level event detection task often lacks effective context. It is difficult to identify the trigger words based solely on the semantics of a single sentence. As shown in Fig. 1 (Example 2), the sentence lacks effective contextual information of event semantic. If we provide more event semantics for “Movement.Transport”, the model can identify “deployed” as the trigger word more easily. Furthermore, most existing methods ignored the correlations between similar event types, which provide valuable contextual information of event (...truncated)