A Context Preposition Disambiguation Method Based on Artificial Intelligence Neural Network

ZHANG Ming; LIAO Xi

doi:10.13718/j.cnki.xdzk.2024.10.019

2024 Volume 46 Issue 10

Article Contents

Previous Article Next Article

ZHANG Ming, LIAO Xi. A Context Preposition Disambiguation Method Based on Artificial Intelligence Neural Network[J]. Journal of Southwest University Natural Science Edition, 2024, 46(10): 222-232. doi: 10.13718/j.cnki.xdzk.2024.10.019

Citation:

ZHANG Ming, LIAO Xi. A Context Preposition Disambiguation Method Based on Artificial Intelligence Neural Network[J]. Journal of Southwest University Natural Science Edition, 2024, 46(10): 222-232. doi: 10.13718/j.cnki.xdzk.2024.10.019

A Context Preposition Disambiguation Method Based on Artificial Intelligence Neural Network

ZHANG Ming¹,
LIAO Xi²

1.
School of Software, Chengdu Polytechnic, Chengdu 610041, China
2.
School of Communication and Information Engineering, Chongqing University of Posts and Telecommunication, Chongqing 400065, China

More Information

Received Date: 18/12/2023
Available Online: 20/10/2024
MSC: TP393

Abstract

The analysis of prepositional structures presents challenges in effectively classifying prepositions and their structures, mining their semantic information, and effectively disambiguating prepositional structures. To address this challenge, this paper proposes an ACNN-Tree-LSTM model that combines artificial intelligence and neural network techniques, aiming to solve the problem of context-based preposition disambiguation in natural language processing. The core idea is to introduce an attention mechanism to focus the model's attention on key information in the context, which is relevant to the meaning of the preposition. In this study, the context parsing tree and context word embeddings were first embedded to capture the semantic relationships between context words. Then, the Tree-LSTM model was utilized to generate hidden features for each node in the tree, and the context representation of tree nodes was computed by recursively tracking propagation along different branches of the tree. Finally, to reduce the influence of noise on key information related to the meaning of the preposition in the context, an attention mechanism was introduced to enable the model to focus on the crucial parts of the reference document that require disambiguation. This approach allows the model to automatically select and pay attention to the vocabulary mostly relevant to the current prepositional meaning, thereby improving disambiguation accuracy. Experimental results on the Semeval 2013 Task 12 Word Sense Disambiguation dataset demonstrated that the proposed model achieved an F1-score of 88.04%, outperforming existing mainstream deep learning models and validating the effectiveness of the proposed approach.
- artificial intelligence,
- neural networks,
- preposition disambiguation,
- deep learning,
- attention mechanism

References

[1]	淑娴陈. 词义消歧研究综述[J]. 教育科学发展, 2022, 4(1): 137-139. Google Scholar
[2]	何春辉, 胡升泽, 张翀, 等. 融合深层语义和显式特征的中文句子对相似性判别方法[J]. 中文信息学报, 2022, 36(9): 28-37. Google Scholar
[3]	ARSHEYM, ANGEL VIJI K S. An Optimization-Based Deep Belief Network for the Detection of Phishing E-Mails[J]. Data Technologies and Applications, 2020, 54(4): 529-549. doi: 10.1108/DTA-02-2020-0043 CrossRef Google Scholar
[4]	CHAUHAN S, DANIEL P, SAXENA S, et al. Fully Unsupervised Machine Translation Using Context-Aware Word Translation and DenoisingAutoencoder[J]. Applied Artificial Intelligence, 2022, 36(1): 1771-1795. Google Scholar
[5]	LOUREIRO D, REZAEE K, PILEHVAR M T, et al. Analysis and Evaluation of Language Models for Word Sense Disambiguation[J]. Computational Linguistics, 2021, 47(2): 387-443. Google Scholar
[6]	LI J, SUNA X, HAN J L, et al. A Survey on Deep Learning for Named Entity Recognition[J]. IEEE Transactions on Knowledge and Data Engineering, 2022, 34(1): 50-70. doi: 10.1109/TKDE.2020.2981314 CrossRef Google Scholar
[7]	段宗涛, 李菲, 陈柘. 实体消歧综述[J]. 控制与决策, 2021, 36(5): 1025-1039. Google Scholar
[8]	ALOKAILI A, ELBACHIRMENAIM. SVM Ensembles for Named Entity Disambiguation[J]. Computing, 2020, 102(4): 1051-1076. doi: 10.1007/s00607-019-00748-x CrossRef Google Scholar
[9]	MADE JULIARTA I. Prepositional Phrase and Its Translations Found in the Novel "Budha, a Story of Enlightenment"[J]. E-Journal of Linguistics, 2021, 5(1): 28-47. Google Scholar
[10]	NGHI T T, THANG N T, PHUC T H. An Investigation into Factors Affecting the Use of English Prepositions by Vietnamese Learners of English[J]. International Journal of Higher Education, 2020, 10(1): 24-40. doi: 10.5430/ijhe.v10n1p24 CrossRef Google Scholar
[11]	王奥, 吴华瑞, 朱华吉. 基于特征增强的多方位农业问句语义匹配[J]. 西南大学学报(自然科学版), 2023, 45(6): 201-210. doi: 10.13718/j.cnki.xdzk.2023.06.020 CrossRef Google Scholar
[12]	范齐楠, 孔存良, 杨麟儿, 等. 基于BERT与柱搜索的中文释义生成[J]. 中文信息学报, 2021, 35(11): 80-90. Google Scholar
[13]	陆伟, 李鹏程, 张国标, 等. 学术文本词汇功能识别——基于BERT向量化表示的关键词自动分类研究[J]. 情报学报, 2020, 39(12): 1320-1329. Google Scholar
[14]	张国标, 李鹏程, 陆伟, 等. 多特征融合的关键词语义功能识别研究[J]. 图书情报工作, 2021, 65(9): 89-96. Google Scholar
[15]	宋晓涛, 孙海龙. 基于神经网络的自动源代码摘要技术综述[J]. 软件学报, 2022, 33(1): 55-77. Google Scholar
[16]	YUE W, LI L. Sentiment Analysis Using Word2Vec-CNN-BiLSTM Classification[C]//2020 Seventh International Conference on Social Networks Analysis, Management and Security (SNAMS). Paris: IEEE, 2020. Google Scholar
[17]	RADKE M A, GUPTA A, STOCK K, et al. Disambiguating Spatial Prepositions: The Case of Geo-Spatial Sense Detection[J]. Transactions in GIS, 2022, 26(6): 2621-2650. Google Scholar
[18]	PAWAR S, THOMBRE S, MITTAL A, et al. Tapping BERT for Preposition Sense Disambiguation[EB/OL]. 2021: arXiv: 2111. 13972. http://arxiv.org/abs/2111.13972. Google Scholar

Access History

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Figures(6) / Tables(3)

Export Citation

PDF

XML

Article Metrics

Article views(5174) PDF downloads(311) Cited by(0)

Access History

Other Articles By Authors

on this site
- ZHANG Ming
- LIAO Xi
on Google Scholar
- ZHANG Ming
- LIAO Xi

HTML

开放科学（资源服务）标识码（OSID）：
随着人工智能领域的迅猛发展，自然语言处理(Natural Language Processing，NLP)作为一个重要的研究方向受到了广泛关注. NLP旨在使计算机能够理解和处理人类语言，其中上下文理解是其中一个重要的任务. 在自然语言中，词义消歧(Word Sense Disambiguation，WSD)逐步引起了研究者们的关注^[1].

由于神经网络强大的非线性拟合能力，因此深度学习模型被广泛用于词义消歧. 例如，何春辉等^[2]基于余弦相似度，对未标记的语料使用长短期记忆(Long Short-Term Memory，LSTM)模型结合有标注语料上下文语境进行消歧. Arshey等^[3]选用多义词邻接的4个词的词形、词性和语义作为特征，使用深度信念网络(Deep Belief Networks，DBN)模型来消歧. Chauhan等^[4]将深度学习模型应用于词义消歧，直接优化了给定相似度的文档，在无监督预训练阶段使用叠加去噪自动编码器来学习初始文档表示，最后进行微调. 由于短文本语义的稀疏性，需要深度神经网络来进一步探索语义，但过度的深度堆叠自动编码器容易出现梯度消失问题. Loureiro等^[5]利用堆叠双向长短期记忆网络(Bidirectional Long Short-Term Memory，Bi-LSTM)的双重关注机制，从上下文特征、词义描述特征等方面计算词义之间的关联性. 然而，该模型侧重于句子的长距离依赖，没有考虑文本的局部特征. Li等^[6]根据双向LSTM编码器捕获了提及的词汇、句法和本地文本信息，并使用卷积神经网络(Convolutional Neural Network，CNN)与细粒度类型的结构化信息源相结合对实体文档进行建模. 结构化信息包括对知识库的描述. 知识库的质量直接影响到实体消歧结果，而该研究没有考虑到数据集中实体与知识库中的实体不一致的情况. 段宗涛等^[7]提出了用于词义消歧的成对连接. 通过模拟Kruskal算法，使用配对连接算法来近似解决MINTREE(基于树的词义消歧目标)问题，并使用Word2vec中的跳格方法来完成文本矢量化表示. 这种方法生成的词向量与词本身一一对应，不能反映同一词在不同语境中的真实含义. Alokaili等^[8]使用一个联合排名框架来寻找相似或相关的实体以消除歧义，他们提出用一个词义消歧框架来扩展概念性的短文嵌入模型，并使用注意力模型来选择相关的词进行预测. 然而，在处理短文NLP任务时文本中包含的有用信息相对较少，仅靠注意力机制无法获得完整的语义知识.

介词属于助词，在语言中所占的比例并不大，但却是一个重要而常见的词类. 它是虚词中常见的一类，用来表示词与词或句之间的关系，介词不能单独作句子成分，需要与其他实词共同构成介词短语，作用是在句子中修饰、补充谓语，揭示与动作、性状等有关的如时间、地点、比较、施事、受事、对象、方式等.

上下文介词消歧的重要性在于，它在多个NLP任务中扮演着关键角色，包括句法分析、语义理解、机器翻译和信息检索等，准确地识别上下文介词的含义对于这些任务的性能和效果具有重要意义. 然而，由于上下文的复杂性和多义性，上下文介词消歧仍然是一个具有挑战性的问题. 当前的研究主要集中在传统的基于规则和统计的方法上，这些方法通常依赖于手工特征工程和人工定义的规则，限制了其在复杂上下文中的适应能力. 此外，这些方法往往无法充分利用大规模数据和深度学习的优势，因此在处理复杂语义场景时存在一定的局限性.

为了解决上述问题，本文结合人工智能和神经网络技术，提出一种带有长短期记忆基于注意力机制的树递归神经网络模型(ACNN-Tree-LSTM)用于上下文介词消歧，旨在通过深度学习模型的应用，提高消歧任务的准确性和泛化能力. 实验验证了本文模型能够更好地利用上下文信息，并有效地解决了介词歧义问题. 通过与其他先进的深度学习模型在大型词义消歧数据集上的比较结果表明，本文模型在上下文介词消歧任务中具有非常明显的优越性.

本文的研究具有以下几个方面的意义：

方法创新：本文探索并提出了一种新颖的基于人工智能神经网络的上下文介词消歧方法，使用Tree-LSTM作为文本表示，通过树递归神经网络捕获语义信息，并添加自注意力机制来进一步学习特征信息. 该方法将深度学习模型与上下文建模相结合，在介词消歧领域提供了一种新的解决方案，并为上下文理解任务的研究和应用提供了新的思路.

提高任务准确性：本文的方法旨在提高上下文介词消歧任务的准确性. 通过充分利用深度学习模型的学习能力和泛化能力，尽可能减少消歧任务中的误判和错误分类，从而提高整体任务的准确性.

4. 结论

近年来的研究发现，歧义消除和理解主要集中在实词(代词、名词、动词等)的消解，对于高频虚词如介词的歧义消除和理解研究较为有限. 因此，本文提出一种基于人工智能的上下文介词消歧方法，采用带有长短期记忆功能的树递归神经网络模型. 该方法使用带有LSTM的TNN(Tree-LSTM)对上下文进行建模，并通过注意力机制捕捉关键信息，以减少噪声对介词含义的影响. 通过多层神经网络结构，模型能够捕捉上下文中的局部依赖关系和长期依赖关系，更好地理解介词的语义含义. 引入注意力机制能够更准确地捕捉与介词含义相关的信息，从而提高消歧性能. 实验结果证明，本文模型在准确性和泛化能力方面具有优越性. 本文为相关领域的研究人员提供了有益的参考和启发，推动了上下文介词消歧技术的进一步发展和应用. 此外，本文模型还表现出较强的鲁棒性，能够适应不同领域和语义场景下的消歧需求. 然而，我们也注意到该模型在处理长文本和复杂语义场景时仍存在一定的挑战，这可能是注意力机制的局限性所致. 未来的研究将进一步探索改进注意力机制，以提升模型在复杂语境下的性能.

Figure (6) Table (3) Reference (18)

Name
	Name cannot be empty!
E-mail
	Mailbox cannot be empty! Mailbox cannot be empty!
Telephone
	Mobile number cannot be empty! Please enter a valid mobile number!
Title

Content
Verification Code

Message Board

A Context Preposition Disambiguation Method Based on Artificial Intelligence Neural Network

Abstract

References

Access History

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Access History

Other Articles By Authors