A Survey of Information Extraction Based on Deep Neural Networks

DAI Jianhua; PENG Ruoyao; XU Lu; JIANG Chao; ZENG Daojian; LI Yangding

doi:10.13718/j.cnki.xsxb.2022.04.001

2022 Volume 47 Issue 4

DAI Jianhua, PENG Ruoyao, XU Lu, et al. A Survey of Information Extraction Based on Deep Neural Networks[J]. Journal of Southwest China Normal University (Natural Science Edition), 2022, 47(4): 1-11. doi: 10.13718/j.cnki.xsxb.2022.04.001


Research Institute of Languages and Cultures/ Hunan Provincial Key Laboratory of Intelligent Computing and Language Information Processing, Hunan Normal University, Changsha 410081, China

Received Date: 24/12/2021
Available Online: 20/04/2022
CLC number: TP18

Abstract

Information extraction aims to extract structured information from unstructured text, enabling the automatic classification, extraction, and reconstruction of massive amounts of information and making that information more usable. In recent years, information extraction based on deep neural networks has become one of the most significant research topics in natural language processing. It offers an effective way to analyze unstructured text, helps turn big data into a broadly applicable source of resources and knowledge, and supports higher-level applications and tasks. This paper reviews research on information extraction based on deep neural networks. First, the task definition, goals, and significance of information extraction are briefly described, followed by an analysis of how the task has developed. Then, the development of key technologies in recent years is summarized from four aspects: entity extraction, entity relation extraction, event extraction, and event relation extraction. Finally, future development trends in the field of information extraction are analyzed and discussed.
Keywords: information extraction; deep neural network; entity extraction; entity relation extraction; event extraction; event relation extraction

