Semantic Link Based Cyber Community Discovery Model

REN Wei; RUAN Yuqi; HAN Mengkai; QIU Yuhui

doi:10.13718/j.cnki.xdzk.2022.09.018

As a new way of publishing and sharing resources, the network community is one of the hot spots in research on network growth. Because clustering is partly random and existing partitioning algorithms make insufficient use of the semantics of individual node and link attributes, communities in social networks cannot be classified satisfactorily. This paper proposes a network community discovery model based on semantic reasoning, which extracts multi-layer semantics from the network topology and the semantic connections between nodes. Experimental results on the ego-Facebook dataset show that the algorithm is more stable and better suited to networks with rich semantics and sparse structure when the community size grows beyond 1,000.

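Community discovery is the task of decomposing a complex network topology into meaningful clusters of nodes^[1]. Mainstream community discovery methods are currently based on clustering and find common groups through unsupervised learning^[2-3]. However, because node attributes differ across existing networks and definitions of a community vary widely, clustering results carry a degree of randomness and cannot achieve precise classification^[4]. In addition, existing partitioning algorithms consider only flat data and do not make full use of the semantics of individual and link attributes, which leads to inaccurate partitions^[5-6]. Moreover, current community discovery algorithms mainly work by converting a directed graph into an undirected one, a step that can lose many details and make the resulting partition inaccurate. To describe relationships between nodes accurately, directionality is added to the community network^[7]; describing different kinds of links allows social networks and other real-world networks to be modeled more faithfully.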

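Both the Satuluri algorithm^[8] and the LSW-OCD algorithm^[9] convert a directed graph into an undirected graph with direction-dependent weights according to node vectors. Although LSW-OCD substantially improves on the complexity of the Satuluri algorithm, the semantics implicit in the nodes, and the relationships among those semantics, play no role in either algorithm. Semantic research is concerned not only with the meaning of concepts but also with the relationships between meanings. In a social network, user behavior is closely tied to the user's own characteristics, interests, and habits. Introducing semantics into community discovery makes it possible to mine non-data information and thus supports a more precise partition of network communities. Given this potential, this paper proposes a network community discovery model based on semantic reasoning.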
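To make the directed-to-undirected conversion concrete, the following minimal sketch shows two symmetrizations that are commonly discussed for directed graphs: the plain sum A + A^T and the bibliometric form A A^T + A^T A. It is an illustration under stated assumptions, not the exact procedure of the Satuluri or LSW-OCD algorithms; the function names and the toy graph are hypothetical.

```python
# Illustrative sketch only: function names and the toy graph are assumptions,
# not the implementation used by the cited algorithms.

def transpose(a):
    """Return the transpose of a square matrix given as a list of lists."""
    n = len(a)
    return [[a[j][i] for j in range(n)] for i in range(n)]


def matmul(a, b):
    """Multiply two square matrices given as lists of lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def symmetrize_sum(a):
    """U = A + A^T: keep every edge but discard its direction."""
    at = transpose(a)
    n = len(a)
    return [[a[i][j] + at[i][j] for j in range(n)] for i in range(n)]


def symmetrize_bibliometric(a):
    """U = A A^T + A^T A: weight node pairs by shared targets and sources."""
    at = transpose(a)
    shared_targets = matmul(a, at)   # entry (i, j): common out-neighbours
    shared_sources = matmul(at, a)   # entry (i, j): common in-neighbours
    n = len(a)
    return [[shared_targets[i][j] + shared_sources[i][j] for j in range(n)]
            for i in range(n)]


# Toy directed graph with edges 0 -> 2, 1 -> 2, 2 -> 3.
A = [
    [0, 0, 1, 0],
    [0, 0, 1, 0],
    [0, 0, 0, 1],
    [0, 0, 0, 0],
]
U_sum = symmetrize_sum(A)
U_bib = symmetrize_bibliometric(A)
```

In this toy graph, nodes 0 and 1 are not linked to each other, so the plain sum leaves them unconnected, while the bibliometric form connects them because both point to node 2. Neither variant uses any node or link semantics, which is the gap the paper sets out to address.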

1. Semantic Networks and Semantic Search

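The semantic network is an extension of the current World Wide Web. In a semantic network, information is given well-defined meaning, enabling computers and people to cooperate better^[10]. Concept nodes in a semantic network are organized by level, which captures both the planar relationships among nodes within a level and the vertical relationships among nodes in different levels^[11].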

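Semantic search is the core of the semantic network^[12-14]. Depending on how the ontology is processed, semantic search falls into three categories: augmented semantic search, knowledge-based semantic search, and other search. We propose a discovery model based on semantic relationships that focuses on the semantic properties between nodes [Equal, Similar, Reference, Sequence, Subclass, Implication] and uses semantic links for augmented reasoning, with the aim of obtaining community discovery results closer to the real situation.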
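As a rough illustration of reasoning along a chain of semantic links, the sketch below encodes the six link types named above and folds a toy composition rule over a chain. The rule set is an assumption made for illustration (Equal acts as an identity, and Equal, Sequence, Subclass, and Implication are treated as transitive); the reasoning rules actually used by the model are not given in this section, and the names `Link`, `compose`, and `infer_chain` are hypothetical.

```python
from enum import Enum


class Link(Enum):
    """The six semantic link types listed in the text."""
    EQUAL = "Equal"
    SIMILAR = "Similar"
    REFERENCE = "Reference"
    SEQUENCE = "Sequence"
    SUBCLASS = "Subclass"
    IMPLICATION = "Implication"


# Assumed toy rule: these link types compose with themselves.
TRANSITIVE = {Link.EQUAL, Link.SEQUENCE, Link.SUBCLASS, Link.IMPLICATION}


def compose(first, second):
    """Infer the link A -> C from A -> B (first) and B -> C (second).

    Returns None when the assumed rule set supports no inference.
    """
    if first is Link.EQUAL:
        return second
    if second is Link.EQUAL:
        return first
    if first is second and first in TRANSITIVE:
        return first
    return None


def infer_chain(links):
    """Fold compose over a chain of links; stop as soon as no rule applies."""
    if not links:
        return None
    result = links[0]
    for link in links[1:]:
        result = compose(result, link)
        if result is None:
            return None
    return result


chain = [Link.SUBCLASS, Link.EQUAL, Link.SUBCLASS]
inferred = infer_chain(chain)  # Subclass under the assumed rules
```

Under these assumed rules a chain such as Subclass, Equal, Subclass collapses to a single Subclass link, whereas two Similar links yield no inference because similarity is not treated as transitive here.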

4. Conclusion

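This paper proposes a network community discovery model based on semantic reasoning and, as a novel contribution, examines the composition and partitioning of network communities from the perspective of the semantic network. Building on the planar and vertical properties of semantics, the model uses a graph-plus-tree spatial topology to perform community discovery based on semantic reasoning. In addition, the model exploits the properties of semantics themselves to reduce semantic-link-based search to the level of the hierarchy. Experimental results show that, under the modularity evaluation metric, the community partitions produced by the proposed model are better than those of the LFW and k-means algorithms. On the other hand, the segmentation of semantics and the complexity of the relationships affect the model's speed; we expect to address this problem in future work.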
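For readers unfamiliar with the evaluation metric, the sketch below computes the standard Newman modularity of a partition of an undirected, unweighted graph, Q = Σ_c [L_c / m − (d_c / 2m)^2], where m is the number of edges, L_c the number of edges inside community c, and d_c the total degree of its nodes. This is the textbook definition, assumed here because the section does not state which modularity variant the experiments use; the function name and the toy graph are hypothetical.

```python
def modularity(edges, community):
    """Newman modularity of a partition of an undirected, unweighted graph.

    edges: list of (u, v) pairs, each undirected edge listed once, no self-loops.
    community: dict mapping every node to its community label.
    """
    m = len(edges)
    if m == 0:
        return 0.0
    degree = {}
    internal = {}  # number of edges with both endpoints in the same community
    for u, v in edges:
        degree[u] = degree.get(u, 0) + 1
        degree[v] = degree.get(v, 0) + 1
        if community[u] == community[v]:
            label = community[u]
            internal[label] = internal.get(label, 0) + 1
    degree_sum = {}  # total degree of the nodes in each community
    for node, k in degree.items():
        label = community[node]
        degree_sum[label] = degree_sum.get(label, 0) + k
    return sum(internal.get(label, 0) / m - (d / (2 * m)) ** 2
               for label, d in degree_sum.items())


# Toy graph: two triangles joined by the single edge (2, 3).
edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
split = {0: "a", 1: "a", 2: "a", 3: "b", 4: "b", 5: "b"}
single = {node: "all" for node in range(6)}

q_split = modularity(edges, split)    # 5/14, roughly 0.357
q_single = modularity(edges, single)  # 0.0
```

Splitting the toy graph into its two triangles scores higher than putting every node in one community, which matches the intuition that higher modularity indicates denser within-community linking than expected by chance.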

Figure (8) Table (2) Reference (14)


[1]	CLAUSET A, NEWMAN M E J, MOORE C. Finding Community Structure in very Large Networks[J]. Physical Review E, Statistical, Nonlinear, and Soft Matter Physics, 2004, 70(6): 066111. doi: 10.1103/PhysRevE.70.066111
[2]	LI W M, ZHOU X K, YANG C, et al. Multi-Objective Optimization Algorithm Based on Characteristics Fusion of Dynamic Social Networks for Community Discovery[J]. Information Fusion, 2022, 79: 110-123. doi: 10.1016/j.inffus.2021.10.002
[3]	NAIK D, RAMESH D, GANDOMI A H, et al. Parallel and Distributed Paradigms for Community Detection in Social Networks: a Methodological Review[J]. Expert Systems With Applications, 2022, 187: 115956. doi: 10.1016/j.eswa.2021.115956
[4]	DUAN X Y, YUAN G, MENG F R. A Survey of Dynamic Community Discovery Methods[J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(4): 612-630. (in Chinese)
[5]	YANG Z L, ZHANG W J, YUAN F, et al. Measuring Topic Network Centrality for Identifying Technology and Technological Development in Online Communities[J]. Technological Forecasting and Social Change, 2021, 167: 120673. doi: 10.1016/j.techfore.2021.120673
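[6]	YAN H L. A Survey of Community Discovery Methods for Complex Networks[J]. Information and Computer (Theoretical Edition), 2018(20): 162-163, 169. (in Chinese)
[7]	CHEN S R. Research on Network Sampling and Statistical Inference Methods for Social Networks[D]. Changsha: National University of Defense Technology, 2018. (in Chinese)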
[8]	SATULURI V, PARTHASARATHY S. Symmetrizations for Clustering Directed Graphs[C] //EDBT/ICDT'11: Proceedings of the 14th International Conference on Extending Database Technology. 2011: 343-354.
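[9]	ZHANG H Y, LIANG X, ZHOU X P. A Local Expansion Based Overlapping Community Discovery Algorithm for Directed Graphs[J]. Journal of Data Acquisition and Processing, 2015, 30(3): 683-693. (in Chinese)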
[10]	BERNERS-LEE T, HENDLER J, LASSILA O. The Semantic Web[J]. Scientific American, 2001, 284(5): 34-43. doi: 10.1038/scientificamerican0501-34
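[11]	TAN Y, ZHANG J, XIA L X. Research on the Development History and Current State of Semantic Networks[J]. Documentation, Information & Knowledge, 2019(6): 102-110. (in Chinese)
[12]	ANTONIOU G, HARMELEN F. A Semantic Web Primer[M]. CHEN X P, trans. Beijing: China Machine Press, 2008. (in Chinese)
[13]	HUANG H Q, YU J, LIAO X, et al. A Survey of Knowledge Graph Research[J]. Computer Systems & Applications, 2019, 28(6): 1-12. (in Chinese)
[14]	PANG M Y. Research on a TAP-Based Context-Aware Framework[D]. Taiyuan: Taiyuan University of Technology, 2006. (in Chinese)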


