Growth Law of User Characteristics in Microblog
-
摘要: 根据抓取到的新浪微博实际用户数据,分析了粉丝数、关注数和微博数3个特征量的增长模式,发现这3个特征量整体上都随时间线性增长,取整后的增长率服从幂律分布.用户特征量增长模式主要呈持续增长和爆发式增长,其中爆发式增长用户按增长的不同阶段又可以划分为前期、中期、后期和阶跃式4种增长模式.使用基于向量余弦距离相似性的K-means聚类算法,对不同排序和不同初始规模实际用户特征量的时间序列进行聚类分析,统计得到不同增长模式的用户数量.发现用户特征量中增速高的用户增长主要以爆发式增长为主,而规模高的用户增长以持续式增长为主.通过对用户粉丝数爆发式增长的过程分析,对比用户微博被转发和被评论二者的增长关系,提出了导致用户粉丝数爆发式增长的原因.
-
关键词:
- 微博 /
- 增长模式 /
- 余弦相似性 /
- K-means聚类算法 /
- 时间序列
Abstract: Based on the actual data crawled from Sina Microblog, this paper mainly analyzes the growth law of three user characteristics: the number of followers, friends and statuses. They all increase linearly with time and the growth rate in round figures obeys the power-law distribution. It is found that these characteristics are mainly in sustainable and explosive growth patterns. Moreover, the user with the explosive growth pattern can be divided into four main categories, such as early-stage growth pattern, middle-stage growth pattern, later-stage growth pattern, and step-stage growth pattern. Furthermore, the users’ number of different growth patterns can be counted using the K-means clustering algorithm, which is based on the vector cosine similarity. The growth patterns of user characteristics are observed by cluster analysis of the actual time series, which are grouped by different sorting methods and initial scales. It is observed that the users with higher growth rate are mainly in explosive growth pattern, and the users with higher initial number tend to be in sustainable growth pattern. Finally, based on the analysis of the explosive growth process of the number of followers, the relationships between the growth of the numbers of retweet and comment are compared, and the reasons for the explosive growth of the users are proposed.-
Keywords:
- Microblog /
- growth patterns /
- cosine similarity /
- K-means clustering /
- time series
-
-
期刊类型引用(5)
1. 胡启实,余卫星. 基于人工智能的多媒体数据库在线整合系统设计. 现代电子技术. 2021(02): 127-130 . 百度学术
2. 许国强,余长州,王林,周春蕾,高阳. 一种增强贝叶斯网络结构学习的自动变量序调整算法. 南京大学学报(自然科学). 2021(02): 255-261 . 百度学术
3. 刘洋,王利民,孙铭会. 基于信息熵函数的启发式贝叶斯因果推理. 计算机学报. 2021(10): 2135-2147 . 百度学术
4. 吕通通,张湛,陆林军,张延猛. 基于互信息贝叶斯网络的交通事故严重程度分析. 交通信息与安全. 2021(06): 36-43 . 百度学术
5. 武梦娇,刘继. 基于微生物遗传算法的贝叶斯网络结构学习. 重庆科技学院学报(自然科学版). 2020(06): 70-74 . 百度学术
其他类型引用(7)
计量
- 文章访问数: 1344
- HTML全文浏览量: 1
- PDF下载量: 1141
- 被引次数: 12