在线社交网络中用户地理位置预测综述

刘乐源; 代雨柔; 曹亚男; 周帆

doi:10.7544/issn1000-1239.202220417

在线社交网络中用户地理位置预测综述

Survey of User Geographic Location Prediction Based on Online Social Network

摘要

摘要: 随着智能移动终端的日益普及，人们越来越多地利用社交网络平台（如Twitter、新浪微博等）获取信息、评论和交流. 虽然全球卫星定位系统（GPS）设备能够精确获取位置信息，但是大量用户出于隐私和安全的考虑不会直接共享自己的位置信息. 因此，如何获取在线用户的地理位置成为了一个前沿的研究领域以及学术界和工业界共同关注的重要课题，并且成为众多下游应用的基础，例如基于位置的定向广告投放、事件/地点的推荐、自然灾害或疾病预警和网络犯罪的追踪等. 详细总结了预测社交网络用户地理位置的方法、数据、评价体系和基础算法. 首先，归纳了不同的定位任务以及相应的评价指标；其次，针对不同的任务梳理所用的数据类型和数据融合方式，并且，详尽分析了已有的信息抽取和特征选择方式及其优缺点；再次，对现有定位模型和算法进行分类，从地名词典、传统机器学习和深度学习3个方面对用户定位方法进行阐述和分析；最后，总结了社交网络用户地理位置预测的难点和面临的挑战，并展望该领域的发展趋势和未来研究所需要关注的方向.

Abstract: With the increasing popularity of intelligent mobile terminals, people increasingly use social network platforms (such as Twitter, Sina Weibo, etc.) for information acquisition, comments, and exchanges. Although GPS devices can accurately obtain location information, many users do not directly share their location information for privacy and security considerations. Therefore, obtaining the geographic location of online users has become an important topic in both academia and industry and is the foundation of many downstream applications, such as location-based targeted advertising, event/location recommendations, early warning of natural disasters or diseases, and criminal tracking, etc. We survey in detail the methods, data types, evaluation metrics, and fundamental algorithms for predicting the geographic location of social network users. First, we discriminate different online user geolocation tasks and corresponding evaluation protocols. Subsequently, we assess the data structures and fusion methods used for individual geolocation tasks. Besides, we analyze the existing information extraction and feature selection approaches, as well as their advantages and disadvantages. Moreover, we provide a taxonomy to categorize existing user geolocation models and algorithms, followed by a thorough analysis of different methods from three aspects: geographic dictionary, traditional machine learning, deep learning and graph neural networks. Finally, we summarize the difficulties and challenges in user location prediction while outlining the possible research trend and opportunities to shed light on future work in this field.

HTML全文

参考文献(187)

施引文献

资源附件(0)