Visualization and Analysis of Social Network Data
Published by member 小花仙66
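Abstract
A social network is the network formed by people through their acquaintance relationships. With the rapid development of the World Wide Web, the social networks embedded in it have grown quickly and their structure has become extremely complex. As a result, they are difficult to represent, analyze, and manage with numbers, tables, or text, and the rich information they contain is hard to discover and use effectively. Visualization methods convert a complex social network structure into a well-laid-out graph that helps people understand the structure intuitively and mine useful information from it, which makes visualization an important approach to social network analysis. Building on previous research, this thesis works with social network data collected from the photo-sharing website Flickr. The dataset is compressed through SQL programming on the SqlServer database management platform, and layout algorithms are designed and implemented on the Matlab platform to visualize the extracted key network structure. The detailed research work includes:
(1) Based on a survey of the Flickr website, the overall organization of data such as "user", "image", and "tag" is summarized in an E-R diagram, the selected Flickr dataset is briefly described, and the current state of research on social network visualization is reviewed from two perspectives: data compression methods and layout algorithms;
(2) To improve the efficiency of data compression, a compression scheme for social network datasets based on SQL programming on the SqlServer platform is proposed. Following this scheme, the local social network around a given user, taken as the central node, is extracted; key nodes are selected according to the degree of the user nodes in the local social network; the central node is linked to the key nodes by finding shortest paths; and finally three key social networks represented in matrix form are obtained;
(3) Because the Matlab platform supports vector and graphics operations well, a scheme is proposed that uses Matlab to implement the visualization algorithms and draw the key social network diagrams. Following this scheme, an algorithm is first designed and implemented to place the nodes of the key social network around the central node in a "concentric circles" pattern; a force-directed layout algorithm is then implemented to adjust the node positions and draw the network; finally, an algorithm that introduces auxiliary nodes is designed and implemented to refine the drawing, yielding key social network diagrams that are reasonably clear and visually appealing.
Keywords
Complex network, social network, visualization, visualization algorithm, compression algorithm, layout algorithm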
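Step (2) of the abstract describes the compression pipeline only in outline. The following is a minimal sketch of that outline, written in Python for illustration rather than in the SQL used in the thesis. The function names, the `max_hops` radius, the `top_k` cutoff, and the toy graph are all assumptions introduced here; the thesis may choose the neighbourhood and the key nodes differently.

```python
from collections import deque


def bfs_parents(adj, source, max_hops=None):
    """Breadth-first search from `source`.

    Returns {node: parent} for every node reached within `max_hops` hops
    (no limit when `max_hops` is None). The parent links encode one
    shortest path from `source` to each reached node.
    """
    parents = {source: None}
    depth = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        if max_hops is not None and depth[u] == max_hops:
            continue  # do not expand beyond the allowed radius
        for v in sorted(adj[u]):  # sorted for deterministic output
            if v not in parents:
                parents[v] = u
                depth[v] = depth[u] + 1
                queue.append(v)
    return parents


def extract_key_network(adj, center, max_hops=2, top_k=3):
    """Hypothetical helper: reduce an undirected graph to a 'key network'.

    adj maps each node to the set of its neighbours.
    Returns (nodes, matrix), where matrix is the 0/1 adjacency matrix of
    the kept nodes, in the order given by `nodes`.
    """
    # 1. Local network: everything within max_hops of the central node.
    parents = bfs_parents(adj, center, max_hops)
    local = set(parents)

    # 2. Key nodes: the top_k highest-degree nodes of the local network.
    degree = {u: len(adj[u] & local) for u in local}
    candidates = sorted((u for u in local if u != center),
                        key=lambda u: (-degree[u], u))
    key_nodes = candidates[:top_k]

    # 3. Link the centre to each key node along a shortest path.
    kept = {center}
    for node in key_nodes:
        while node is not None:
            kept.add(node)
            node = parents[node]

    nodes = sorted(kept)
    matrix = [[1 if b in adj[a] else 0 for b in nodes] for a in nodes]
    return nodes, matrix


# Toy graph (assumed): "h" is a hub two hops away from the centre "c".
example_adj = {
    "c": {"a", "b"},
    "a": {"c", "h"},
    "b": {"c"},
    "h": {"a", "x", "y", "z"},
    "x": {"h"}, "y": {"h"}, "z": {"h"},
}
nodes, matrix = extract_key_network(example_adj, "c", max_hops=3, top_k=1)
```

In this toy graph the hub `h` is the only key node, and the intermediate node `a` is kept solely because it lies on the shortest path from `c` to `h`; the low-degree nodes `b`, `x`, `y`, and `z` are discarded.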
ABSTRACT
A social network is formed by people connected through acquaintance relationships. With the rapid growth of the World Wide Web, the social networks in cyberspace are expanding quickly, and their structure is so complex that traditional means such as numbers, tables, or text are inadequate for showing, analyzing, and managing them, which makes the wealth of information they contain hard to discover and use effectively. Visualization, an important approach to social network analysis, can convert a complex social network structure into a clear, attractive graph and thereby help people understand the structure intuitively and extract more useful information from it. Building on previous research, this thesis focuses on a social network dataset retrieved from the photo-sharing website Flickr: it extracts the key network from the dataset with SQL programs on the SqlServer platform, and it designs and implements algorithms to lay out and display the key network on the Matlab platform. In detail, the research work in the thesis includes:
(1) Based on a study of the Flickr website, the overall organization of the "user", "image", and "tag" data is summarized and illustrated in an E-R diagram, the selected Flickr social network dataset is introduced, and the current state of the art in social network visualization is reviewed from two perspectives: data compression methods and layout algorithms;
(2) To improve the efficiency of data compression, a new scheme based on SQL programming on the SqlServer platform is proposed. Following this scheme, the local social network surrounding a given user (taken as the central node of the local network) is extracted, key nodes with high degree in the local social network are selected, the nodes lying on the shortest paths between the central node and the key nodes are added, and finally three key social networks are obtained and represented in matrix form;
(3) Because the Matlab platform supports vector and graphics operations well, a visualization scheme is proposed that uses Matlab to implement the visualization algorithms and draw the key social network diagrams. Following this scheme, an algorithm is first designed and implemented to lay out the nodes of the key social networks in a "concentric circles" pattern; a traditional force-directed layout algorithm is then implemented to adjust the node positions automatically and draw the diagrams; finally, another algorithm is designed and implemented to introduce auxiliary nodes into the diagrams, making them clearer and more intuitive.
KEYWORDS
Complex network, social network, visualization, visualization algorithm, compression algorithm, layout algorithm
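The layout procedure in step (3) can be illustrated with a short sketch. It is written in Python rather than Matlab, purely for illustration, and it rests on assumptions that the abstract does not confirm: rings are assigned by hop distance from the central node, the force model follows the common Fruchterman-Reingold form (repulsion `k*k/d` between all pairs, attraction `d*d/k` along edges), and the central node is held fixed. The auxiliary-node refinement is omitted because the abstract gives no detail on it. All function and parameter names are hypothetical.

```python
import math
from collections import deque


def concentric_layout(adj, center, ring_gap=1.0):
    """Initial placement: a node at hop distance d from `center` sits on a
    circle of radius d * ring_gap, evenly spaced with its ring mates."""
    depth = {center: 0}
    queue = deque([center])
    while queue:
        u = queue.popleft()
        for v in sorted(adj[u]):
            if v not in depth:
                depth[v] = depth[u] + 1
                queue.append(v)

    rings = {}
    for node, d in depth.items():
        rings.setdefault(d, []).append(node)

    pos = {}
    for d, members in rings.items():
        members.sort()
        for i, node in enumerate(members):
            angle = 2 * math.pi * i / len(members)
            radius = d * ring_gap
            pos[node] = (radius * math.cos(angle), radius * math.sin(angle))
    return pos


def force_directed(adj, pos, fixed=(), iterations=50, k=1.0, step=0.1):
    """Refine `pos` with a simple force-directed pass (assumed model).

    Nodes listed in `fixed` never move. The per-iteration move is capped by
    a linearly cooling limit so the layout settles.
    """
    pos = {n: list(p) for n, p in pos.items()}
    nodes = sorted(pos)
    for it in range(iterations):
        disp = {n: [0.0, 0.0] for n in nodes}
        for i, u in enumerate(nodes):
            for v in nodes[i + 1:]:
                dx = pos[u][0] - pos[v][0]
                dy = pos[u][1] - pos[v][1]
                dist = max(math.hypot(dx, dy), 1e-9)
                repulsion = k * k / dist
                attraction = dist * dist / k if v in adj[u] else 0.0
                force = repulsion - attraction  # > 0 pushes the pair apart
                fx, fy = force * dx / dist, force * dy / dist
                disp[u][0] += fx
                disp[u][1] += fy
                disp[v][0] -= fx
                disp[v][1] -= fy
        limit = step * (1 - it / iterations)  # cooling schedule
        for n in nodes:
            if n in fixed:
                continue
            length = math.hypot(disp[n][0], disp[n][1])
            if length > 0:
                move = min(length, limit)
                pos[n][0] += disp[n][0] / length * move
                pos[n][1] += disp[n][1] / length * move
    return {n: tuple(p) for n, p in pos.items()}


# Toy key network (assumed): centre "c", two neighbours, one second-ring node.
layout_adj = {"c": {"a", "b"}, "a": {"c", "d"}, "b": {"c"}, "d": {"a"}}
initial = concentric_layout(layout_adj, "c")
final = force_directed(layout_adj, initial, fixed={"c"})
```

Starting from the concentric placement rather than from random positions gives the force-directed pass a layout that already reflects distance from the central node, so the refinement mainly spreads out nodes that share a ring.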
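Contents
Abstract (Chinese)
ABSTRACT
1.1 Research Background
1.2 Research Status in China and Abroad
1.2.1 Research Status Abroad
1.2.2 Research Status in China
1.3 Research Content, Methods, Means, and Expected Results
1.3.1 Research Content
1.3.2 Research Methods and Means
1.3.3 Expected Results
1.4 Chapter Organization
Chapter 2 Introduction to the Flickr Website and Visualization Techniques
2.1 The Flickr Website
2.1.1 Website Overview
2.1.2 Overall Website Structure
2.1.3 Dataset
2.2 Overview of Data Compression Techniques
2.2.1 Relationship Between Compression and Visualization
2.2.2 Overview of Compression Algorithms
2.3 Overview of Graph Drawing Techniques
2.3.1 Requirements for Layout Algorithms
2.3.2 Principles of Layout Algorithms
2.4 Summary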
..