Big Data Training: Data Visualization Tutorial Series

Author: 用户9184480
Published: 2024-12-07 16:29:22
Included in the column: 云计算linux

Data Visualization

DataX on GitHub

https://github.com/alibaba/DataX/blob/master/introduction.md

Basic DataX installation

Download the package

https://datax-opensource.oss-cn-hangzhou.aliyuncs.com/20220530/datax.tar.gz

Upload it to the target directory on the server (hadoop02)

/bigdata/soft

Extract it to the target directory

tar -zxvf datax.tar.gz  -C /bigdata/server/

Run the sample job (from the extracted DataX directory)

python bin/datax.py  job/job.json
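
To launch a job from a script instead of typing the command by hand, the same invocation can be assembled in Python. This is only a minimal sketch: it assumes the archive was extracted to `/bigdata/server/datax` (the top-level directory name is an assumption), and `build_datax_command` and `run_job` are hypothetical helper names, not part of DataX.

```python
import subprocess
from pathlib import PurePosixPath

# Assumed install location: the archive extracted under /bigdata/server/.
DATAX_HOME = PurePosixPath("/bigdata/server/datax")


def build_datax_command(job_file, datax_home=DATAX_HOME, python_exe="python"):
    """Return the argument list equivalent to `python bin/datax.py <job_file>`."""
    launcher = PurePosixPath(datax_home) / "bin" / "datax.py"
    return [python_exe, str(launcher), str(job_file)]


def run_job(job_file, datax_home=DATAX_HOME):
    """Run one DataX job and return its exit code (0 means success)."""
    cmd = build_datax_command(job_file, datax_home)
    # cwd is set so that a relative path such as job/job.json resolves
    # the same way as in the manual run.
    return subprocess.run(cmd, cwd=str(datax_home)).returncode
```

Calling `run_job("job/job.json")` on the server should behave like the manual command; `build_datax_command` alone can be used to inspect the command without running it.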

Basic DataX usage

Create the corresponding tables in MySQL

User distribution by city

CREATE TABLE `ads_user_city` (
  city varchar(80) DEFAULT NULL COMMENT 'city',
  province varchar(80) DEFAULT NULL COMMENT 'province',
  area varchar(80) DEFAULT NULL COMMENT 'region',
  dt varchar(80) DEFAULT NULL COMMENT 'date',
  count bigint DEFAULT NULL COMMENT 'record count'
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT='user distribution by city';

Website visits by network access mode

CREATE TABLE ads_visit_type (
  url VARCHAR(80) COMMENT 'visited URL',
  type VARCHAR(80) COMMENT 'access mode',
  dt VARCHAR(80) COMMENT 'date',
  month VARCHAR(80) COMMENT 'month',
  quarter VARCHAR(80) COMMENT 'quarter',
  count bigint COMMENT 'record count'
) COMMENT 'website visits by network access mode';
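
The reader `column` entries in the export jobs follow the column order of these tables: `VARCHAR` fields are read as `string` and `bigint` fields as `long`. The sketch below derives those entries from a table definition. The mapping dictionary and the function name `reader_columns` are illustrative assumptions and cover only the two types used in this article.

```python
# Hypothetical mapping from the MySQL types used in the tables above to
# the hdfsreader column types used in this article's job files.
MYSQL_TO_DATAX = {"varchar": "string", "bigint": "long"}


def reader_columns(mysql_columns):
    """Build hdfsreader `column` entries from (name, mysql_type) pairs.

    The position of each pair becomes the `index` of the field in the
    tab-delimited HDFS file.
    """
    entries = []
    for index, (_name, mysql_type) in enumerate(mysql_columns):
        # "VARCHAR(80)" -> "varchar"
        base_type = mysql_type.lower().split("(")[0].strip()
        entries.append({"index": index, "type": MYSQL_TO_DATAX[base_type]})
    return entries


ADS_VISIT_TYPE = [
    ("url", "VARCHAR(80)"),
    ("type", "VARCHAR(80)"),
    ("dt", "VARCHAR(80)"),
    ("month", "VARCHAR(80)"),
    ("quarter", "VARCHAR(80)"),
    ("count", "bigint"),
]
```

`reader_columns(ADS_VISIT_TYPE)` yields six entries, five of type `string` followed by one of type `long`, which is the shape the reader section for this table needs.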

Export job scripts

ads_user_city.json

{
    "job": {
        "setting": {
            "speed": {
                "channel": 1
            }
        },
        "content": [
            {
                "reader": {
                    "name": "hdfsreader",
                    "parameter": {
                        "path": "/behavior/ads/ads_user_city/*",
                        "defaultFS": "hdfs://hadoop01:8020",
                        "column": [
                               {
                                "index": 0,
                                "type": "string"
                               },
                               {
                                "index": 1,
                                "type": "string"
                               },
                               {
                                "index": 2,
                                "type": "string"
                               },
                               {
                                "index": 3,
                                "type": "string"
                               },
                               {
                                "index": 4,
                                "type": "long"
                               }
                        ],
                        "fileType": "text",
                        "encoding": "UTF-8",
                        "fieldDelimiter": "\t"
                    }

                },
                "writer": {
                    "name": "mysqlwriter",
                    "parameter": {
                        "writeMode": "insert",
                        "username": "root",
                        "password": "123456",
                        "column": [
                            "city",
                            "province",
                            "area",
                            "dt",
                            "count"
                        ],
                        "session": [
                        	"set session sql_mode='ANSI'"
                        ],
                        "preSql": [
                            "delete from ads_user_city"
                        ],
                        "connection": [
                            {
                                "jdbcUrl": "jdbc:mysql://192.168.3.129:3306/behavior?useUnicode=true&characterEncoding=utf-8&useSSL=false",
                                "table": [
                                    "ads_user_city"
                                ]
                            }
                        ]
                    }
                }
            }
        ]
    }
}
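
The job files in this article share one structure and differ mainly in the table name, the column list, and the write mode. As an illustration, the sketch below generates that structure from a few parameters. `build_job` is a hypothetical helper, and its defaults (HDFS root, `defaultFS`, credentials) are simply copied from the job above; adjust them for your own cluster.

```python
import json


def build_job(table, columns, reader_types, jdbc_url,
              hdfs_root="/behavior/ads", default_fs="hdfs://hadoop01:8020",
              username="root", password="123456", write_mode="insert"):
    """Return a DataX job dict that copies an HDFS text table into MySQL."""
    if len(columns) != len(reader_types):
        raise ValueError("reader and writer must have the same number of columns")
    reader = {
        "name": "hdfsreader",
        "parameter": {
            "path": f"{hdfs_root}/{table}/*",
            "defaultFS": default_fs,
            # One entry per field, in file order.
            "column": [{"index": i, "type": t} for i, t in enumerate(reader_types)],
            "fileType": "text",
            "encoding": "UTF-8",
            "fieldDelimiter": "\t",
        },
    }
    writer = {
        "name": "mysqlwriter",
        "parameter": {
            "writeMode": write_mode,
            "username": username,
            "password": password,
            "column": list(columns),
            "session": ["set session sql_mode='ANSI'"],
            # Empty the target table before loading, as the jobs here do.
            "preSql": [f"delete from {table}"],
            "connection": [{"jdbcUrl": jdbc_url, "table": [table]}],
        },
    }
    return {"job": {"setting": {"speed": {"channel": 1}},
                    "content": [{"reader": reader, "writer": writer}]}}


job = build_job(
    "ads_user_city",
    ["city", "province", "area", "dt", "count"],
    ["string", "string", "string", "string", "long"],
    "jdbc:mysql://192.168.3.129:3306/behavior?useUnicode=true&characterEncoding=utf-8&useSSL=false",
)
job_text = json.dumps(job, indent=4)
```

Writing `job_text` to `ads_user_city.json` gives a file with the same settings as the hand-written one, and building the file through `json.dumps` rules out syntax slips such as a missing comma.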

In my run the result showed 3041; the data set has 7000 samples.

ads_visit_type.json

{
    "job": {
        "setting": {
            "speed": {
                "channel": 1
            }
        },
        "content": [
            {
                "reader": {
                    "name": "hdfsreader",
                    "parameter": {
                        "path": "/behavior/ads/ads_visit_type/*",
                        "defaultFS": "hdfs://hadoop01:8020",
                        "column": [
                               {
                                "index": 0,
                                "type": "string"
                               },
                               {
                                "index": 1,
                                "type": "string"
                               },
                               {
                                "index": 2,
                                "type": "string"
                               },
                               {
                                "index": 3,
                                "type": "string"
                               },
                               {
                                "index": 4,
                                "type": "string"
                               },
                               {
                                "index": 5,
                                "type": "long"
                               }
                        ],
                        "fileType": "text",
                        "encoding": "UTF-8",
                        "fieldDelimiter": "\t"
                    }

                },
                "writer": {
                    "name": "mysqlwriter",
                    "parameter": {
                        "writeMode": "replace",
                        "username": "root",
                        "password": "123456",
                        "column": [
                            "url",
                            "type",
                            "dt",
                            "month",
                            "quarter",
                            "count"
                        ],
                        "session": [
                        	"set session sql_mode='ANSI'"
                        ],
                        "preSql": [
                            "delete from ads_visit_type"
                        ],
                        "connection": [
                            {
                                "jdbcUrl": "jdbc:mysql://192.168.3.129:3306/behavior?useUnicode=true&characterEncoding=utf-8&useSSL=false",
                                "table": [
                                    "ads_visit_type"
                                ]
                            }
                        ]
                    }
                }
            }
        ]
    }
}
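
Two slips are easy to make in hand-edited job files like this one: a missing comma between `column` entries, which makes the file invalid JSON, and a reader column list whose length differs from the writer's. The following sketch checks for both before a job is submitted. `check_job` is a hypothetical helper, not a DataX feature, and it only looks at the fields used in this article.

```python
import json


def check_job(text):
    """Return a list of problems found in the text of a DataX job file."""
    try:
        job = json.loads(text)
    except json.JSONDecodeError as exc:
        # Typical cause: a missing comma between two column entries.
        return [f"invalid JSON at line {exc.lineno}, column {exc.colno}: {exc.msg}"]

    problems = []
    for i, item in enumerate(job["job"]["content"]):
        reader_cols = item["reader"]["parameter"]["column"]
        writer_cols = item["writer"]["parameter"]["column"]
        if len(reader_cols) != len(writer_cols):
            problems.append(
                f"content[{i}]: reader has {len(reader_cols)} columns, "
                f"writer has {len(writer_cols)}"
            )
    return problems
```

An empty list means neither problem was found. For example, `check_job(open("ads_visit_type.json").read())` can be run on the server before invoking `datax.py`.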

Result: 3410

ads_visit_mode.json

{
    "job": {
        "setting": {
            "speed": {
                "channel": 1
            }
        },
        "content": [
            {
                "reader": {
                    "name": "hdfsreader",
                    "parameter": {
                        "path": "/behavior/ads/ads_visit_mode/*",
                        "defaultFS": "hdfs://hadoop01:8020",
                        "column": [
                               {
                                "index": 0,
                                "type": "string"
                               },
                               {
                                "index": 1,
                                "type": "string"
                               },
                               {
                                "index": 2,
                                "type": "string"
                               },
                               {
                                "index": 3,
                                "type": "long"
                               }
                        ],
                        "fileType": "text",
                        "encoding": "UTF-8",
                        "fieldDelimiter": "\t"
                    }

                },
                "writer": {
                    "name": "mysqlwriter",
                    "parameter": {
                        "writeMode": "replace",
                        "username": "root",
                        "password": "123456",
                        "column": [
                            "url",
                            "device_type",
                            "dt",
                            "count"
                        ],
                        "session": [
                        	"set session sql_mode='ANSI'"
                        ],
                        "preSql": [
                            "delete from ads_visit_mode"
                        ],
                        "connection": [
                            {
                                "jdbcUrl": "jdbc:mysql://192.168.113.144:3306/behavior?useUnicode=true&characterEncoding=utf-8&useSSL=false",
                                "table": [
                                    "ads_visit_mode"
                                ]
                            }
                        ]
                    }
                }
            }
        ]
    }
}
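
With all three job files in place, they can be run one after another. The sketch below does this and stops at the first failure. It assumes the files were saved in the `job/` directory of the DataX installation and that the script is started from that installation directory; `run_all` and its `runner` parameter are hypothetical names introduced for illustration.

```python
import subprocess

JOB_FILES = ["ads_user_city.json", "ads_visit_type.json", "ads_visit_mode.json"]


def run_all(job_files, job_dir="job", runner=None):
    """Run each job in order and return {job file: exit code}.

    Stops after the first job that exits with a non-zero code, so later
    jobs are not run against a broken environment.
    """
    if runner is None:
        # Default: actually launch DataX and report its exit code.
        runner = lambda cmd: subprocess.run(cmd).returncode
    results = {}
    for name in job_files:
        cmd = ["python", "bin/datax.py", f"{job_dir}/{name}"]
        results[name] = runner(cmd)
        if results[name] != 0:
            break
    return results
```

Passing a custom `runner` makes it possible to dry-run the sequence, for example `run_all(JOB_FILES, runner=lambda cmd: print(cmd) or 0)` prints each command without starting DataX.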

Configuring visualization reports in Quick BI (see the official documentation)

Originally published on the author's personal site/blog on 2024-05-24.
