[hadoop@master mapreduce]$ hadoop jar hadoop-mapreduce-examples-2.6.4.jar wordcount /wordcount/input/ /wordcount/output
17/09/22 20:33:50 INFO client.RMProxy: Connecting to ResourceManager at /0.0.0.0:8032
17/09/22 20:33:50 INFO input.FileInputFormat: Total input paths to process : 0
17/09/22 20:3
我运行以下hql:
select new.uid as uid, new.category_id as category_id, new.atag as atag,
new.rank_idx + CASE when old.rank_idx is not NULL then old.rank_idx else 0 END as rank_idx
from (
select a1.uid, a1.category_id, a1.atag, row_number() over(distribute by a1.uid, a1.category_id sort by a1.cmt_
我想备份(然后导入)一个dynamodb表到S3。dynamodb表存在于us-east-2中,但这是aws数据管道不支持的区域。AWS文档似乎表明这应该不是问题,但我似乎不能让数据管道在us-east-2中查找表。
这是我的数据管道的导出。当我运行此命令时,在查找dynamodb表时,我得到一个'resource not found error‘。如果我在运行此数据管道的us-west-2中临时创建了一个同名的表,作业将工作,但会从us-west-2中的表中提取数据,而不是从us-east-2中提取数据。有什么方法可以让这个作业从配置中指定的区域中拉出?
{
"objec