I am trying to access S3 data from a Spark application, and I am using Spark SQL to retrieve the data. It is not taking the group by clause. DataFrame summaryQuery=sql.sql("Select score from summary order by updationDate desc");
summaryQuery.groupByDataFrame summaryQuery=sql.sql("Select score from summary g
val orderedId = sqlContext.sql("SELECT MovieId AS Id,Tag FROM tag ORDER BY MovieId")
This query gives me a file consisting of Id and Tag, which is the input to the second step, val eachTagCount =orderedId.groupBy(" Id,Tag").count(), but that step fails with an error.
case class DataClass( MovieId:Int/dataset/algorithm3/output")
val ea
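A likely cause of the error in the second step is that groupBy(" Id,Tag") passes a single string, so Spark looks for one column literally named " Id,Tag" rather than the two columns Id and Tag. The sketch below passes each column name as its own argument. It is only an illustration under assumptions: it uses SparkSession (Spark 2.0 or later) instead of the sqlContext in the question, runs in local mode, and the sample rows, the object name TagCountSketch, and the helper countPerIdAndTag are hypothetical.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}

object TagCountSketch {
  // Counts rows per (Id, Tag) pair.
  // Each grouping column is a separate argument, not one comma-joined string.
  def countPerIdAndTag(orderedId: DataFrame): DataFrame =
    orderedId.groupBy("Id", "Tag").count()

  def main(args: Array[String]): Unit = {
    // Assumption: a local SparkSession stands in for the cluster setup in the question.
    val spark = SparkSession.builder()
      .appName("tag-count-sketch")
      .master("local[*]")
      .getOrCreate()
    import spark.implicits._

    // Hypothetical sample rows standing in for the real `tag` table.
    val tag = Seq((1, "funny"), (1, "funny"), (2, "dark")).toDF("MovieId", "Tag")
    tag.createOrReplaceTempView("tag")

    // Same query as in the question, run through the session instead of sqlContext.
    val orderedId = spark.sql("SELECT MovieId AS Id, Tag FROM tag ORDER BY MovieId")

    // Yields the columns Id, Tag, and count.
    val eachTagCount = countPerIdAndTag(orderedId)
    eachTagCount.show()

    spark.stop()
  }
}
```

The ORDER BY in the first query is not needed for the count, since groupBy does not preserve row order. If the result has to be sorted, an orderBy on the grouped result is the safer place for it.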