这个问题是这个主题的延续.
我遇到的问题是使用Maven打包Scala Spark项目.
当我运行此命令时:
spark-submit --name 28 --master local[2] --class org.test.consumer.TestRunner \
/usr/tests/test1/target/test_service-1.0-SNAPSHOT.jar \
$arg1 $arg2 $arg3 $arg4 $arg5
Run Code Online (Sandbox Code Playgroud)
...,我收到以下错误:
Exception in thread "main" java.lang.NoClassDefFoundError: com/typesafe/config/ConfigFactory
at org.test.consumer.kafka.KafkaConsumer.<init>(KafkaConsumer.scala:38)
at org.test.consumer.TestRunner$.main(TestRunner.scala:19)
at org.test.consumer.TestRunner.main(TestRunner.scala)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at org.apache.spark.deploy.SparkSubmit$.org$apache$spark$deploy$SparkSubmit$$runMain(SparkSubmit.scala:729)
at org.apache.spark.deploy.SparkSubmit$.doRunMain$1(SparkSubmit.scala:185)
at org.apache.spark.deploy.SparkSubmit$.submit(SparkSubmit.scala:210)
at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:124)
at org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala)
Caused by: java.lang.ClassNotFoundException: com.typesafe.config.ConfigFactory
at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
... 12 more
Run Code Online (Sandbox Code Playgroud)
这是我目前pom.xml考虑上面提到的线程的建议(我检查生成的jar包含scala类):
<?xml version="1.0" encoding="UTF-8"?>
<project xmlns="http://maven.apache.org/POM/4.0.0"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
<modelVersion>4.0.0</modelVersion>
<groupId>org.test.consumer</groupId>
<artifactId>test_service</artifactId> …Run Code Online (Sandbox Code Playgroud) 我有以下数据:
df = data.frame(c("2012","2012","2012","2013"),
c("tuesday","tuesday","friday","thursday"),
c("AAA","BBB","AAA","AAA"))
colnames(df) = c("year","day","type")
Run Code Online (Sandbox Code Playgroud)
type我想显示每年和每天值(AAA、BBB)出现的次数(绝对频率) 。目前我编写了以下代码,但它要求我再添加一个维度aes,例如aes(type, some_dimension, fill = as.factor(year))。那么,我怎样才能添加类似的东西count(type)呢?
ggplot(dat) +
geom_bar(aes(type, fill = as.factor(year)),
position = "dodge", stat = "identity") +
facet_wrap(~day)
Run Code Online (Sandbox Code Playgroud) 我使用ggplot2库创建了一个饼图,如下所示:
ggplot(dat,
aes(x = factor(""), fill = workers) ) +
geom_bar() +
coord_polar(theta = "y") +
scale_x_discrete("")
Run Code Online (Sandbox Code Playgroud)
问题是数字 ( count) 显示为2e+05。我怎样才能将它们显示为20000?
更新:
dat <- data.frame(workers=c("Q8","A2","S1","A2","A2","S1"))
Run Code Online (Sandbox Code Playgroud)