Run in IntelliJ IdeaΒΆ
You can run your job from IDEA locally. Run -> Edit configurations -> New Application
Option | Value |
---|---|
Main class | com.twitter.scalding.Tool |
VM options | -XX:MaxPermSize=512M -Xmx1024M |
Program arguments | job.class.name –hdfs –param1 value –param2 value –input local-path |
Working directory | ~/projects/scalding-jobs |
Use classpath of module | first-party |
Note that --output
is optional usually. When not specified we write to NullSource which prints to stdout.
Here is sample scalding job:
package com.sample
import com.twitter.scalding._
import com.twitter.scalding.Tsv
class SampleJob(args: Args) extends Job(args) {
val input = args("input")
val output = args.getOrElse("output", null)
val results = Tsv(input).read
//todo do something here
if (output != null)
results.write(Tsv(output))
else
results.debug.write(NullSource)
}