You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe.
Currently scalding-spark only supports a few scalding sources/sinks. I am interested in adding support for some more common use cases.
The current support is:
/** * This has a mappings for some built in scalding sinks currently only WritableSequenceFile and TextLine are * supported * * users can add their own implementations and compose Resolvers using orElse*/valDefault:Resolver[Output, SparkSink] =newResolver[Output, SparkSink] {
defapply[A](i: Output[A]):Option[SparkSink[A]] =
i match {
case ws @WritableSequenceFile(path, fields, sinkMode) =>Some(writableSequenceFile(path, ws.keyType, ws.valueType).asInstanceOf[SparkSink[A]])
casetl: TextLine=>Some(textLine(tl.localPaths.head).asInstanceOf[SparkSink[A]])
case _ =>None
}
}
}
The text was updated successfully, but these errors were encountered:
I think this is a good goal, but trying to make the current common inputs independent of cascading will be pretty hard...
Instead, I think you could probably make it work by admitting cascading to the classpath, but excluding Hadoop and just not triggering hadoop in the runtime, since you are only exercising equals and isInstanceOf to do these kinds of matches.
Is your feature request related to a problem? Please describe.
Currently scalding-spark only supports a few scalding sources/sinks. I am interested in adding support for some more common use cases.
The current support is:
The text was updated successfully, but these errors were encountered: