public abstract class BatchTLinkImpl<T1,T0> extends BaseTLinkWithSchema<T1,T0> implements BatchTLink<T1,T0>
Modifier | Constructor and Description |
---|---|
protected |
BatchTLinkImpl() |
Modifier and Type | Method and Description |
---|---|
StorableTBase<T0> |
cache()
Runs this TSet and caches the data to an in-memory
DataPartition and exposes the data as another TSet. |
<P> ComputeTSet<P,T1> |
compute(ComputeCollectorFunc<P,T1> computeFunction)
Creates a Compute
TSet based on the ComputeCollectorFunc provided. |
<P> ComputeTSet<P,T1> |
compute(ComputeFunc<P,T1> computeFunction)
Creates a Compute
TSet based on the ComputeFunc provided. |
<P> ComputeTSet<P,T1> |
compute(java.lang.String n,
ComputeCollectorFunc<P,T1> computeFunction) |
<P> ComputeTSet<P,T1> |
compute(java.lang.String n,
ComputeFunc<P,T1> computeFunction) |
BatchEnvironment |
getTSetEnv()
tset env
|
StorableTBase<T0> |
persist()
Similar to cache, but the data is stored in a disk based
DataPartition . |
SinkTSet<T1> |
sink(SinkFunc<T1> sinkFunction)
Creates a Sink TSet based on the
SinkFunc . |
getSchema
addChildToGraph, equals, getId, getName, getSourceParallelism, getTargetParallelism, hashCode, rename, setTSetEnv, toString
clone, finalize, getClass, notify, notifyAll, wait, wait, wait
flatmap, forEach, lazyForEach, map, mapToTuple, setName
lazyCache, lazyPersist
build, getEdge
generateID, getTBaseGraph
public BatchEnvironment getTSetEnv()
Buildable
getTSetEnv
in interface Buildable
getTSetEnv
in class BaseTLink<T1,T0>
public <P> ComputeTSet<P,T1> compute(java.lang.String n, ComputeFunc<P,T1> computeFunction)
public <P> ComputeTSet<P,T1> compute(java.lang.String n, ComputeCollectorFunc<P,T1> computeFunction)
public <P> ComputeTSet<P,T1> compute(ComputeFunc<P,T1> computeFunction)
TLink
TSet
based on the ComputeFunc
provided.public <P> ComputeTSet<P,T1> compute(ComputeCollectorFunc<P,T1> computeFunction)
TLink
TSet
based on the ComputeCollectorFunc
provided.compute
in interface BatchTLink<T1,T0>
compute
in interface TLink<T1,T0>
P
- output type (for the RecordCollector
)computeFunction
- compute function with collectorpublic SinkTSet<T1> sink(SinkFunc<T1> sinkFunction)
TLink
SinkFunc
.sink
in interface BatchTLink<T1,T0>
sink
in interface TLink<T1,T0>
sinkFunction
- sink function which takes in <T1>. Similar to a compute, but would not
return any TSetpublic StorableTBase<T0> cache()
StoringData
DataPartition
and exposes the data as another TSet.cache
in interface StoringData<T0>
public StorableTBase<T0> persist()
StoringData
DataPartition
. This method would also expose the
checkpointing ability to TSet
s.persist
in interface StoringData<T0>