public abstract class BatchTLinkImpl<T1,T0> extends BaseTLinkWithSchema<T1,T0> implements BatchTLink<T1,T0>
| Modifier | Constructor and Description |
|---|---|
protected |
BatchTLinkImpl() |
| Modifier and Type | Method and Description |
|---|---|
StorableTBase<T0> |
cache()
Runs this TSet and caches the data to an in-memory
DataPartition and exposes the data as another TSet. |
<P> ComputeTSet<P,T1> |
compute(ComputeCollectorFunc<P,T1> computeFunction)
Creates a Compute
TSet based on the ComputeCollectorFunc provided. |
<P> ComputeTSet<P,T1> |
compute(ComputeFunc<P,T1> computeFunction)
Creates a Compute
TSet based on the ComputeFunc provided. |
<P> ComputeTSet<P,T1> |
compute(java.lang.String n,
ComputeCollectorFunc<P,T1> computeFunction) |
<P> ComputeTSet<P,T1> |
compute(java.lang.String n,
ComputeFunc<P,T1> computeFunction) |
BatchEnvironment |
getTSetEnv()
tset env
|
StorableTBase<T0> |
persist()
Similar to cache, but the data is stored in a disk based
DataPartition. |
SinkTSet<T1> |
sink(SinkFunc<T1> sinkFunction)
Creates a Sink TSet based on the
SinkFunc. |
getSchemaaddChildToGraph, equals, getId, getName, getSourceParallelism, getTargetParallelism, hashCode, rename, setTSetEnv, toStringclone, finalize, getClass, notify, notifyAll, wait, wait, waitflatmap, forEach, lazyForEach, map, mapToTuple, setNamelazyCache, lazyPersistbuild, getEdgegenerateID, getTBaseGraphpublic BatchEnvironment getTSetEnv()
BuildablegetTSetEnv in interface BuildablegetTSetEnv in class BaseTLink<T1,T0>public <P> ComputeTSet<P,T1> compute(java.lang.String n, ComputeFunc<P,T1> computeFunction)
public <P> ComputeTSet<P,T1> compute(java.lang.String n, ComputeCollectorFunc<P,T1> computeFunction)
public <P> ComputeTSet<P,T1> compute(ComputeFunc<P,T1> computeFunction)
TLinkTSet based on the ComputeFunc provided.public <P> ComputeTSet<P,T1> compute(ComputeCollectorFunc<P,T1> computeFunction)
TLinkTSet based on the ComputeCollectorFunc provided.compute in interface BatchTLink<T1,T0>compute in interface TLink<T1,T0>P - output type (for the RecordCollector)computeFunction - compute function with collectorpublic SinkTSet<T1> sink(SinkFunc<T1> sinkFunction)
TLinkSinkFunc.sink in interface BatchTLink<T1,T0>sink in interface TLink<T1,T0>sinkFunction - sink function which takes in <T1>. Similar to a compute, but would not
return any TSetpublic StorableTBase<T0> cache()
StoringDataDataPartition and exposes the data as another TSet.cache in interface StoringData<T0>public StorableTBase<T0> persist()
StoringDataDataPartition. This method would also expose the
checkpointing ability to TSets.persist in interface StoringData<T0>