T
- value typepublic abstract class BatchGatherLink<T> extends BatchTLinkImpl<java.util.Iterator<Tuple<java.lang.Integer,T>>,T>
Modifier | Constructor and Description |
---|---|
protected |
BatchGatherLink() |
Modifier and Type | Method and Description |
---|---|
CachedTSet<T> |
cache()
Runs this TSet and caches the data to an in-memory
DataPartition and exposes the data as another TSet. |
<O> ComputeTSet<O,java.util.Iterator<Tuple<java.lang.Integer,T>>> |
flatmap(FlatMapFunc<O,T> mapFn)
Performs flat map operation based on the
FlatMapFunc provided |
void |
forEach(ApplyFunc<T> applyFunction)
Applies a function elementwise.
|
CachedTSet<T> |
lazyCache()
Performs caching lazily.
|
ComputeTSet<java.lang.Object,java.util.Iterator<Tuple<java.lang.Integer,T>>> |
lazyForEach(ApplyFunc<T> applyFunction) |
PersistedTSet<T> |
lazyPersist()
Performs persisting lazily.
|
<O> ComputeTSet<O,java.util.Iterator<Tuple<java.lang.Integer,T>>> |
map(MapFunc<O,T> mapFn)
Performs elementwise map operation based on the
MapFunc provided |
<K,V> KeyedTSet<K,V> |
mapToTuple(MapFunc<Tuple<K,V>,T> genTupleFn)
|
PersistedTSet<T> |
persist()
Similar to cache, but the data is stored in a disk based
DataPartition . |
compute, compute, compute, compute, getTSetEnv, sink
getSchema
addChildToGraph, equals, getId, getName, getSourceParallelism, getTargetParallelism, hashCode, rename, setTSetEnv, toString
clone, finalize, getClass, notify, notifyAll, wait, wait, wait
setName
build, getEdge
generateID, getTBaseGraph
public <O> ComputeTSet<O,java.util.Iterator<Tuple<java.lang.Integer,T>>> map(MapFunc<O,T> mapFn)
TLink
MapFunc
providedO
- output typemapFn
- map function T0 to Opublic <O> ComputeTSet<O,java.util.Iterator<Tuple<java.lang.Integer,T>>> flatmap(FlatMapFunc<O,T> mapFn)
TLink
FlatMapFunc
providedO
- map function to T0 to multiple elements of <O>mapFn
- map function which can produce multiple elements for a single <T0> elementpublic <K,V> KeyedTSet<K,V> mapToTuple(MapFunc<Tuple<K,V>,T> genTupleFn)
TLink
public void forEach(ApplyFunc<T> applyFunction)
TLink
ApplyFunc
does not
return anything.applyFunction
- apply functionpublic ComputeTSet<java.lang.Object,java.util.Iterator<Tuple<java.lang.Integer,T>>> lazyForEach(ApplyFunc<T> applyFunction)
public CachedTSet<T> lazyCache()
StoringData
TSet
is evaluated explicitly.public CachedTSet<T> cache()
StoringData
DataPartition
and exposes the data as another TSet.cache
in interface StoringData<T>
cache
in class BatchTLinkImpl<java.util.Iterator<Tuple<java.lang.Integer,T>>,T>
public PersistedTSet<T> lazyPersist()
StoringData
public PersistedTSet<T> persist()
StoringData
DataPartition
. This method would also expose the
checkpointing ability to TSet
s.persist
in interface StoringData<T>
persist
in class BatchTLinkImpl<java.util.Iterator<Tuple<java.lang.Integer,T>>,T>