T
- tset typepublic interface BatchTSet<T> extends TSet<T>, AcceptingData<T>, StoringData<T>
TSet
interface. Adds methods related to
Batch OperationMode
and overrides generic
TSet
and TLink
return type to BatchTSet
and BatchTLink
in all methods. Also extends the AcceptingData
interface
because the BatchTSet
s can accept data from other TSet
s.Modifier and Type | Method and Description |
---|---|
BatchTSet<T> |
addInput(java.lang.String key,
StorableTBase<?> input)
Adds data to this TSet
|
BatchTLink<java.util.Iterator<Tuple<java.lang.Integer,T>>,T> |
allGather()
Same as gather, but all the target TSet instances would receive the gathered result in the
runtime.
|
BatchTLink<T,T> |
allReduce(ReduceFunc<T> reduceFn)
Similar to reduce, but all instances of the target
TSet would receive the reduced
result. |
BatchTLink<java.util.Iterator<T>,T> |
direct()
Returns a Direct
TLink that corresponds to the communication operation where the data
will be transferred to another TSet directly. |
BatchTLink<java.util.Iterator<Tuple<java.lang.Integer,T>>,T> |
gather()
Returns a Gather
TLink that would gather data to the target TSet instance with index
0 (in the runtime). |
<K,V> BatchTupleTSet<K,V> |
mapToTuple(MapFunc<Tuple<K,V>,T> mapToTupleFn)
|
BatchTLink<java.util.Iterator<T>,T> |
partition(PartitionFunc<T> partitionFn)
Same as above, but the parallelism will be preserved in the target
TSet . |
BatchTLink<java.util.Iterator<T>,T> |
partition(PartitionFunc<T> partitionFn,
int targetParallelism)
Returns a Partition
TLink that would partition data according based on a function
provided. |
BatchTLink<T,T> |
pipe() |
BatchTLink<T,T> |
reduce(ReduceFunc<T> reduceFn)
Returns a Reduce
TLink that reduce data on to the target TSet instance (in the runtime)
with index 0. |
BatchTLink<java.util.Iterator<T>,T> |
replicate(int replications)
|
BatchTSet<T> |
setName(java.lang.String name)
Sets the name
|
BatchTSet<T> |
union(java.util.Collection<TSet<T>> tSets)
Same as above, but accepts a
Collection of TSet s. |
BatchTSet<T> |
union(TSet<T> unionTSet)
|
withSchema
cache, lazyCache, lazyPersist, persist
BatchTSet<T> setName(java.lang.String name)
TSet
BatchTLink<java.util.Iterator<T>,T> direct()
TSet
TLink
that corresponds to the communication operation where the data
will be transferred to another TSet directly.BatchTLink<T,T> pipe()
BatchTLink<T,T> reduce(ReduceFunc<T> reduceFn)
TSet
TLink
that reduce data on to the target TSet instance (in the runtime)
with index 0.BatchTLink<T,T> allReduce(ReduceFunc<T> reduceFn)
TSet
TSet
would receive the reduced
result.BatchTLink<java.util.Iterator<T>,T> partition(PartitionFunc<T> partitionFn, int targetParallelism)
TSet
BatchTLink<java.util.Iterator<T>,T> partition(PartitionFunc<T> partitionFn)
TSet
TSet
.BatchTLink<java.util.Iterator<Tuple<java.lang.Integer,T>>,T> gather()
TSet
TLink
that would gather data to the target TSet instance with index
0 (in the runtime).BatchTLink<java.util.Iterator<Tuple<java.lang.Integer,T>>,T> allGather()
TSet
<K,V> BatchTupleTSet<K,V> mapToTuple(MapFunc<Tuple<K,V>,T> mapToTupleFn)
TSet
TupleTSet
based on the MapFunc
provided. This will an entry point
to keyed communication operations from a non-keyed TSet
.mapToTuple
in interface TSet<T>
K
- type of keyV
- type of valuemapToTupleFn
- Map functionBatchTLink<java.util.Iterator<T>,T> replicate(int replications)
TSet
BatchTSet<T> union(java.util.Collection<TSet<T>> tSets)
TSet
Collection
of TSet
s.BatchTSet<T> addInput(java.lang.String key, StorableTBase<?> input)
addInput
in interface AcceptingData<T>
key
- the key used to store the given TSetinput
- a @StorableTBase
TSet to be added as an input