BatchTSetImpl

java.lang.Object
- edu.iu.dsc.tws.tset.sets.BaseTSet<T>
- - edu.iu.dsc.tws.tset.sets.BaseTSetWithSchema<T>
  - - edu.iu.dsc.tws.tset.sets.batch.BatchTSetImpl<T>

All Implemented Interfaces:

AcceptingData<T>, BatchTSet<T>, TSet<T>, StoringData<T>, TBase, Buildable, BuildableTSet, java.io.Serializable

Direct Known Subclasses:

ComputeTSet, SourceTSet, StoredTSet
```
public abstract class BatchTSetImpl<T>
extends BaseTSetWithSchema<T>
implements BatchTSet<T>
```
See Also:

Serialized Form

Nested Class Summary
- Nested classes/interfaces inherited from class edu.iu.dsc.tws.tset.sets.BaseTSet
  BaseTSet.StateType

Constructor Summary

Constructors
Constructor and Description

BatchTSetImpl()

Constructors
Constructor and Description
`BatchTSetImpl()`

Method Summary

All Methods Instance Methods Concrete Methods
Modifier and Type	Method and Description
`BatchTSetImpl<T>`	`addInput(java.lang.String key, StorableTBase<?> input)` Adds data to this TSet
`AllGatherTLink<T>`	`allGather()` Same as gather, but all the target TSet instances would receive the gathered result in the runtime.
`AllReduceTLink<T>`	`allReduce(ReduceFunc<T> reduceFn)` Similar to reduce, but all instances of the target `TSet` would receive the reduced result.
`CachedTSet<T>`	`cache()` Runs this TSet and caches the data to an in-memory `DataPartition` and exposes the data as another TSet.
`DirectTLink<T>`	`direct()` Returns a Direct `TLink` that corresponds to the communication operation where the data will be transferred to another TSet directly.
`GatherTLink<T>`	`gather()` Returns a Gather `TLink` that would gather data to the target TSet instance with index 0 (in the runtime).
`BatchEnvironment`	`getTSetEnv()` tset env
`CachedTSet<T>`	`lazyCache()` Performs caching lazily.
`PersistedTSet<T>`	`lazyPersist()` Performs persisting lazily.
`<K,V> KeyedTSet<K,V>`	`mapToTuple(MapFunc<Tuple<K,V>,T> mapToTupleFn)` Creates a `TupleTSet` based on the `MapFunc` provided.
`PartitionTLink<T>`	`partition(PartitionFunc<T> partitionFn)` Same as above, but the parallelism will be preserved in the target `TSet`.
`PartitionTLink<T>`	`partition(PartitionFunc<T> partitionFn, int targetParallelism)` Returns a Partition `TLink` that would partition data according based on a function provided.
`PersistedTSet<T>`	`persist()` Similar to cache, but the data is stored in a disk based `DataPartition`.
`PipeTLink<T>`	`pipe()`
`ReduceTLink<T>`	`reduce(ReduceFunc<T> reduceFn)` Returns a Reduce `TLink` that reduce data on to the target TSet instance (in the runtime) with index 0.
`ReplicateTLink<T>`	`replicate(int replications)` Returns a Replicate `TLink` that would clone/broadcast the data from this `TSet`.
`ComputeTSet<T,java.util.Iterator<T>>`	`union(java.util.Collection<TSet<T>> tSets)` Same as above, but accepts a `Collection` of `TSet`s.
`ComputeTSet<T,java.util.Iterator<T>>`	`union(TSet<T> other)` Returns a single `TSet` that create a union of data in both `TSet`s.
`BatchTSetImpl<T>`	`withSchema(Schema schema)` Sets the data type of the `TSet` output.

Methods inherited from class edu.iu.dsc.tws.tset.sets.BaseTSetWithSchema
getInputSchema, getOutputSchema, setOutputSchema

Methods inherited from class edu.iu.dsc.tws.tset.sets.BaseTSet
addChildToGraph, addChildToGraph, equals, getId, getInputs, getName, getParallelism, getStateType, hashCode, isMutable, rename, setMutable, setStateType, setTSetEnv, toString

Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, wait, wait, wait

Methods inherited from interface edu.iu.dsc.tws.api.tset.sets.batch.BatchTSet
setName

Methods inherited from interface edu.iu.dsc.tws.api.tset.TBase
getId, getName

Methods inherited from interface edu.iu.dsc.tws.tset.sets.BuildableTSet
build, getINode

Methods inherited from interface edu.iu.dsc.tws.tset.Buildable
generateID, getTBaseGraph

- Constructor Detail
  - BatchTSetImpl
```
public BatchTSetImpl()
```
- Method Detail
  - getTSetEnv
```
public BatchEnvironment getTSetEnv()
```
    Description copied from interface: Buildable
    
    tset env
    
    Specified by:
    
    getTSetEnv in interface Buildable
    
    Overrides:
    
    getTSetEnv in class BaseTSet<T>
    
    Returns:
    
    tset env
  - direct
```
public DirectTLink<T> direct()
```
    Description copied from interface: TSet
    
    Returns a Direct TLink that corresponds to the communication operation where the data will be transferred to another TSet directly.
    
    Specified by:
    
    direct in interface BatchTSet<T>
    
    Specified by:
    
    direct in interface TSet<T>
    
    Returns:
    
    Direct TLink
  - pipe
```
public PipeTLink<T> pipe()
```
    Specified by:
    
    pipe in interface BatchTSet<T>
  - reduce
```
public ReduceTLink<T> reduce(ReduceFunc<T> reduceFn)
```
    Description copied from interface: TSet
    
    Returns a Reduce TLink that reduce data on to the target TSet instance (in the runtime) with index 0.
    
    Specified by:
    
    reduce in interface BatchTSet<T>
    
    Specified by:
    
    reduce in interface TSet<T>
    
    Parameters:
    
    reduceFn - Reduce function
    
    Returns:
    
    Reduce TLink
  - partition
```
public PartitionTLink<T> partition(PartitionFunc<T> partitionFn,
                                   int targetParallelism)
```
    Description copied from interface: TSet
    
    Returns a Partition TLink that would partition data according based on a function provided. The parallelism of the target TSet can also be specified.
    
    Specified by:
    
    partition in interface BatchTSet<T>
    
    Specified by:
    
    partition in interface TSet<T>
    
    Parameters:
    
    partitionFn - Partition function
    
    targetParallelism - Target parallelism
    
    Returns:
    
    Partition TLink
  - partition
```
public PartitionTLink<T> partition(PartitionFunc<T> partitionFn)
```
    Description copied from interface: TSet
    
    Same as above, but the parallelism will be preserved in the target TSet.
    
    Specified by:
    
    partition in interface BatchTSet<T>
    
    Specified by:
    
    partition in interface TSet<T>
    
    Parameters:
    
    partitionFn - Partition function
    
    Returns:
    
    Partition TLink
  - gather
```
public GatherTLink<T> gather()
```
    Description copied from interface: TSet
    
    Returns a Gather TLink that would gather data to the target TSet instance with index 0 (in the runtime).
    
    Specified by:
    
    gather in interface BatchTSet<T>
    
    Specified by:
    
    gather in interface TSet<T>
    
    Returns:
    
    Gather TLink
  - allReduce
```
public AllReduceTLink<T> allReduce(ReduceFunc<T> reduceFn)
```
    Description copied from interface: TSet
    
    Similar to reduce, but all instances of the target TSet would receive the reduced result.
    
    Specified by:
    
    allReduce in interface BatchTSet<T>
    
    Specified by:
    
    allReduce in interface TSet<T>
    
    Parameters:
    
    reduceFn - Reduce function
    
    Returns:
    
    AllReduce TLink
  - allGather
```
public AllGatherTLink<T> allGather()
```
    Description copied from interface: TSet
    
    Same as gather, but all the target TSet instances would receive the gathered result in the runtime.
    
    Specified by:
    
    allGather in interface BatchTSet<T>
    
    Specified by:
    
    allGather in interface TSet<T>
    
    Returns:
    
    AllGather TLink
  - mapToTuple
```
public <K,V> KeyedTSet<K,V> mapToTuple(MapFunc<Tuple<K,V>,T> mapToTupleFn)
```
    Description copied from interface: TSet
    
    Creates a TupleTSet based on the MapFunc provided. This will an entry point to keyed communication operations from a non-keyed TSet.
    
    Specified by:
    
    mapToTuple in interface BatchTSet<T>
    
    Specified by:
    
    mapToTuple in interface TSet<T>
    
    Type Parameters:
    
    K - type of key
    
    V - type of value
    
    Parameters:
    
    mapToTupleFn - Map function
    
    Returns:
    
    Tuple TSet
  - union
```
public ComputeTSet<T,java.util.Iterator<T>> union(TSet<T> other)
```
    Description copied from interface: TSet
    
    Returns a single TSet that create a union of data in both TSets. In order for this to work both TSets should be of the same type
    
    Specified by:
    
    union in interface BatchTSet<T>
    
    Specified by:
    
    union in interface TSet<T>
    
    Parameters:
    
    other - TSet to union with
    
    Returns:
    
    Union TSet
  - union
```
public ComputeTSet<T,java.util.Iterator<T>> union(java.util.Collection<TSet<T>> tSets)
```
    Description copied from interface: TSet
    
    Same as above, but accepts a Collection of TSets.
    
    Specified by:
    
    union in interface BatchTSet<T>
    
    Specified by:
    
    union in interface TSet<T>
    
    Parameters:
    
    tSets - a collection of TSet's to union with
    
    Returns:
    
    Union TSet
  - replicate
```
public ReplicateTLink<T> replicate(int replications)
```
    Description copied from interface: TSet
    
    Returns a Replicate TLink that would clone/broadcast the data from this TSet. Note that the parallelism of this TSet should be 1.
    
    Specified by:
    
    replicate in interface BatchTSet<T>
    
    Specified by:
    
    replicate in interface TSet<T>
    
    Parameters:
    
    replications - Replicas of data (= target TSet parallelism)
    
    Returns:
    
    Replicate TLink
  - cache
```
public CachedTSet<T> cache()
```
    Description copied from interface: StoringData
    
    Runs this TSet and caches the data to an in-memory DataPartition and exposes the data as another TSet.
    
    Specified by:
    
    cache in interface StoringData<T>
    
    Returns:
    
    Cached TSet
  - lazyCache
```
public CachedTSet<T> lazyCache()
```
    Description copied from interface: StoringData
    
    Performs caching lazily. i.e. cache operation would only be performed when the TSet is evaluated explicitly.
    
    Specified by:
    
    lazyCache in interface StoringData<T>
    
    Returns:
    
    Cached TSet
  - persist
```
public PersistedTSet<T> persist()
```
    Description copied from interface: StoringData
    
    Similar to cache, but the data is stored in a disk based DataPartition. This method would also expose the checkpointing ability to TSets.
    
    Specified by:
    
    persist in interface StoringData<T>
    
    Returns:
    
    Persisted / Checkpointed TSets
  - lazyPersist
```
public PersistedTSet<T> lazyPersist()
```
    Description copied from interface: StoringData
    
    Performs persisting lazily.
    
    Specified by:
    
    lazyPersist in interface StoringData<T>
    
    Returns:
    
    Persisted / Checkpointed TSets
  - addInput
```
public BatchTSetImpl<T> addInput(java.lang.String key,
                                 StorableTBase<?> input)
```
    Description copied from interface: BatchTSet
    
    Adds data to this TSet
    
    Specified by:
    
    addInput in interface AcceptingData<T>
    
    Specified by:
    
    addInput in interface BatchTSet<T>
    
    Parameters:
    
    key - the key used to store the given TSet
    
    input - a @StorableTBase TSet to be added as an input
    
    Returns:
    
    this test
  - withSchema
```
public BatchTSetImpl<T> withSchema(Schema schema)
```
    Description copied from interface: TSet
    
    Sets the data type of the TSet output. This will be used in the packers for efficient SER-DE operations in the following TLinks
    
    Specified by:
    
    withSchema in interface TSet<T>
    
    Parameters:
    
    schema - data type as a MessageType
    
    Returns:
    
    this TSet

Class BatchTSetImpl<T>

Nested Class Summary

Nested classes/interfaces inherited from class edu.iu.dsc.tws.tset.sets.BaseTSet

Constructor Summary

Method Summary

Methods inherited from class edu.iu.dsc.tws.tset.sets.BaseTSetWithSchema

Methods inherited from class edu.iu.dsc.tws.tset.sets.BaseTSet

Methods inherited from class java.lang.Object

Methods inherited from interface edu.iu.dsc.tws.api.tset.sets.batch.BatchTSet

Methods inherited from interface edu.iu.dsc.tws.api.tset.TBase

Methods inherited from interface edu.iu.dsc.tws.tset.sets.BuildableTSet

Methods inherited from interface edu.iu.dsc.tws.tset.Buildable

Constructor Detail

BatchTSetImpl

Method Detail

getTSetEnv

direct

pipe

reduce

partition

partition

gather

allReduce

allGather

mapToTuple

union

union

replicate

cache

lazyCache

persist

lazyPersist

addInput

withSchema