public class PersistedTSet<T> extends StoredTSet<T>
BaseTSet.StateType
storedSource
Constructor and Description |
---|
PersistedTSet(BatchEnvironment tSetEnv,
SinkFunc<?> sinkFunc,
int parallelism,
Schema inputSchema) |
Modifier and Type | Method and Description |
---|---|
PersistedTSet<T> |
addInput(java.lang.String key,
StorableTBase<?> input)
Adds data to this TSet
|
CachedTSet<T> |
cache()
Runs this TSet and caches the data to an in-memory
DataPartition and exposes the data as another TSet. |
CachedTSet<T> |
lazyCache()
Performs caching lazily.
|
PersistedTSet<T> |
lazyPersist()
Performs persisting lazily.
|
PersistedTSet<T> |
persist()
Similar to cache, but the data is stored in a disk based
DataPartition . |
PersistedTSet<T> |
setName(java.lang.String n)
Sets the name for the
TBase |
PersistedTSet<T> |
withSchema(Schema schema)
Sets the data type of the
TSet output. |
allGather, allReduce, direct, gather, getData, getDataObject, getINode, getStoredSourceTSet, mapToTuple, partition, partition, pipe, reduce, replicate, union, union
getTSetEnv
getInputSchema, getOutputSchema, setOutputSchema
addChildToGraph, addChildToGraph, equals, getId, getInputs, getName, getParallelism, getStateType, hashCode, isMutable, rename, setMutable, setStateType, setTSetEnv, toString
clone, finalize, getClass, notify, notifyAll, wait, wait, wait
build
generateID, getTBaseGraph
public PersistedTSet(BatchEnvironment tSetEnv, SinkFunc<?> sinkFunc, int parallelism, Schema inputSchema)
public PersistedTSet<T> persist()
StoringData
DataPartition
. This method would also expose the
checkpointing ability to TSet
s.persist
in interface StoringData<T>
persist
in class BatchTSetImpl<T>
public PersistedTSet<T> lazyPersist()
StoringData
lazyPersist
in interface StoringData<T>
lazyPersist
in class BatchTSetImpl<T>
public CachedTSet<T> cache()
StoringData
DataPartition
and exposes the data as another TSet.cache
in interface StoringData<T>
cache
in class BatchTSetImpl<T>
public CachedTSet<T> lazyCache()
StoringData
TSet
is evaluated explicitly.lazyCache
in interface StoringData<T>
lazyCache
in class BatchTSetImpl<T>
public PersistedTSet<T> setName(java.lang.String n)
TBase
TBase
public PersistedTSet<T> addInput(java.lang.String key, StorableTBase<?> input)
BatchTSet
addInput
in interface AcceptingData<T>
addInput
in interface BatchTSet<T>
addInput
in class StoredTSet<T>
key
- the key used to store the given TSetinput
- a @StorableTBase
TSet to be added as an inputpublic PersistedTSet<T> withSchema(Schema schema)
TSet
TSet
output. This will be used in the packers for efficient
SER-DE operations in the following TLink
swithSchema
in interface TSet<T>
withSchema
in class BatchTSetImpl<T>
schema
- data type as a MessageType
TSet