public class PipeTLink<T> extends BatchTLinkImpl<T,T>
Constructor and Description |
---|
PipeTLink(BatchEnvironment tSetEnv,
int sourceParallelism,
Schema schema) |
PipeTLink(BatchEnvironment tSetEnv,
java.lang.String name,
int sourceParallelism,
Schema schema) |
Modifier and Type | Method and Description |
---|---|
CachedTSet<T> |
cache()
Runs this TSet and caches the data to an in-memory
DataPartition and exposes the data as another TSet. |
<O> ComputeTSet<O,T> |
flatmap(FlatMapFunc<O,T> mapFn)
Performs flat map operation based on the
FlatMapFunc provided |
void |
forEach(ApplyFunc<T> applyFunction)
Applies a function elementwise.
|
Edge |
getEdge() |
CachedTSet<T> |
lazyCache()
Performs caching lazily.
|
ComputeTSet<java.lang.Object,T> |
lazyForEach(ApplyFunc<T> applyFunction) |
PersistedTSet<T> |
lazyPersist()
Performs persisting lazily.
|
<O> ComputeTSet<O,T> |
map(MapFunc<O,T> mapFn)
Performs elementwise map operation based on the
MapFunc provided |
<K,V> KeyedTSet<K,V> |
mapToTuple(MapFunc<Tuple<K,V>,T> genTupleFn)
|
PersistedTSet<T> |
persist()
Similar to cache, but the data is stored in a disk based
DataPartition . |
PipeTLink<T> |
setName(java.lang.String name)
Name of the TSet and return the same tlink
|
compute, compute, compute, compute, getTSetEnv, sink
getSchema
addChildToGraph, equals, getId, getName, getSourceParallelism, getTargetParallelism, hashCode, rename, setTSetEnv, toString
clone, finalize, getClass, notify, notifyAll, wait, wait, wait
build
generateID, getTBaseGraph
public PipeTLink(BatchEnvironment tSetEnv, int sourceParallelism, Schema schema)
public PipeTLink(BatchEnvironment tSetEnv, java.lang.String name, int sourceParallelism, Schema schema)
public PipeTLink<T> setName(java.lang.String name)
TLink
name
- namepublic void forEach(ApplyFunc<T> applyFunction)
TLink
ApplyFunc
does not
return anything.applyFunction
- apply functionpublic ComputeTSet<java.lang.Object,T> lazyForEach(ApplyFunc<T> applyFunction)
public <K,V> KeyedTSet<K,V> mapToTuple(MapFunc<Tuple<K,V>,T> genTupleFn)
TLink
public <O> ComputeTSet<O,T> flatmap(FlatMapFunc<O,T> mapFn)
TLink
FlatMapFunc
providedO
- map function to T0 to multiple elements of <O>mapFn
- map function which can produce multiple elements for a single <T0> elementpublic <O> ComputeTSet<O,T> map(MapFunc<O,T> mapFn)
TLink
MapFunc
providedO
- output typemapFn
- map function T0 to Opublic Edge getEdge()
public CachedTSet<T> lazyCache()
StoringData
TSet
is evaluated explicitly.public PersistedTSet<T> lazyPersist()
StoringData
public CachedTSet<T> cache()
StoringData
DataPartition
and exposes the data as another TSet.cache
in interface StoringData<T>
cache
in class BatchTLinkImpl<T,T>
public PersistedTSet<T> persist()
StoringData
DataPartition
. This method would also expose the
checkpointing ability to TSet
s.persist
in interface StoringData<T>
persist
in class BatchTLinkImpl<T,T>