public interface BatchRowTSet extends TBase, AcceptingData<Row>, StoringData<Row>
Modifier and Type | Method and Description |
---|---|
BatchRowTLink |
direct()
Direct/pipe communication
|
BatchRowTLink |
join(BatchRowTSet rightTSet,
CommunicationContext.JoinType type,
java.util.Comparator<Row> keyComparator)
Joins with another
BatchTupleTSet . |
BatchRowTLink |
partition(PartitionFunc<Row> partitionFn,
int columnIndex,
int targetParallelism)
Returns a Partition
TLink that would partition data according based on a function
provided. |
BatchRowTSet |
withSchema(RowSchema dataType)
Sets the data type of the
TSet output. |
addInput
cache, lazyCache, lazyPersist, persist
BatchRowTLink direct()
BatchRowTLink partition(PartitionFunc<Row> partitionFn, int columnIndex, int targetParallelism)
TLink
that would partition data according based on a function
provided. The parallelism of the target TSet
can also be specified.partitionFn
- Partition functioncolumnIndex
- column index to usetargetParallelism
- Target parallelismBatchRowTLink join(BatchRowTSet rightTSet, CommunicationContext.JoinType type, java.util.Comparator<Row> keyComparator)
BatchTupleTSet
. Note that this TSet will be considered the left
TSetrightTSet
- right tsettype
- CommunicationContext.JoinType
keyComparator
- key comparatorBatchRowTSet withSchema(RowSchema dataType)
TSet
output. This will be used in the packers for efficient
SER-DE operations in the following TLink
sdataType
- data type as a MessageType
TSet