public interface BatchRowTSet extends TBase, AcceptingData<Row>, StoringData<Row>
| Modifier and Type | Method and Description |
|---|---|
| `BatchRowTLink` | `direct()`: Direct/pipe communication. |
| `BatchRowTLink` | `join(BatchRowTSet rightTSet, CommunicationContext.JoinType type, java.util.Comparator<Row> keyComparator)`: Joins with another BatchTupleTSet. |
| `BatchRowTLink` | `partition(PartitionFunc<Row> partitionFn, int columnIndex, int targetParallelism)`: Returns a Partition TLink that partitions data according to the provided function. |
| `BatchRowTSet` | `withSchema(RowSchema dataType)`: Sets the data type of the TSet output. |

Inherited methods: `addInput`, `cache`, `lazyCache`, `lazyPersist`, `persist`

`BatchRowTLink direct()`

`BatchRowTLink partition(PartitionFunc<Row> partitionFn, int columnIndex, int targetParallelism)`

Returns a Partition TLink that partitions data according to the provided function. The parallelism of the target TSet can also be specified.

Parameters:
- `partitionFn`: partition function
- `columnIndex`: column index to use
- `targetParallelism`: target parallelism

`BatchRowTLink join(BatchRowTSet rightTSet, CommunicationContext.JoinType type, java.util.Comparator<Row> keyComparator)`

Joins with another BatchTupleTSet. Note that this TSet will be considered the left TSet.

Parameters:
- `rightTSet`: right TSet
- `type`: `CommunicationContext.JoinType`
- `keyComparator`: key comparator

`BatchRowTSet withSchema(RowSchema dataType)`

Sets the data type of the TSet output. This is used by the packers for efficient SER-DE operations in the following TLinks.

Parameters:
- `dataType`: data type as a MessageType

Returns:
- TSet
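
The semantics of `partition` and `join` described above can be illustrated with a small, library-free sketch. It does not call the Twister2 API: `RowOpsSketch`, `SimpleRow`, `partitionOf`, and `innerJoin` are hypothetical stand-ins, and the sketch assumes hash partitioning on the chosen column and an inner join only. The real operations are distributed and may behave differently in detail.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;
import java.util.Objects;

// Illustrative stand-ins only: SimpleRow and RowOpsSketch are NOT Twister2 classes.
final class RowOpsSketch {

  /** Hypothetical minimal row: an ordered list of column values. */
  static final class SimpleRow {
    final List<Object> columns;

    SimpleRow(Object... values) {
      this.columns = Arrays.asList(values);
    }

    Object get(int columnIndex) {
      return columns.get(columnIndex);
    }
  }

  /**
   * Assumed partitioning rule: hash the value in columnIndex and map it
   * to a target index in [0, targetParallelism).
   */
  static int partitionOf(SimpleRow row, int columnIndex, int targetParallelism) {
    int hash = Objects.hashCode(row.get(columnIndex));
    // floorMod keeps the result non-negative even for negative hashes.
    return Math.floorMod(hash, targetParallelism);
  }

  /**
   * Nested-loop inner join. The comparator receives (leftRow, rightRow);
   * a result of 0 means the keys match. Left columns come first in the output,
   * mirroring "this TSet will be considered the left TSet".
   */
  static List<SimpleRow> innerJoin(List<SimpleRow> left, List<SimpleRow> right,
                                   Comparator<SimpleRow> keyComparator) {
    List<SimpleRow> out = new ArrayList<>();
    for (SimpleRow l : left) {
      for (SimpleRow r : right) {
        if (keyComparator.compare(l, r) == 0) {
          List<Object> merged = new ArrayList<>(l.columns);
          merged.addAll(r.columns);
          out.add(new SimpleRow(merged.toArray()));
        }
      }
    }
    return out;
  }

  public static void main(String[] args) {
    List<SimpleRow> left = Arrays.asList(new SimpleRow(1, "apple"), new SimpleRow(2, "pear"));
    List<SimpleRow> right = Arrays.asList(new SimpleRow(2, 0.5), new SimpleRow(3, 0.9));

    // Compare rows by the integer key stored in column 0.
    Comparator<SimpleRow> byFirstColumn = Comparator.comparingInt(r -> (Integer) r.get(0));

    for (SimpleRow row : left) {
      System.out.println(row.columns + " -> partition " + partitionOf(row, 0, 4));
    }
    for (SimpleRow joined : innerJoin(left, right, byFirstColumn)) {
      System.out.println("joined: " + joined.columns);
    }
  }
}
```

In this sketch only the key `2` appears on both sides, so the join yields a single row whose columns are the left row's followed by the right row's.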