Module org.elasticsearch.compute
Class MultivalueDedupeInt
java.lang.Object
org.elasticsearch.compute.operator.mvdedupe.MultivalueDedupeInt
Removes duplicate values from multivalued positions.
This class is generated. Edit X-MultivalueDedupe.java.st instead.
Constructor Summary
Constructors
Method Summary
Method and Description
batchEncoder(int batchSize)
Build a BatchEncoder which deduplicates values at each position and then encodes the results into a byte[] which can be used for things like hashing many fields together.
dedupeToBlockAdaptive(BlockFactory blockFactory)
Remove duplicate values from each position and write the results to a Block using an adaptive algorithm based on the size of the input list.
dedupeToBlockUsingCopyAndSort(BlockFactory blockFactory)
Remove duplicate values from each position and write the results to a Block using an algorithm that sorts all values.
dedupeToBlockUsingCopyMissing(BlockFactory blockFactory)
Remove duplicate values from each position and write the results to a Block using an algorithm with very low overhead but n^2 complexity when a position holds many values.
hashAdd(BlockFactory blockFactory, LongHash hash)
Dedupe values, add them to the hash, and build an IntBlock of their hashes.
hashLookup(BlockFactory blockFactory, LongHash hash)
Dedupe values and build an IntBlock of their hashes.
sortToBlock(BlockFactory blockFactory, boolean ascending)
Sort values from each position and write the results to a Block.
Constructor Details
MultivalueDedupeInt
Method Details
dedupeToBlockAdaptive
Remove duplicate values from each position and write the results to a Block using an adaptive algorithm based on the size of the input list.
dedupeToBlockUsingCopyAndSort
Remove duplicate values from each position and write the results to a Block using an algorithm that sorts all values. It has a higher overhead for small numbers of values at each position than dedupeToBlockUsingCopyMissing(org.elasticsearch.compute.data.BlockFactory), but for large numbers of values the performance is dominated by the n*log n sort. Prefer dedupeToBlockAdaptive(org.elasticsearch.compute.data.BlockFactory) unless you need the results sorted.
dedupeToBlockUsingCopyMissing
Remove duplicate values from each position and write the results to a Block using an algorithm with very low overhead but n^2 complexity when a position holds many values. Prefer dedupeToBlockAdaptive(org.elasticsearch.compute.data.BlockFactory), which picks based on the number of elements at each position.
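The trade-off between the quadratic and the sort-based strategies, and the idea of choosing between them by input size, can be sketched on plain int arrays. This is a minimal illustration, not the generated class's code: DedupeStrategiesSketch, its method names, and ASSUMED_THRESHOLD are all hypothetical, and each int[] stands in for the values at one position.

```java
import java.util.Arrays;

/** Illustrative only: each int[] stands in for the values at one position. */
final class DedupeStrategiesSketch {
    /** Hypothetical cut-over point; the real class picks its own. */
    static final int ASSUMED_THRESHOLD = 16;

    /** Copy a value only if it is missing from the output so far: O(n^2) comparisons, first-seen order. */
    static int[] copyMissing(int[] values) {
        int[] work = new int[values.length];
        int size = 0;
        for (int v : values) {
            boolean seen = false;
            for (int i = 0; i < size; i++) {
                if (work[i] == v) {
                    seen = true;
                    break;
                }
            }
            if (!seen) {
                work[size++] = v;
            }
        }
        return Arrays.copyOf(work, size);
    }

    /** Copy everything, sort, then keep one value per run of equals: O(n log n), sorted output. */
    static int[] copyAndSort(int[] values) {
        if (values.length == 0) {
            return new int[0];
        }
        int[] work = values.clone();
        Arrays.sort(work);
        int size = 1;
        for (int i = 1; i < work.length; i++) {
            if (work[i] != work[size - 1]) {
                work[size++] = work[i];
            }
        }
        return Arrays.copyOf(work, size);
    }

    /** Pick a strategy from the number of values at the position. */
    static int[] adaptive(int[] values) {
        return values.length < ASSUMED_THRESHOLD ? copyMissing(values) : copyAndSort(values);
    }

    public static void main(String[] args) {
        int[] position = {3, 1, 3, 2, 1};
        System.out.println(Arrays.toString(copyMissing(position))); // [3, 1, 2]
        System.out.println(Arrays.toString(copyAndSort(position))); // [1, 2, 3]
    }
}
```

In this sketch the quadratic variant allocates and compares very little for short inputs, while the sort-based variant pays for the sort up front but scales better and returns its values in order.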
sortToBlock
Sort values from each position and write the results to a Block.
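As a rough picture of per-position sorting with an ascending flag, the following sketch sorts each int[] independently. SortPositionsSketch and sortPositions are hypothetical names, and the sketch keeps duplicates because the description above mentions only sorting; whether the real method also removes them is not stated here.

```java
import java.util.Arrays;

/** Illustrative only: positions[p] stands in for the values at position p. */
final class SortPositionsSketch {
    /** Sort every position on its own; the input arrays are left untouched. */
    static int[][] sortPositions(int[][] positions, boolean ascending) {
        int[][] out = new int[positions.length][];
        for (int p = 0; p < positions.length; p++) {
            int[] sorted = positions[p].clone();
            Arrays.sort(sorted);
            if (!ascending) {
                // Arrays.sort on int[] is ascending only, so reverse in place.
                for (int i = 0, j = sorted.length - 1; i < j; i++, j--) {
                    int tmp = sorted[i];
                    sorted[i] = sorted[j];
                    sorted[j] = tmp;
                }
            }
            out[p] = sorted;
        }
        return out;
    }
}
```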
hashAdd
Dedupe values, add them to the hash, and build an IntBlock of their hashes. This block is suitable for passing as the grouping block to a GroupingAggregatorFunction.
hashLookup
Dedupe values and build an IntBlock of their hashes. This block is suitable for passing as the grouping block to a GroupingAggregatorFunction.
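The difference between adding to a hash and only looking values up can be sketched with a plain map that hands out one id per distinct value. GroupIdSketch is a hypothetical stand-in, not LongHash: the id numbering and the use of -1 for an unknown value are conventions chosen for this example only.

```java
import java.util.HashMap;
import java.util.LinkedHashSet;
import java.util.Map;
import java.util.Set;

/** Illustrative only: a map from value to group id stands in for the hash. */
final class GroupIdSketch {
    private final Map<Integer, Integer> ids = new HashMap<>();

    /** Dedupe the values, register any unseen ones, and return one id per distinct value. */
    int[] add(int[] values) {
        Set<Integer> distinct = new LinkedHashSet<>();
        for (int v : values) {
            distinct.add(v);
        }
        int[] out = new int[distinct.size()];
        int i = 0;
        for (int v : distinct) {
            Integer id = ids.get(v);
            if (id == null) {
                id = ids.size(); // next free id
                ids.put(v, id);
            }
            out[i++] = id;
        }
        return out;
    }

    /** Dedupe the values and look them up without registering; -1 marks an unknown value (sketch convention). */
    int[] lookup(int[] values) {
        Set<Integer> distinct = new LinkedHashSet<>();
        for (int v : values) {
            distinct.add(v);
        }
        int[] out = new int[distinct.size()];
        int i = 0;
        for (int v : distinct) {
            out[i++] = ids.getOrDefault(v, -1);
        }
        return out;
    }
}
```

Because each value maps to a stable id, the resulting array can be used to decide which group every value at a position belongs to, which is the role the grouping block plays for an aggregator.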
batchEncoder
Build a BatchEncoder which deduplicates values at each position and then encodes the results into a byte[] which can be used for things like hashing many fields together.
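To show why a byte[] encoding is handy for hashing, here is a minimal sketch that turns one position's deduplicated values into bytes. EncodeSketch and its layout (a 4-byte count followed by each distinct value as 4 bytes, in sorted order) are assumptions made for illustration and are not the BatchEncoder format.

```java
import java.nio.ByteBuffer;
import java.util.Arrays;

/** Illustrative only: encodes the distinct values of one position as bytes. */
final class EncodeSketch {
    /** Assumed layout: 4-byte count, then each distinct value as 4 big-endian bytes, sorted. */
    static byte[] encode(int[] values) {
        int[] distinct = Arrays.stream(values).distinct().sorted().toArray();
        ByteBuffer buf = ByteBuffer.allocate(Integer.BYTES * (1 + distinct.length));
        buf.putInt(distinct.length);
        for (int v : distinct) {
            buf.putInt(v);
        }
        return buf.array();
    }
}
```

With a canonical encoding like this, two positions holding the same set of values produce identical bytes, so the encodings of several fields can be concatenated and hashed as a single key.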