public class DistributedJoin
extends java.lang.Object

Modifier and Type | Class and Description |
---|---|
static class | DistributedJoin.DJInputFormatArray: Input format that returns a record reader that reads a pair of arrays of shapes |
static class | DistributedJoin.DJInputFormatRTree<S extends Shape>: Input format that returns a record reader that reads a pair of arrays of shapes |
static class | DistributedJoin.RedistributeJoinMap |
static class | DistributedJoin.RedistributeJoinMapNoDupAvoidance |
static class | DistributedJoin.RepartitionJoinReduce<T extends Shape> |
static class | DistributedJoin.SpatialJoinFilter |


Modifier and Type | Field and Description |
---|---|
static boolean | isFilterOnly |
static boolean | isGeneralRepartitionMode |
static boolean | isOneShotReadMode |
static boolean | isReduceInactive |
static int | joiningThresholdPerOnce |
static RunningJob | lastRunningJob |
static int | maxBytesInOneRead |
static int | maxShapesInOneRead |

Constructor and Description |
---|
DistributedJoin() |

Modifier and Type | Method and Description |
---|---|
static long | distributedJoinSmart(Path[] inputFiles, Path userOutputPath, OperationsParams params): Spatially joins two files. |
static <S extends Shape> long | joinStep(Path[] inFiles, Path userOutputPath, OperationsParams params): Joins the given files using the redistribute join algorithm. |
static void | main(java.lang.String[] args) |
protected static long | repartitionJoinStep(Path[] inputFiles, int fileToRepartition, Path outputFile, OperationsParams params): Spatially joins two datasets by repartitioning the smaller dataset based on the larger one, then applying a one-to-one join to each partition. |
protected static void | repartitionStep(Path[] files, int file_to_repartition, OperationsParams params): Repartitions a file to match the partitioning of the other file. |
protected static int | selectRepartition(Path[] files, OperationsParams params) |

public static RunningJob lastRunningJob
public static int maxBytesInOneRead
public static int maxShapesInOneRead
public static boolean isOneShotReadMode
public static boolean isGeneralRepartitionMode
public static boolean isReduceInactive
public static boolean isFilterOnly
public static int joiningThresholdPerOnce
protected static int selectRepartition(Path[] files, OperationsParams params) throws java.io.IOException
Throws:
java.io.IOException
protected static void repartitionStep(Path[] files, int file_to_repartition, OperationsParams params) throws java.io.IOException, java.lang.InterruptedException
Parameters:
files - The two input files
file_to_repartition - Index in files of the file to repartition
params - Job configuration
Throws:
java.io.IOException
java.lang.InterruptedException
public static <S extends Shape> long joinStep(Path[] inFiles, Path userOutputPath, OperationsParams params) throws java.io.IOException
Parameters:
inFiles - Input files to join
userOutputPath - Path of the join output
params - Job configuration
Throws:
java.io.IOException
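The general idea behind a redistribute join can be illustrated with a small in-memory sketch. It assumes both inputs are already spatially partitioned and that each partition carries a bounding rectangle covering all of its records; only partition pairs whose bounding rectangles overlap are joined. The names `RedistributeJoinSketch`, `Rect`, and `Partition` are hypothetical and are not part of this class. The sketch does not reproduce the MapReduce implementation of joinStep.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Illustrative only: a tiny in-memory analogue of a redistribute join. */
class RedistributeJoinSketch {

    /** Axis-aligned rectangle standing in for a shape's bounding box (hypothetical type). */
    static final class Rect {
        final double x1, y1, x2, y2;

        Rect(double x1, double y1, double x2, double y2) {
            this.x1 = x1; this.y1 = y1; this.x2 = x2; this.y2 = y2;
        }

        /** True when the interiors of the two rectangles intersect. */
        boolean overlaps(Rect o) {
            return x1 < o.x2 && o.x1 < x2 && y1 < o.y2 && o.y1 < y2;
        }
    }

    /** A partition: the records stored together plus a rectangle that covers them. */
    static final class Partition {
        final Rect mbr;
        final List<Rect> records;

        Partition(Rect mbr, Rect... records) {
            this.mbr = mbr;
            this.records = Arrays.asList(records);
        }
    }

    /** Joins only those partition pairs whose bounding rectangles overlap. */
    static List<Rect[]> join(List<Partition> left, List<Partition> right) {
        List<Rect[]> result = new ArrayList<>();
        for (Partition p : left) {
            for (Partition q : right) {
                if (!p.mbr.overlaps(q.mbr)) {
                    continue; // disjoint partitions cannot contain overlapping records
                }
                for (Rect r : p.records) {
                    for (Rect s : q.records) {
                        if (r.overlaps(s)) {
                            result.add(new Rect[] {r, s});
                        }
                    }
                }
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<Partition> left = Arrays.asList(
            new Partition(new Rect(0, 0, 5, 5), new Rect(1, 1, 2, 2), new Rect(3, 3, 5, 5)),
            new Partition(new Rect(5, 0, 10, 5), new Rect(6, 1, 7, 2)));
        List<Partition> right = Arrays.asList(
            new Partition(new Rect(0, 0, 4, 4), new Rect(1.5, 1.5, 4, 4)),
            new Partition(new Rect(8, 8, 9, 9), new Rect(8, 8, 9, 9)));
        System.out.println(join(left, right).size() + " overlapping pairs");
    }
}
```

The sketch assumes each record is stored in exactly one partition of its file, so no pair is reported twice. If records were replicated across partitions, a duplicate-avoidance step would be needed, which the sketch leaves out.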
protected static long repartitionJoinStep(Path[] inputFiles, int fileToRepartition, Path outputFile, OperationsParams params) throws java.io.IOException
Parameters:
inputFiles - Input datasets to be spatially joined
fileToRepartition - Index of the file to repartition
outputFile - Output file that will contain the join results
params - Job configuration
Throws:
java.io.IOException
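The two phases described for this method (repartition the smaller dataset, then join partition by partition) can be sketched in memory as follows. This is a minimal illustration under stated assumptions, not the MapReduce implementation: `RepartitionJoinSketch`, `Rect`, `repartition`, and `join` are hypothetical names, shapes are reduced to bounding rectangles, and each partition of the larger dataset is assumed to have a bounding rectangle that covers all of its records.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Illustrative only: repartition the smaller dataset, then join partition by partition. */
class RepartitionJoinSketch {

    /** Axis-aligned rectangle standing in for a shape's bounding box (hypothetical type). */
    static final class Rect {
        final double x1, y1, x2, y2;

        Rect(double x1, double y1, double x2, double y2) {
            this.x1 = x1; this.y1 = y1; this.x2 = x2; this.y2 = y2;
        }

        /** True when the interiors of the two rectangles intersect. */
        boolean overlaps(Rect o) {
            return x1 < o.x2 && o.x1 < x2 && y1 < o.y2 && o.y1 < y2;
        }
    }

    /**
     * Phase 1: copies every record of the smaller dataset into a bucket for each
     * partition of the larger dataset whose bounding rectangle it overlaps.
     * A record may therefore appear in several buckets.
     */
    static List<List<Rect>> repartition(List<Rect> small, List<Rect> partitionMbrs) {
        List<List<Rect>> buckets = new ArrayList<>();
        for (Rect mbr : partitionMbrs) {
            List<Rect> bucket = new ArrayList<>();
            for (Rect s : small) {
                if (s.overlaps(mbr)) {
                    bucket.add(s);
                }
            }
            buckets.add(bucket);
        }
        return buckets;
    }

    /** Phase 2: joins partition i of the larger dataset against bucket i only. */
    static int join(List<List<Rect>> largePartitions, List<List<Rect>> buckets) {
        int pairs = 0;
        for (int i = 0; i < largePartitions.size(); i++) {
            for (Rect a : largePartitions.get(i)) {
                for (Rect b : buckets.get(i)) {
                    if (a.overlaps(b)) {
                        pairs++;
                    }
                }
            }
        }
        return pairs;
    }

    public static void main(String[] args) {
        // Larger dataset: two partitions; each record is stored in exactly one of them.
        List<List<Rect>> large = Arrays.asList(
            Arrays.asList(new Rect(0, 0, 2, 2), new Rect(3, 3, 5, 5)),
            Arrays.asList(new Rect(6, 0, 8, 2)));
        // Bounding rectangles of those partitions (assumed to cover their records).
        List<Rect> mbrs = Arrays.asList(new Rect(0, 0, 5, 5), new Rect(6, 0, 8, 2));
        // Smaller dataset: the first record spans both partitions, the second spans none.
        List<Rect> small = Arrays.asList(new Rect(1, 1, 7, 1.5), new Rect(9, 9, 10, 10));
        System.out.println(join(large, repartition(small, mbrs)) + " overlapping pairs");
    }
}
```

Because every record of the larger dataset lives in exactly one partition in this sketch, each matching pair is counted once even though records of the smaller dataset are replicated across buckets.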
public static long distributedJoinSmart(Path[] inputFiles, Path userOutputPath, OperationsParams params) throws java.io.IOException, java.lang.InterruptedException
Parameters:
inputFiles - Input files to join
userOutputPath - Path of the join output
params - Job configuration
Throws:
java.io.IOException
java.lang.InterruptedException
public static void main(java.lang.String[] args) throws java.io.IOException, java.lang.InterruptedException
Throws:
java.io.IOException
java.lang.InterruptedException