|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectorg.apache.hadoop.mapred.MultiFileSplit
public class MultiFileSplit
A sub-collection of input files. Unlike FileSplit, MultiFileSplit
class does not represent a split of a file, but a split of input files
into smaller sets. The atomic unit of split is a file.
MultiFileSplit can be used to implement RecordReader's, with
reading one record per file.
FileSplit,
MultiFileInputFormat| Constructor Summary | |
|---|---|
MultiFileSplit(JobConf job,
Path[] files,
long[] lengths)
|
|
| Method Summary | |
|---|---|
long |
getLength()
Get the total number of bytes in the data of the InputSplit. |
long |
getLength(int i)
Returns the length of the ith Path |
long[] |
getLengths()
Returns an array containing the lengths of the files in the split |
String[] |
getLocations()
Get the list of hostnames where the input split is located. |
int |
getNumPaths()
Returns the number of Paths in the split |
Path |
getPath(int i)
Returns the ith Path |
Path[] |
getPaths()
Returns all the Paths in the split |
void |
readFields(DataInput in)
Deserialize the fields of this object from in. |
String |
toString()
|
void |
write(DataOutput out)
Serialize the fields of this object to out. |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
| Constructor Detail |
|---|
public MultiFileSplit(JobConf job,
Path[] files,
long[] lengths)
| Method Detail |
|---|
public long getLength()
InputSplitInputSplit.
getLength in interface InputSplitpublic long[] getLengths()
public long getLength(int i)
public int getNumPaths()
public Path getPath(int i)
public Path[] getPaths()
public String[] getLocations()
throws IOException
InputSplit
getLocations in interface InputSplitInputSplit is
located as an array of Strings.
IOException
public void readFields(DataInput in)
throws IOException
Writablein.
For efficiency, implementations should attempt to re-use storage in the existing object where possible.
readFields in interface Writablein - DataInput to deseriablize this object from.
IOException
public void write(DataOutput out)
throws IOException
Writableout.
write in interface Writableout - DataOuput to serialize this object into.
IOExceptionpublic String toString()
toString in class Object
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||