matlab.io.datastore.HadoopFileBased Class
Namespace: matlab.io.datastore
(Not recommended) Add Hadoop file support to datastore
matlab.io.datastore.HadoopFileBased is not recommended. Use matlab.io.datastore.HadoopLocationBased instead.
Description
matlab.io.datastore.HadoopFileBased is an abstract mixin class that adds Hadoop® support to your custom datastore.
To use this mixin class, you must inherit from the matlab.io.datastore.HadoopFileBased class in addition to inheriting from the matlab.io.Datastore base class. Type the following syntax as the first line of your class definition file:
    classdef MyDatastore < matlab.io.Datastore & ...
                           matlab.io.datastore.HadoopFileBased
        ...
    end
To add Hadoop support along with parallel processing support, use these lines in your class definition file:
    classdef MyDatastore < matlab.io.Datastore & ...
                           matlab.io.datastore.Partitionable & ...
                           matlab.io.datastore.HadoopFileBased
        ...
    end
To add support for Hadoop to your custom datastore, you must:
- Inherit from the additional class matlab.io.datastore.HadoopFileBased.
- Define these additional methods: getLocation, initializeDatastore, and isfullfile.
For more details and steps to create your custom datastore with support for Hadoop, see Develop Custom Datastore.
Methods
getLocation | (Not recommended) Location of files in Hadoop |
initializeDatastore | (Not recommended) Initialize datastore with information from Hadoop |
isfullfile | (Not recommended) Check if datastore reads full files |
Examples
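The requirements listed in the Description can be sketched as a single class definition file. This is a minimal sketch, not a definitive implementation: it assumes a hypothetical datastore that returns raw bytes from .bin files and manages them with matlab.io.datastore.DsFileSet. The class name MyDatastore, the helper readSplit, the .bin extension, and the 8 KB split size are all illustrative assumptions, and the code has not been verified against a Hadoop cluster.

```matlab
classdef MyDatastore < matlab.io.Datastore & ...
        matlab.io.datastore.HadoopFileBased
    % Hypothetical datastore that returns raw bytes from .bin files.

    properties (Access = private)
        FileSet matlab.io.datastore.DsFileSet
    end

    methods
        function myds = MyDatastore(location)
            % Extension and split size are illustrative choices.
            myds.FileSet = matlab.io.datastore.DsFileSet(location, ...
                'FileExtensions','.bin', ...
                'FileSplitSize',8*1024);
            reset(myds);
        end

        function tf = hasdata(myds)
            % True while unread splits remain.
            tf = hasfile(myds.FileSet);
        end

        function [data,info] = read(myds)
            fileInfo = nextfile(myds.FileSet);   % one-row table per split
            data = readSplit(fileInfo);
            info.FileName = fileInfo.FileName;
            info.Offset = fileInfo.Offset;
            info.Size = size(data);
        end

        function reset(myds)
            reset(myds.FileSet);
        end
    end

    methods (Hidden = true)
        function frac = progress(myds)
            % Coarse progress: 0 while splits remain, 1 when exhausted.
            frac = double(~hasdata(myds));
        end

        % Methods required by matlab.io.datastore.HadoopFileBased.
        function initializeDatastore(myds,hadoopInfo)
            % Rebuild the file set from the split information that
            % Hadoop supplies for this datastore.
            myds.FileSet = matlab.io.datastore.DsFileSet(hadoopInfo, ...
                'FileSplitSize',myds.FileSet.FileSplitSize, ...
                'FileExtensions','.bin');
            reset(myds);
        end

        function loc = getLocation(myds)
            % Location of the files this datastore reads.
            loc = myds.FileSet;
        end

        function tf = isfullfile(myds)
            % True only if each read consumes a whole file.
            tf = isequal(myds.FileSet.FileSplitSize,'file');
        end
    end

    methods (Access = protected)
        function dscopy = copyElement(myds)
            % Deep-copy the file set so copies keep independent state.
            dscopy = copyElement@matlab.mixin.Copyable(myds);
            dscopy.FileSet = copy(myds.FileSet);
        end
    end
end

function data = readSplit(fileInfo)
% Hypothetical helper: read one split of a file as raw bytes.
fid = fopen(char(fileInfo.FileName),'r');
closeFile = onCleanup(@() fclose(fid));
fseek(fid,fileInfo.Offset,'bof');
data = fread(fid,fileInfo.SplitSize,'*uint8');
end
```

Because this sketch uses a fixed split size rather than 'file', isfullfile returns false, which signals that a single read can return part of a file. For new code, prefer matlab.io.datastore.HadoopLocationBased, as noted at the top of this page.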
Version History
Introduced in R2017b
See Also
mapreduce | matlab.io.datastore.Partitionable | matlab.io.Datastore | matlab.io.datastore.DsFileSet | tall
Topics
- Add Support for Hadoop
- Use Tall Arrays on a Spark Cluster (Parallel Computing Toolbox)
- Big Data Workflow Using Tall Arrays and Datastores (Parallel Computing Toolbox)