Overview
Previously, job configuration files could only be loaded from and monitored in the local file system. That limitation has been lifted, and Gobblin can now also load job configuration files from other file systems. Users can submit .pull files through their preferred file system and specify that file system in the system configuration.
This page uses the Wikipedia example to show Gobblin-standalone working with job configuration files in HDFS.
How to submit a .pull file through HDFS
Here are the steps to change the system configuration:
- Set fs.uri to the HDFS URI that the .pull file will be submitted to.
- Use jobconf.fullyQualifiedPath to specify the fully qualified location where .pull files should be searched for (this replaces the previously used key jobconf.dir).
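As a minimal sketch, the two settings might look like the following in gobblin-standalone.properties. The host name, port, and directory are hypothetical placeholders and are not taken from the Wikipedia example; substitute the values for your own HDFS cluster.

```properties
# Hypothetical values: replace the host, port, and directory with your own.
# HDFS URI that the .pull file will be submitted to.
fs.uri=hdfs://namenode.example.com:8020
# Fully qualified location searched for .pull files (replaces jobconf.dir).
jobconf.fullyQualifiedPath=hdfs://namenode.example.com:8020/user/gobblin/jobconf
```

In this sketch the scheme and authority of jobconf.fullyQualifiedPath match fs.uri, so both settings point at the same file system.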
With these changes made to gobblin-standalone.properties, you can now submit the .pull file to the target file system path.