Introduction to Oozie

Oozie is a server-based workflow engine specialized in running workflow jobs whose actions run Hadoop Map/Reduce and Pig jobs. It is a Java web application that runs in a Java servlet container. Oozie workflow actions start jobs in remote systems (such as Hadoop and Pig). When an action completes, the remote system calls back Oozie to report the completion, and Oozie then proceeds to the next action in the workflow. It is an extensible, scalable, and data-aware service that can be used to orchestrate dependencies among jobs running on Hadoop.
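
The callback-driven flow described above can be illustrated with a small toy model. This is only a sketch of the idea, not Oozie's actual API or implementation: the class names ToyWorkflowEngine and ToyRemoteSystem, the action names, and the in-memory queue are all hypothetical stand-ins, and the workflow is assumed to be a simple linear sequence of actions.

```python
from collections import deque


class ToyRemoteSystem:
    """Hypothetical stand-in for a remote system such as Hadoop or Pig."""

    def __init__(self):
        self.queue = deque()

    def submit(self, name, callback):
        # Accept a job and remember whom to notify when it finishes.
        self.queue.append((name, callback))

    def run_all(self):
        # "Run" each job, then call back the engine to report completion.
        while self.queue:
            name, callback = self.queue.popleft()
            callback(name)


class ToyWorkflowEngine:
    """Toy model of callback-driven orchestration (not Oozie's real API)."""

    def __init__(self, actions):
        self.pending = deque(actions)  # actions not yet started, in order
        self.running = None            # action currently in the remote system
        self.completed = []            # actions reported as finished

    def start(self, remote):
        self._launch_next(remote)

    def _launch_next(self, remote):
        if not self.pending:
            self.running = None  # nothing left: the workflow is done
            return
        self.running = self.pending.popleft()
        remote.submit(
            self.running,
            callback=lambda name: self.on_complete(name, remote),
        )

    def on_complete(self, name, remote):
        # Invoked by the remote system; only now does the next action start.
        assert name == self.running
        self.completed.append(name)
        self._launch_next(remote)


remote = ToyRemoteSystem()
engine = ToyWorkflowEngine(["mapreduce-step", "pig-step"])
engine.start(remote)
remote.run_all()
print(engine.completed)  # ['mapreduce-step', 'pig-step']
```

The point of the sketch is that the engine never waits on or polls the remote system in this model: the second action is submitted only from inside the completion callback of the first, which mirrors the "call back, then proceed to the next action" behavior described in the paragraph.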