OOZIE: properties defined in file referenced in global job-xml not visible in workflow.xml

OK, you are making two big mistakes. 1. Let’s start with a quick exegesis of some parts of the Oozie documentation (V4.2) Workflow Functional Specification has a section 19 about Global Configuration has sections 3.2.x about core Action types i.e. MapReduce, Pig, Java, etc. the XML schema specification clearly shows the <global> element Sqoop action … Read more

Sqoop import : composite primary key and textual primary key

Specify split column manually. Split column is not necessarily equal to PK. You can have complex PK and some int Split column. You can specify any integer column or even simple function (some simple function like substring or cast, not aggregation or analytic). Split column preferably should be evenly distributed integer. For example if your … Read more

How to use Sqoop in Java Program?

You can run sqoop from inside your java code by including the sqoop jar in your classpath and calling the Sqoop.runTool() method. You would have to create the required parameters to sqoop programmatically as if it were the command line (e.g. –connect etc.). Please pay attention to the following: Make sure that the sqoop tool … Read more