I find the Hortonworks tutorials a rather good starting point:
http://hortonworks.com/tutorials/#tuts-developers
For a deeper dive, Tom White’s ‘Hadoop: The Definitive Guide’ is a must-read.
‘Hadoop in Practice’ offers many cookbook-style examples.