The CDK Examples project provides examples of how to use the CDK.
Each example is a standalone Maven module with associated documentation.
The easiest way to run the examples is on the Cloudera QuickStart VM, which has all the necessary Hadoop services pre-installed, configured, and running locally.
dataset shows how to create datasets and perform streaming writes and reads over them.
logging is an example of logging events from a command-line program to Hadoop via Flume, using log4j as the logging API.
logging-webapp is like logging, but the logging source is a webapp.
demo is a full end-to-end example of a webapp that logs events using Flume and performs session analysis using Crunch and Hive.