hive_stream

a function which is executed on each worker node. The
    so-called mapper typically maps input key/value pairs to a set of
    intermediate key/value pairs.

mapper

a function which is executed on each worker node. The
    so-called reducer reduces a set of intermediate values which share a
    key to a smaller set of values. If no reducer is used leave empty.

reducer

specifies the directory holding the data in the DFS.

input

specifies the output directory in the DFS containing the
    results after the streaming job finished.

output

henv

mapper_args

reducer_args

additional arguments passed as environment variables
    to distributed tasks.

cmdenv_arg

High-level functions for using Hadoop Streaming.

Hadoop InteractiVE, is an R extension facilitating
        distributed computing via the MapReduce paradigm. It provides
        an easy to use interface to Hadoop, the Hadoop Distributed File
        System (HDFS), and Hadoop Streaming.

hive_stream: Hadoop Streaming with hive

Description

Usage

Arguments

Details

References