博客
关于我
强烈建议你试试无所不能的chatGPT,快点击我
MapReduce Commands Guide
阅读量:4153 次
发布时间:2019-05-25

本文共 4954 字,大约阅读时间需要 16 分钟。

MapReduce Commands Guide

Overview

All mapreduce commands are invoked by the bin/mapred script. Running the mapred script without any arguments prints the description for all commands.

Usage: mapred [SHELL_OPTIONS] COMMAND [GENERIC_OPTIONS] [COMMAND_OPTIONS]

Hadoop has an option parsing framework that employs parsing generic options as well as running classes.

COMMAND_OPTIONS Description
SHELL_OPTIONS The common set of shell options. These are documented on the  page.
GENERIC_OPTIONS The common set of options supported by multiple commands. See the  for more information.
COMMAND COMMAND_OPTIONS Various commands with their options are described in the following sections. The commands have been grouped into  and.

User Commands

Commands useful for users of a hadoop cluster.

archive

Creates a hadoop archive. More information can be found at .

classpath

Prints the class path needed to get the Hadoop jar and the required libraries.

Usage: mapred classpath

distcp

Copy file or directories recursively. More information can be found at .

job

Command to interact with Map Reduce Jobs.

Usage: mapred job | [GENERIC_OPTIONS] | [-submit <job-file>] | [-status <job-id>] | [-counter <job-id> <group-name> <counter-name>] | [-kill <job-id>] | [-events <job-id> <from-event-#> <#-of-events>] | [-history [all] <jobOutputDir>] | [-list [all]] | [-kill-task <task-id>] | [-fail-task <task-id>] | [-set-priority <job-id> <priority>]

COMMAND_OPTION Description
-submit job-file Submits the job.
-status job-id Prints the map and reduce completion percentage and all job counters.
-counter job-id group-namecounter-name Prints the counter value.
-kill job-id Kills the job.
-events job-id from-event-# #-of-events Prints the events’ details received by jobtracker for the given range.
-history [all]jobOutputDir Prints job details, failed and killed tip details. More details about the job such as successful tasks and task attempts made for each task can be viewed by specifying the [all] option.
-list [all] Displays jobs which are yet to complete. -list all displays all jobs.
-kill-task task-id Kills the task. Killed tasks are NOT counted against failed attempts.
-fail-task task-id Fails the task. Failed tasks are counted against failed attempts.
-set-priority job-id priority Changes the priority of the job. Allowed priority values are VERY_HIGH, HIGH, NORMAL, LOW, VERY_LOW

pipes

Runs a pipes job.

Usage: mapred pipes [-conf <path>] [-jobconf <key=value>, <key=value>, ...] [-input <path>] [-output <path>] [-jar <jar file>] [-inputformat <class>] [-map <class>] [-partitioner <class>] [-reduce <class>] [-writer <class>] [-program <executable>] [-reduces <num>]

COMMAND_OPTION Description
-conf path Configuration for job
-jobconf key=valuekey=value, … Add/override configuration for job
-input path Input directory
-output path Output directory
-jar jar file Jar filename
-inputformat class InputFormat class
-map class Java Map class
-partitioner class Java Partitioner
-reduce class Java Reduce class
-writer class Java RecordWriter
-program executable Executable URI
-reduces num Number of reduces

queue

command to interact and view Job Queue information

Usage: mapred queue [-list] | [-info <job-queue-name> [-showJobs]] | [-showacls]

COMMAND_OPTION Description
-list Gets list of Job Queues configured in the system. Along with scheduling information associated with the job queues.
-info job-queue-name [-showJobs] Displays the job queue information and associated scheduling information of particular job queue. If -showJobs options is present a list of jobs submitted to the particular job queue is displayed.
-showacls Displays the queue name and associated queue operations allowed for the current user. The list consists of only those queues to which the user has access.

version

Prints the version.

Usage: mapred version

Administration Commands

Commands useful for administrators of a hadoop cluster.

historyserver

Start JobHistoryServer.

Usage: mapred historyserver

hsadmin

Runs a MapReduce hsadmin client for execute JobHistoryServer administrative commands.

Usage: mapred hsadmin [-refreshUserToGroupsMappings] | [-refreshSuperUserGroupsConfiguration] | [-refreshAdminAcls] | [-refreshLoadedJobCache] | [-refreshLogRetentionSettings] | [-refreshJobRetentionSettings] | [-getGroups [username]] | [-help [cmd]]

COMMAND_OPTION Description
-refreshUserToGroupsMappings Refresh user-to-groups mappings
-refreshSuperUserGroupsConfiguration Refresh superuser proxy groups mappings
-refreshAdminAcls Refresh acls for administration of Job history server
-refreshLoadedJobCache Refresh loaded job cache of Job history server
-refreshJobRetentionSettings Refresh job history period, job cleaner settings
-refreshLogRetentionSettings Refresh log retention period and log retention check interval
-getGroups [username] Get the groups which given user belongs to
-help [cmd] Displays help for the given command or all commands if none is specified.

转载地址:http://kxqti.baihongyu.com/

你可能感兴趣的文章
linux tool objdump
查看>>
linux tool nm
查看>>
字节对齐
查看>>
Pyston 0.4发布说明中文版
查看>>
iTunes10.5出现安装汇编microsoft.vc80.crt...失败的错误解决办法。
查看>>
用link方法安装Eclipse插件(Windows和Ubuntu都适用)
查看>>
Eclipse下配置CDT和MinGW终极方法。
查看>>
Java出现No enclosing instance of type E is accessible. Must qualify the allocation with an enclosing
查看>>
2011年11月编程语言排行榜:Objective-C有望成为2011年年度编程语言。
查看>>
Java学习笔记:前言(Java入门及推荐一本Java教材)
查看>>
Java学习笔记:基本类型、字符串及控制台输入输出
查看>>
Eclipse 4.x学习笔记(1、运行第一个Java程序)
查看>>
Java学习笔记:运算符
查看>>
Java学习笔记:字符串与类的初步
查看>>
Ubuntu下配置安装JDK1.7+Eclipse
查看>>
Java学习笔记:流程控制之分支语句
查看>>
Ubuntu11.10字体修改及gedit乱码处理
查看>>
Ubuntu11.10设置桌面启动器(桌面快捷方式)
查看>>
Ubuntu11.10安装Matlab2011b及相关问题解决
查看>>
找不到libexpat.so.0怎么办?
查看>>