彭志伟, 谷建华. 一种基于动态资源采集的Hadoop作业调度算法[J]. 微电子学与计算机, 2014, 31(4): 64-67.
引用本文: 彭志伟, 谷建华. 一种基于动态资源采集的Hadoop作业调度算法[J]. 微电子学与计算机, 2014, 31(4): 64-67.
PENG Zhi-wei, GU Jian-hua. A Scheduler for Hadoop Based on Dynamic Resource Collection[J]. Microelectronics & Computer, 2014, 31(4): 64-67.
Citation: PENG Zhi-wei, GU Jian-hua. A Scheduler for Hadoop Based on Dynamic Resource Collection[J]. Microelectronics & Computer, 2014, 31(4): 64-67.

一种基于动态资源采集的Hadoop作业调度算法

A Scheduler for Hadoop Based on Dynamic Resource Collection

  • 摘要: 为了克服Hadoop中基于槽的资源模型对集群资源的浪费,提高Hadoop集群的资源利用率,提出了一种基于动态资源采集的Hadoop作业调度算法.通过在作业运行过程中动态采集部分任务的CPU,内存和IO的资源利用率来评估同一作业中其他任务的资源需求,然后根据任务的实际资源需求以及TaskTracker节点的负载情况进行任务调度,以充分利用各TaskTracker节点的计算资源.通过对比实验表明该调度算法能明显提高集群的资源利用率,缩短作业的完成时间.

     

    Abstract: In order to overcome the problem of wasting of resources in hadoop cluser based on the slot resource model,a scheduler based on the dynamic resource collection has been proposed.According to collect theCPU,Memory and IO resource utilization of part of tasks in the hadoop job,We estimate the resource demand of the rest tasks in the job,and then schedule these tasks according to their resource demand and the load of every Tasktracker.The contrast experiment shows the scheduler we proposed can increase the resource utilization of the hadoop cluser and reduce the completion time of the Hadoop job.

     

/

返回文章
返回