"An approach to cluster system task flow monitoring, analysis, and visualization"
Adinetz A.V., Bryzgalov P.A., Voevodin Vad.V., Zhumatiy S.A., Nikitenko D.A.

Big cluster systems are spreading wide, so that the efficiency of use of such systems is a very actual task for now. In order to solve this task, it is needed to identify the efficiency problems appearing during the task execution, to notify users about appeared problems, and to suggest possible ways to resolve them. This can be achieved by the continuous monitoring of running tasks and by data analysis. This paper discusses an approach to solve these tasks and describes a working prototype.

Keywords: parallel computing, monitoring, tasks flow, computing cluster

Adinetz A.V.   e-mail: adinetz@gmail.com;   Bryzgalov P.A.   e-mail: pyotr777@guru.ru;   Voevodin Vad.V.   e-mail: vadim_voevodin@mail.ru;   Zhumatiy S.A.   e-mail: serg@parallel.ru;   Nikitenko D.A.   e-mail: dan@parallel.ru