Hadoop resource utilization and performance analysis

Teknoloji

17 Jun 2009

Recently, we’ve been taking a look at Hadoop performance on CMT machines.  (Quick summary: it can work great, once you set the system up properly.)  In the course of doing that, we’ve had to try a number of configurations and monitor the performance of each run.  A few details of that are included here — more detail later.

One part of Hadoop performance analysis is monitoring the task timeline — when tasks that correspond to different phases begin and end, how they overlap, and so on.  This is an example:

Hadoop task timeline

Another operation that’s useful is the monitor the utilization of various resources — cpu, network, disk — as Hadoop is running.  An example is this:

Hadoop resource utilization

This is just a teaser — in future entries, I’ll descript how to generate and analyze these sorts of graphs. 

Source/Kaynak : http://blogs.sun.com/jgebis/entry/hadoop_resource_utilization_and_performance

Comment Form

Content In Different Language


Recent Comments


  • dima: Guys, thank you very much it helped me a lot ! [...]
  • jitendra: i tried install ubuntu virtualbox 3.0 on my acer aspire 5610 laptop. i get message " dependency is [...]
  • Gurkan Erdogdu: You can also look at Apache Incubator JSR 299 implementation called OpenWebBeans. It has lots of sam [...]
  • Gurkan Erdogdu: You can also Apache Incubator JSR 299 implementation called OpenWebBeans. It has lots of samples sho [...]
  • vijayanand: Please share your experence getting TOGAF certified? What are the pre-requite eligibility... Basic [...]
  • Our Scores