Runtime Statistics - Key metrics

Last modified by Eva Torken on 2023/10/06 15:17

Please note that this microlearning is for the new monitoring stack only.

Assessing the health of your systems is critical in a stable and robust integration solution. One of the aspects of assessing the health of your systems is looking at the runtime statistics.In this microlearning, we will educate you on the Runtime Statistics overview within the Manage phase of eMagiz.

1. Prerequisites

  • Basic knowledge of the eMagiz platform
  • Understanding of the Runtime statistics Dashboard pages

2. Key concepts

This microlearning centers around interpreting runtime statistics. With interpreting we mean: Analyzing the patterns based on the information presented to you. With runtime statistics we mean: Various information elements on system (runtime) level that will help to understand the health of the runtime.

2.1 General interaction principles

All graphs in the Manage phase are created with the aim to provide an easy interaction for users to slice the information as needed. Here are some general guidelines for that interaction.

  • Runtime, Queue and HTTP metrics are send to the Manage phase every minute and that interval is runtime specific
  • Logs and error messages are send instantly to the Manage phase
  • The average value of the last minute of each metric is sent to the Manage phase
  • You can select the time dimension via the graph itself or via the top right menu. Other graphs will also respond to this selection
  • You can return to the main dashboard via the top right button called "Show Main Dashboard"
  • The main dashboard contains filter options to select specific runtimes or queues
      

3. Interpreting Runtime Statistics Key metrics

Assessing the health of your systems is critical in a stable and robust integration solution. One of the aspects of assessing the health of your systems is looking at the runtime statistics. Further to the Crash Course Microlearning around the Runtime Statistics Dashboard, this section will explain the Key metrics graphs.

3.1 Key Metric Graphs

When the runtime statistics are accessed, the first page shows a dashboard of the runtime statistics. An example is mentioned below.
 
intermediate-emagiz-monitoring-runtime-stats-keymetrics-1

3.2 Memory usage - Heap

The heap memory section displays the way in which message traffic is impacting heap memory. Heap memory is used for processing messages including transformation and transport of messages through channels. Whenever the heap memory is high in % used, it means the assigned heap memory might be insufficient for the message traffic.
 
intermediate-emagiz-monitoring-runtime-stats-keymetrics-2

3.3 CPU Usage

This graph shows the usage percentage of the CPU assigned to the process (in this case runtime). The CPU is relevant for eMagiz as that component processes the message transformation, transport etc. A high percentage indicates that the CPU is very busy handling the message traffic of that process, which indicates that other processes handled by that same CPU(s) might experience delays or issues. The graph also shows the System CPU usage - this is the CPU of the system where the runtime is running in (usally the docker instance of machine).
 
intermediate-emagiz-monitoring-runtime-stats-keymetrics-3
  

3.4 Log events

This graph shows the occurrences of log entries through time - the different colored lines represent the type of log entry (Log, Warning, etc.). Please make sure that you can see the colors of each type by scrolling down on the graph level. This information can be helpful to identify the timings and rhythym of log entries.
 
intermediate-emagiz-monitoring-runtime-stats-keymetrics-4
  

3.5 Threads

The graph shows the number of live threads that are currently running on your JVM (runtime). Each thread represents a task that is executed on that JVM. This could be the consumption of a message, the registration of a queue, etc. An ever-growing number of live threads indicates that there might be something wrong.
 
intermediate-emagiz-monitoring-runtime-stats-keymetrics-5

4. Key takeaways

  • There are 4 dashboard graphs that will help the more detailed analysis of a runtime(s)
  • Next to CPU and memory consumption, a first view of logs and threads can be helpful to asses the health of a runtime
  • The added value lies in combining the information from each of them into a coherent interpretation

5. Suggested Additional Readings

If you are interested in this topic and want more information on it please read the help text provided by eMagiz.