Our Service Catalog feature provides a list of all the services in your system, presenting the health of each service. The catalog displays service type, number of requests sent by the service, error rate, and P95 latency (that is, the frequency with which there was a delay in a request).
Step 1. In your navigation pane, click APM. Then select the Service Catalog tab.
Step 2. Select the timeframe for which you want to view information.
Step 3. Select a service to view the service drill-down.
The service drill-down provides you with detailed information about the specific service selected. It is split into two panes: a mini Service Map centered on the root service in the left pane and additional information about the services in the right pane.
The drill-down provides details of the service you are viewing, in addition to all of the information in the main service catalog page. It includes a second row of details that changes if you select a different service from the mini Service Map.
The additional information pane includes the following tabs:
The OVERVIEW tab provides an overview of the service.
The graphs shown in this tab display requests per second, errors, and average P95 latency for the service – all grouped by action for the timeframe selected in the top bar.
The SLI tab provides a view of the service level indicators (SLIs) for the service.
SLIs present the percentage of errors for a service and whether that amount is within the acceptable range of errors for that service. A percentage of errors higher than what is permitted amounts to an SLI breach.
Note: New SLIs take a minimum of seven days for their computation window to complete. Before this time, the SLI will show incomplete data.
To begin tracking SLIs, add a new SLI to the system:
STEP 1. Within the SLI tab of the Service Catalog, click + ADD NEW SLI.
STEP 2. Select SLI type: Error or Latency.
STEP 3. Enter a name and optional description for the SLI.
STEP 4. Select the filters and threshold for your SLI.
STEP 5. Select the SLO percentage and the period for which the SLI is valid.
For example, setting the parameters to 90% for a period of 7 days means that the SLI is valid as long as the error rate over each seven-day period is no higher than 90%.
The RESOURCES tab presents resources used by the service.
View CPU utilization, memory used (bytes), and network usage (bytes) for the timeframe selected in the top bar.
The ACTIONS tab presents the different actions taken by the service.
For each action, view action type, method, time consumed, percentage of errors caused by the action, and what percentage the action comprised of the total number of actions. These are all shown for the timeframe selected in the top bar.
View a deeper drill-down of each action by clicking on an action row.
The deep drill-down shows the time when the action occurred, what the action was, for which service the action was taken, the duration of the action, and how many errors it generated.
The LOGS tab presents all related logs for the selected service.
To open a new tab with the logs open in your Coralogix Explore Screen, click OPEN LOG QUERY in the upper right-hand corner of your screen.
To add correlation mapping to your logs, take the following steps:
STEP 1. Click SET UP CORRELATION in the upper right-hand corner of your screen.
STEP 2. Select the replacement logs key from the dropdown menu.
STEP 3. Click UPDATE CORRELATIONS.
Documentation | Application Performance Monitoring |
Need help?
Our world-class customer success team is available 24/7 to walk you through your setup and answer any questions that may come up.
Feel free to reach out to us via our in-app chat or by sending us an email at [email protected].