Quick Start Observability for Amazon EBS

thank you

Thank you!

We got your information.

Amazon EBS
Amazon EBS icon

Coralogix Extension For Amazon EBS Includes:

Dashboards - 1

Gain instantaneous visualization of all your Amazon EBS data.

Amazon EBS
Amazon EBS

Alerts - 9

Stay on top of Amazon EBS key performance metrics. Keep everyone in the know with integration with Slack, PagerDuty and more.

Low Burst balance

This alert is designed to monitor and manage the burst balance of Amazon EBS volumes, particularly for burstable volumes like General Purpose SSD (gp2/gp3) and Throughput Optimized HDD (st1). Burst balance indicates the available burst credits that allow these volumes to temporarily achieve higher performance levels. Low burst balance can signal that a volume is consistently running at high utilization, potentially leading to performance degradation once the burst credits are depleted. The alert is activated when the burst balance falls below a certain percentage, indicating that the volume is at risk of exhausting its burst credits and experiencing reduced performance. Customization Guidance: Threshold: The threshold for low burst balance can be set based on your application's performance requirements. For example, a common threshold might be set at 20%. Adjust this threshold according to the specific needs and performance characteristics of your workloads. Monitoring Period: The standard monitoring period is 10 minutes. This interval can be adjusted shorter or longer depending on how dynamic the burst balance patterns are and to ensure timely identification of issues without generating noise from normal fluctuations. Instance Specificity: Consider configuring the alert for different volumes based on their criticality and usage patterns. Critical production volumes might require more stringent monitoring compared to less critical environments. Notification Frequency: Consider the frequency of this alert to optimize the balance between responsiveness and noise. Adjust according to the criticality of the volume’s performance and the need for uninterrupted operation. Action: If this alert triggers, analyze the burst balance metrics and logs to determine the causes of low burst balance. Look for patterns such as specific times of day when burst balance drops or particular applications generating heavy loads. Review and optimize your read/write operations, consider provisioning additional IOPS or throughput, or switching to a different volume type if necessary. Implement caching strategies to reduce direct load on the EBS volumes. By closely monitoring burst balance, you can ensure that your applications continue to perform optimally and provide a smooth user experience.

High storage latency

This alert is designed to monitor and manage the storage latency of Amazon EBS volumes, which indicates how long it takes to complete read and write operations. High storage latency can negatively impact application performance and user experience, making it crucial to identify and address any underlying issues promptly. The alert is activated when the storage latency exceeds the usual threshold, indicating that read and/or write operations are taking longer than expected to complete. Customization Guidance: Threshold: The threshold for high storage latency should be set based on your application's performance requirements. For example: General Purpose SSD (gp2/gp3): > 5 milliseconds Provisioned IOPS SSD (io1/io2): > 2 milliseconds Throughput Optimized HDD (st1): > 15 milliseconds Cold HDD (sc1): > 100 milliseconds Adjust these thresholds based on your specific requirements and performance characteristics of your workloads. Monitoring Period: The standard monitoring period is 5 minutes. This interval can be adjusted shorter or longer depending on how dynamic the latency patterns are and to ensure timely identification of issues without generating noise from normal fluctuations. Instance Specificity: Consider configuring the alert for different volumes based on their criticality and usage patterns. Critical production volumes might require more stringent monitoring compared to less critical environments. Notification Frequency: Consider the frequency of this alert to optimize the balance between responsiveness and noise. Adjust according to the criticality of the volume’s performance and the need for uninterrupted operation. Action: If this alert triggers, analyze the storage latency metrics and logs to determine the common causes of high latency. Look for patterns such as specific times of day when latency spikes or particular applications generating heavy read/write loads. Review and optimize your read/write operations, consider provisioning additional IOPS or throughput, or scaling up the instance size if necessary. Implement caching strategies to reduce direct load on the EBS volumes. By closely monitoring storage latency, you can ensure that your applications continue to perform optimally and provide a smooth user experience.

High Write Latency

This alert is designed to monitor and manage the write latency of Amazon EBS volumes, which indicates how long it takes to complete write operations. High write latency can negatively impact application performance and user experience, making it crucial to identify and address any underlying issues promptly. The alert is activated when the average write latency exceeds 100 milliseconds over a specified period, indicating that write operations are taking longer than usual to complete. High write latency could be due to various factors, such as increased load on the volume, inefficient write patterns, or underlying infrastructure issues. Persistent high latency can lead to slower application performance and may require scaling or optimization efforts. Customization Guidance: Threshold: The default threshold is set at 100 milliseconds. Adjust this threshold based on your application's normal performance and tolerance for latency. For high-performance applications, a lower threshold may be necessary to ensure optimal performance. Instance Specificity: Consider configuring the alert for different volumes or sets of volumes based on their criticality and usage patterns. Critical production volumes might require more stringent monitoring compared to development environments. Notification Frequency: Consider the frequency of this alert to optimize the balance between responsiveness and noise. Adjust according to the criticality of the volume’s performance and the need for uninterrupted operation. Action: If this alert triggers, analyze the write latency metrics and logs to determine the common causes of high latency. Look for patterns such as specific times of day when latency spikes or particular applications generating heavy write loads. Review and optimize your write operations, consider provisioning additional IOPS, or scaling up the instance size if necessary. Implement caching strategies to reduce direct write load on the EBS volumes. By closely monitoring write latency, you can ensure that your applications continue to perform optimally and provide a smooth user experience.

High Read Latency

This alert is designed to monitor and manage the read latency of Amazon EBS volumes, which indicates how long it takes to complete read operations. High read latency can negatively impact application performance and user experience, making it crucial to identify and address any underlying issues promptly. The alert is activated when the average read latency exceeds 100 milliseconds over a specified period, indicating that read operations are taking longer than usual to complete. High read latency could be due to various factors, such as increased load on the volume, inefficient read patterns, or underlying infrastructure issues. Persistent high latency can lead to slower application performance and may require scaling or optimization efforts. Customization Guidance: Threshold: The default threshold is set at 100 milliseconds. Adjust this threshold based on your application's normal performance and tolerance for latency. For high-performance applications, a lower threshold may be necessary to ensure optimal performance. Instance Specificity: Consider configuring the alert for different volumes or sets of volumes based on their criticality and usage patterns. Critical production volumes might require more stringent monitoring compared to development environments. Notification Frequency: Consider the frequency of this alert to optimize the balance between responsiveness and noise. Adjust according to the criticality of the volume’s performance and the need for uninterrupted operation. Action: If this alert triggers, analyze the read latency metrics and logs to determine the common causes of high latency. Look for patterns such as specific times of day when latency spikes or particular applications generating heavy read loads. Review and optimize your read operations, consider provisioning additional IOPS, or scaling up the instance size if necessary. Implement caching strategies to reduce direct read load on the EBS volumes. By closely monitoring read latency, you can ensure that your applications continue to perform optimally and provide a smooth user experience.

Low Read Throughput

This alert is designed to monitor and manage the read throughput of Amazon EBS volumes, which indicates the rate at which data is read from the volumes. Low read throughput can negatively impact application performance and user experience, making it crucial to identify and address any underlying issues promptly. The alert is activated when the read throughput falls below the usual threshold, indicating that read operations are not performing as expected. Low read throughput could be due to various factors, such as increased load on the volume, inefficient read patterns, or underlying infrastructure issues. Persistent low throughput can lead to slower application performance and may require scaling or optimization efforts. Customization Guidance: Instance Specificity: Consider configuring the alert for different volumes or sets of volumes based on their criticality and usage patterns. Critical production volumes might require more stringent monitoring compared to development environments. Notification Frequency: Consider the frequency of this alert to optimize the balance between responsiveness and noise. Adjust according to the criticality of the volume’s performance and the need for uninterrupted operation. Action: If this alert triggers, analyze the read throughput metrics and logs to determine the common causes of low throughput. Look for patterns such as specific times of day when throughput drops or particular applications generating heavy read loads. Review and optimize your read operations, consider provisioning additional IOPS, or scaling up the instance size if necessary. Implement caching strategies to reduce direct read load on the EBS volumes. By closely monitoring read throughput, you can ensure that your applications continue to perform optimally and provide a smooth user experience.

Low Write Throughput

This alert is designed to monitor and manage the write throughput of Amazon EBS volumes, which indicates the rate at which data is written to the volumes. Low write throughput can negatively impact application performance and user experience, making it crucial to identify and address any underlying issues promptly. The alert is activated when the write throughput falls below the usual threshold, indicating that write operations are not performing as expected. Low write throughput could be due to various factors, such as increased load on the volume, inefficient write patterns, or underlying infrastructure issues. Persistent low throughput can lead to slower application performance and may require scaling or optimization efforts. Customization Guidance: Instance Specificity: Consider configuring the alert for different volumes or sets of volumes based on their criticality and usage patterns. Critical production volumes might require more stringent monitoring compared to development environments. Notification Frequency: Consider the frequency of this alert to optimize the balance between responsiveness and noise. Adjust according to the criticality of the volume’s performance and the need for uninterrupted operation. Action: If this alert triggers, analyze the write throughput metrics and logs to determine the common causes of low throughput. Look for patterns such as specific times of day when throughput drops or particular applications generating heavy write loads. Review and optimize your write operations, consider provisioning additional IOPS, or scaling up the instance size if necessary. Implement caching strategies to reduce direct write load on the EBS volumes. By closely monitoring write throughput, you can ensure that your applications continue to perform optimally and provide a smooth user experience.

Low Read Operations per Seconds

This alert is designed to monitor and manage the read operations per second (IOPS) of Amazon EBS volumes, which indicates the rate at which read operations are being performed. Low read IOPS can negatively impact application performance and user experience, making it crucial to identify and address any underlying issues promptly. The alert is activated when the read IOPS fall below the usual rate, indicating that read operations are not performing as expected. Low read IOPS could be due to various factors, such as decreased load on the volume, inefficient read patterns, or underlying infrastructure issues. Persistent low IOPS can lead to slower application performance and may require scaling or optimization efforts. Customization Guidance: Instance Specificity: Consider configuring the alert for different volumes or sets of volumes based on their criticality and usage patterns. Critical production volumes might require more stringent monitoring compared to development environments. Notification Frequency: Consider the frequency of this alert to optimize the balance between responsiveness and noise. Adjust according to the criticality of the volume’s performance and the need for uninterrupted operation. Action: If this alert triggers, analyze the read IOPS metrics and logs to determine the common causes of low IOPS. Look for patterns such as specific times of day when IOPS drop or particular applications generating heavy read loads. Review and optimize your read operations, consider provisioning additional IOPS, or scaling up the instance size if necessary. Implement caching strategies to reduce direct read load on the EBS volumes. By closely monitoring read IOPS, you can ensure that your applications continue to perform optimally and provide a smooth user experience

Low Write Operations per Seconds

This alert is designed to monitor and manage the write operations per second (IOPS) of Amazon EBS volumes, which indicates the rate at which write operations are being performed. Low write IOPS can negatively impact application performance and user experience, making it crucial to identify and address any underlying issues promptly. The alert is activated when the write IOPS fall below the usual rate, indicating that write operations are not performing as expected. Low write IOPS could be due to various factors, such as decreased load on the volume, inefficient write patterns, or underlying infrastructure issues. Persistent low IOPS can lead to slower application performance and may require scaling or optimization efforts. Customization Guidance: Instance Specificity: Consider configuring the alert for different volumes or sets of volumes based on their criticality and usage patterns. Critical production volumes might require more stringent monitoring compared to development environments. Notification Frequency: Consider the frequency of this alert to optimize the balance between responsiveness and noise. Adjust according to the criticality of the volume’s performance and the need for uninterrupted operation. Action: If this alert triggers, analyze the write IOPS metrics and logs to determine the common causes of low IOPS. Look for patterns such as specific times of day when IOPS drop or particular applications generating heavy write loads. Review and optimize your write operations, consider provisioning additional IOPS, or scaling up the instance size if necessary. Implement caching strategies to reduce direct write load on the EBS volumes. By closely monitoring write IOPS, you can ensure that your applications continue to perform optimally and provide a smooth user experience.

High Queue Length

This alert is designed to monitor and manage the queue length of Amazon EBS volumes, which indicates the number of pending I/O operations for a volume. A high queue length can signal that the volume is experiencing high I/O demand that exceeds its capacity, potentially leading to increased latency and degraded performance for applications relying on the volume. The alert is activated when the queue length exceeds the usual threshold, indicating that the volume is struggling to keep up with the I/O operations being requested. Customization Guidance: Threshold: The threshold for high queue length should be set based on your application's performance requirements and typical I/O patterns. A common threshold might be set at 10 pending operations. Adjust this threshold according to the specific needs and performance characteristics of your workloads. Instance Specificity: Consider configuring the alert for different volumes based on their criticality and usage patterns. Critical production volumes might require more stringent monitoring compared to less critical environments. Notification Frequency: Consider the frequency of this alert to optimize the balance between responsiveness and noise. Adjust according to the criticality of the volume’s performance and the need for uninterrupted operation. Action: If this alert triggers, analyze the queue length metrics and logs to determine the common causes of high queue length. Look for patterns such as specific times of day when queue length spikes or particular applications generating heavy I/O loads. Review and optimize your I/O operations, consider provisioning additional IOPS or throughput, or scaling up the instance size if necessary. Implement caching strategies to reduce direct load on the EBS volumes. By closely monitoring queue length, you can ensure that your applications continue to perform optimally and provide a smooth user experience.

Integration

Learn more about Coralogix's out-of-the-box integration with Amazon EBS in our documentation.

Read More
Schedule Demo

Enterprise-Grade Solution