Metrics & Monitoring

SmartReader provides comprehensive metrics and monitoring capabilities through a Prometheus-compatible endpoint. This feature enables real-time monitoring of system health, performance, and operational status, making it easy to integrate with monitoring dashboards and alerting systems.

Overview

Metrics & Monitoring provides:

System Metrics: CPU usage, memory consumption, disk usage, and network statistics
Application Metrics: Application uptime, process information, and performance data
Service Metrics: MQTT connection status, TCP socket clients, HTTP POST status, and more
Prometheus Format: Standard Prometheus text format for easy integration
Real-time Updates: Metrics are updated continuously and available on demand

Accessing Metrics

The metrics endpoint is available at /metrics and returns data in Prometheus text format:

curl -H "Authorization: Basic YWRtaW46YWRtaW4=" \
  https://<reader-ip>:8443/metrics

Response Format:

# HELP metric_name Auto-generated metric
# TYPE metric_name gauge
metric_name value

Available Metrics

System Metrics

These metrics are provided by the MetricsMonitoringService:

Metric

Description

Unit

metricsmonitoringservice_system_cpu_usage____

Current CPU usage percentage

Percentage (0-100)

metricsmonitoringservice_system_memory_usage__mb__

Current memory usage

Megabytes

metricsmonitoringservice_system_disk_usage____

Disk usage percentage

Percentage (0-100)

metricsmonitoringservice_system_os_uptime__seconds__

Operating system uptime

Seconds

metricsmonitoringservice_system_application_uptime__seconds__

Application uptime

Seconds

metricsmonitoringservice_system_cpu_temperature____c__

CPU temperature

Celsius

metricsmonitoringservice_system_cpu_max_allowed_temp____c__

Maximum allowed CPU temperature

Celsius

metricsmonitoringservice_system_cpu_max_recorded_temp____c__

Maximum recorded CPU temperature

Celsius

metricsmonitoringservice_system_min_memory_used__mb__

Minimum memory used since startup

Megabytes

metricsmonitoringservice_system_max_memory_used__mb__

Maximum memory used since startup

Megabytes

metricsmonitoringservice_system_min_cpu_used____

Minimum CPU usage since startup

Percentage

metricsmonitoringservice_system_max_cpu_used____

Maximum CPU usage since startup

Percentage

metricsmonitoringservice_system_network_rx_bytes

Network bytes received

Bytes

metricsmonitoringservice_system_network_tx_bytes

Network bytes transmitted

Bytes

Service Metrics

Additional metrics are provided by various services:

MQTT Service Metrics

Metric

Description

Unit

mqttservice_mqtt_connected

MQTT connection status

Boolean (0 or 1)

mqttservice_messages_pending

Number of pending MQTT messages

Count

TCP Socket Service Metrics

Metric

Description

Unit

tcpsocketservice_socket_server_healthy

TCP socket server health status

Boolean (0 or 1)

tcpsocketservice_connected_clients

Number of connected TCP clients

Count

WebSocket Service Metrics

Metric

Description

Unit

websocketservice_websocket_active_clients

Number of active WebSocket clients

Count

HTTP Event Publisher Metrics

Metric

Description

Unit

httpeventpublisherservice_last_http_post_status

Last HTTP POST response status code

HTTP status code

httpeventpublisherservice_http_post_enabled

HTTP POST enabled status

Boolean (0 or 1)

gRPC Service Metrics

Metric

Description

Unit

grpcservice_grpc_enabled

gRPC enabled status

Boolean (0 or 1)

Metric Naming Convention

Metric names are automatically generated using the following rules:

Provider Name: The .NET type name of the metric provider (e.g., MetricsMonitoringService)
Original Key: The original metric key from the provider (e.g., System.CPU Usage (%))
Transformation:
- Converted to lowercase
- Non-alphanumeric characters replaced with underscores
- Multiple underscores may appear (e.g., System.CPU Usage (%) becomes system_cpu_usage____)

Example:

Provider: MetricsMonitoringService
Original Key: System.CPU Usage (%)
Final Metric Name: metricsmonitoringservice_system_cpu_usage____

Metric Types

All metrics are exported as gauge types in Prometheus format:

Gauge: A metric that represents a single numerical value that can go up and down
Boolean Values: Converted to 0 (false) or 1 (true) for compatibility

Example Metrics Output

# HELP metricsmonitoringservice_system_cpu_usage____ Auto-generated metric
# TYPE metricsmonitoringservice_system_cpu_usage____ gauge
metricsmonitoringservice_system_cpu_usage____ 37.42

# HELP metricsmonitoringservice_system_memory_usage__mb__ Auto-generated metric
# TYPE metricsmonitoringservice_system_memory_usage__mb__ gauge
metricsmonitoringservice_system_memory_usage__mb__ 612

# HELP metricsmonitoringservice_system_network_rx_bytes Auto-generated metric
# TYPE metricsmonitoringservice_system_network_rx_bytes gauge
metricsmonitoringservice_system_network_rx_bytes 18543219

# HELP metricsmonitoringservice_system_application_uptime__seconds__ Auto-generated metric
# TYPE metricsmonitoringservice_system_application_uptime__seconds__ gauge
metricsmonitoringservice_system_application_uptime__seconds__ 86423

# HELP tcpsocketservice_socket_server_healthy Auto-generated metric
# TYPE tcpsocketservice_socket_server_healthy gauge
tcpsocketservice_socket_server_healthy 1

# HELP tcpsocketservice_connected_clients Auto-generated metric
# TYPE tcpsocketservice_connected_clients gauge
tcpsocketservice_connected_clients 3

# HELP mqttservice_mqtt_connected Auto-generated metric
# TYPE mqttservice_mqtt_connected gauge
mqttservice_mqtt_connected 1

# HELP mqttservice_messages_pending Auto-generated metric
# TYPE mqttservice_messages_pending gauge
mqttservice_messages_pending 4

# HELP websocketservice_websocket_active_clients Auto-generated metric
# TYPE websocketservice_websocket_active_clients gauge
websocketservice_websocket_active_clients 5

# HELP httpeventpublisherservice_last_http_post_status Auto-generated metric
# TYPE httpeventpublisherservice_last_http_post_status gauge
httpeventpublisherservice_last_http_post_status 202

# HELP httpeventpublisherservice_http_post_enabled Auto-generated metric
# TYPE httpeventpublisherservice_http_post_enabled gauge
httpeventpublisherservice_http_post_enabled 0

# HELP grpcservice_grpc_enabled Auto-generated metric
# TYPE grpcservice_grpc_enabled gauge
grpcservice_grpc_enabled 1

Integration with Prometheus

Prometheus Configuration

Add the SmartReader metrics endpoint to your prometheus.yml:

scrape_configs:
  - job_name: 'smartreader'
    scrape_interval: 30s
    basic_auth:
      username: 'admin'
      password: 'admin'
    static_configs:
      - targets: ['<reader-ip>:8443']
        labels:
          reader: 'R700-01'
          location: 'Warehouse-A'

Grafana Dashboard

Create a Grafana dashboard to visualize the metrics:

Add Prometheus Data Source: Configure Prometheus as a data source in Grafana
Create Dashboard: Create a new dashboard with panels for:
- CPU Usage over time
- Memory Usage over time
- Network traffic (RX/TX)
- Application uptime
- Service health status (MQTT, TCP, HTTP, etc.)
- Connected clients

Example Query:

metricsmonitoringservice_system_cpu_usage____

Monitoring Best Practices

Scrape Interval: Set an appropriate scrape interval (e.g., 30 seconds) based on your needs
Alerting Rules: Create Prometheus alerting rules for:
- High CPU usage (> 80%)
- High memory usage (> 90%)
- Service disconnections (MQTT, TCP, etc.)
- Application crashes (uptime resets)
Dashboard Organization: Organize dashboards by:
- System health (CPU, memory, disk)
- Service status (MQTT, TCP, HTTP)
- Network performance
- Application metrics
Retention: Configure appropriate metric retention in Prometheus
Authentication: Always use authentication when exposing metrics endpoints

Metric Collection

Metrics are collected:

Continuously: System metrics are updated every 10 seconds by the MetricsMonitoringService
On Demand: Service metrics are collected when the /metrics endpoint is called
Real-time: All metrics reflect current system state

Error Handling

If metric collection fails:

The endpoint returns 200 OK with # Metrics unavailable
Errors are logged but don't affect application operation
Individual metric providers may fail independently without affecting others

Configuration

Metrics collection is configured in appsettings.json:

{
  "MetricsChannelCapacity": 1000
}

MetricsChannelCapacity: Maximum number of metrics that can be queued (default: 1000)

Troubleshooting

No Metrics Available

Check Authentication: Ensure you're providing valid Basic Auth credentials
Check Endpoint: Verify the /metrics endpoint is accessible
Review Logs: Check application logs for metric collection errors
Verify Services: Ensure metric providers are registered and running

Missing Service Metrics

Service Status: Verify the service is enabled and running
Provider Registration: Check that the service implements IMetricProvider
Service Configuration: Ensure the service is properly configured

Inconsistent Metric Names

Metric names are auto-generated from provider type names and metric keys
Names may change if provider types are renamed
Use Prometheus label selectors for more flexible querying

API Reference

For complete API documentation, see the REST API documentation.

REST API - Complete API reference including metrics endpoint
Logging Control - Manage logging levels for troubleshooting
Health Checks - Application health check endpoints

PreviousConfiguration Provisioning NextData Output Options

Last updated 21 days ago

hashtagOverview

hashtagAccessing Metrics

hashtagAvailable Metrics

hashtagSystem Metrics

hashtagService Metrics

hashtagMQTT Service Metrics

hashtagTCP Socket Service Metrics

hashtagWebSocket Service Metrics

hashtagHTTP Event Publisher Metrics

hashtaggRPC Service Metrics

hashtagMetric Naming Convention

hashtagMetric Types

hashtagExample Metrics Output

hashtagIntegration with Prometheus

hashtagPrometheus Configuration

hashtagGrafana Dashboard

hashtagMonitoring Best Practices

hashtagMetric Collection

hashtagError Handling

hashtagConfiguration

hashtagTroubleshooting

hashtagNo Metrics Available

hashtagMissing Service Metrics

hashtagInconsistent Metric Names

hashtagAPI Reference

hashtagRelated Features

Overview

Accessing Metrics

Available Metrics

System Metrics

Service Metrics

MQTT Service Metrics

TCP Socket Service Metrics

WebSocket Service Metrics

HTTP Event Publisher Metrics

gRPC Service Metrics

Metric Naming Convention

Metric Types

Example Metrics Output

Integration with Prometheus

Prometheus Configuration

Grafana Dashboard

Monitoring Best Practices

Metric Collection

Error Handling

Configuration

Troubleshooting

No Metrics Available

Missing Service Metrics

Inconsistent Metric Names

API Reference

Related Features