Monitoring
This guide covers the monitoring capabilities of the Flowise service for tracking system health, performance, and logs.Monitoring Overview
Flowise provides comprehensive monitoring capabilities to help you ensure the reliability, performance, and availability of your AI workflows. These monitoring tools enable you to track system health, identify issues, and optimize resource usage.Monitoring Endpoints
Health Check
Get the current health status of the Flowise service.System Metrics
Get detailed system metrics for the Flowise service.Logs Retrieval
Retrieve application logs.Configure Logging
Configure logging settings.Alert Configuration
Configure alert settings.Monitoring Parameters
Logs Parameters
| Parameter | Type | Description | Default |
|---|---|---|---|
| limit | number | Maximum number of logs to return | 100 |
| offset | number | Offset for pagination | 0 |
| level | string | Log level (debug, info, warn, error, critical) | info |
| startDate | string | Start date in ISO format | 24 hours ago |
| endDate | string | End date in ISO format | Current time |
| chatflowId | string | Filter logs by chatflow ID | All chatflows |
| search | string | Search term within log messages | None |
Alert Thresholds
| Threshold | Type | Description | Default |
|---|---|---|---|
| cpuUsage | number | CPU usage percentage threshold | 90 |
| memoryUsage | number | Memory usage percentage threshold | 85 |
| errorRate | number | Error rate percentage threshold | 5 |
| responseTime | number | Average response time threshold in ms | 2000 |
| diskUsage | number | Disk usage percentage threshold | 85 |
| concurrentRequests | number | Concurrent requests threshold | 100 |
Alert Event Types
| Event Type | Description |
|---|---|
| system.warning | System warning events |
| system.error | System error events |
| system.critical | System critical events |
| chatflow.failure | Chatflow execution failures |
| chatflow.timeout | Chatflow execution timeouts |
| resource.low | Resource usage approaching thresholds |
| security.violation | Security policy violations |
Usage Examples
Basic Health Monitoring
Performance Monitoring Dashboard
Error Log Analysis
Integration with Monitoring Tools
Prometheus Integration
ELK Stack Integration
Custom Webhook Integration
Best Practices
-
Health Monitoring Strategy
- Implement regular health checks
- Set up automated alerting for unhealthy status
- Monitor key system resources
- Establish baseline performance metrics
-
Log Management
- Configure appropriate log levels
- Implement log rotation to manage storage
- Centralize logs for easier analysis
- Use structured logging format (JSON)
-
Alert Configuration
- Define meaningful alert thresholds
- Avoid alert fatigue with proper prioritization
- Configure multiple notification channels
- Implement escalation procedures for critical issues
-
Performance Optimization
- Monitor response time trends
- Track resource utilization patterns
- Identify and address performance bottlenecks
- Implement resource-based scaling