Files
Pulse/docs
rcourtman a1dc451ed4 Document alert reliability features and DLQ API
Add comprehensive documentation for new alert system reliability features:

**API Documentation (docs/API.md):**
- Dead Letter Queue (DLQ) API endpoints
  - GET /api/notifications/dlq - Retrieve failed notifications
  - GET /api/notifications/queue/stats - Queue statistics
  - POST /api/notifications/dlq/retry - Retry DLQ items
  - POST /api/notifications/dlq/delete - Delete DLQ items
- Prometheus metrics endpoint documentation
  - 18 metrics covering alerts, notifications, and queue health
  - Example Prometheus configuration
  - Example PromQL queries for common monitoring scenarios

**Configuration Documentation (docs/CONFIGURATION.md):**
- Alert TTL configuration
  - maxAlertAgeDays, maxAcknowledgedAgeDays, autoAcknowledgeAfterHours
- Flapping detection configuration
  - flappingEnabled, flappingWindowSeconds, flappingThreshold, flappingCooldownMinutes
- Usage examples and common scenarios
- Best practices for preventing notification storms

All new features are fully documented with examples and default values.
2025-11-06 17:34:05 +00:00
..

Pulse Documentation Index

Use this index to navigate the documentation bundled with the repository. Each section groups related guides so you can jump straight to the material you need.


Getting Started

  • INSTALL.md Installation guide covering script, Docker, and Helm paths.
  • FAQ.md Common questions and troubleshooting quick answers.
  • MIGRATION.md Export/import process for moving between hosts.
  • DEV-QUICK-START.md Hot reload workflow for local development.

Deployment Guides

Operations & Monitoring

Security

Reference

Development & Contribution

Have an idea for a new guide? Update this index when you add documentation so discoverability stays high.