1747365d0d
feat(metrics): add backup_type label and PITR metrics
...
CI/CD / Test (push) Successful in 1m54s
CI/CD / Lint (push) Successful in 1m47s
CI/CD / Integration Tests (push) Successful in 1m28s
CI/CD / Build & Release (push) Successful in 10m57s
- Add backup_type label (full/incremental/pitr_base) to core metrics
- Add new dbbackup_backup_by_type metric for backup type distribution
- Add complete PITR metrics: pitr_enabled, pitr_archive_lag_seconds,
pitr_chain_valid, pitr_gap_count, pitr_recovery_window_minutes
- Add PITR-specific alerting rules for archive lag and chain integrity
- Update METRICS.md and EXPORTER.md documentation
- Bump version to 4.1.0
2026-01-27 14:44:27 +01:00
7b5aafbb02
feat(monitoring): enhance Grafana dashboard and add alerting rules
...
- Add dashboard description and panel descriptions for all 17 panels
- Add new Verification Status panel using dbbackup_backup_verified metric
- Add collapsible Backup Overview row for better organization
- Enable shared crosshair (graphTooltip=1) for correlated analysis
- Fix overlapping dedup panel positions (y: 31/36 → 22/27/32)
- Adjust top row panel widths for better balance (5+5+5+4+5=24)
- Add 'monitoring' tag for dashboard discovery
- Add grafana/alerting-rules.yaml with 9 Prometheus alerts:
- DBBackupRPOCritical, DBBackupRPOWarning, DBBackupFailure
- DBBackupNotVerified, DBBackupDedupRatioLow, DBBackupDedupDiskGrowth
- DBBackupExporterDown, DBBackupMetricsStale, DBBackupNeverSucceeded
- Add Makefile for streamlined development workflow
- Add logger tests and optimizations (buffer pooling, early level check)
- Fix deprecated netErr.Temporary() call (Go 1.18+)
- Fix staticcheck warnings for redundant fmt.Sprintf calls
- Clone engine now validates disk space before operations
2026-01-24 13:18:59 +01:00