dbbackup

Author	SHA1	Message	Date
Alexander Renz	25162b58d1	Add missing dedup Prometheus metrics New metrics: - dbbackup_dedup_compression_ratio (separate from dedup ratio) - dbbackup_dedup_oldest_chunk_timestamp (retention monitoring) - dbbackup_dedup_newest_chunk_timestamp - dbbackup_dedup_database_total_bytes (per-db logical size) - dbbackup_dedup_database_stored_bytes (per-db actual storage)	2026-01-23 14:46:48 +01:00
Alexander Renz	d353f1317a	Fix Ctrl+C responsiveness during globals backup backupGlobals() used cmd.Output() which blocks until completion even when context is cancelled. Changed to Start/Wait pattern with proper context handling for immediate Ctrl+C response.	2026-01-23 14:25:22 +01:00
Alexander Renz	25c4bf82f7	Use pgzip for parallel cluster restore decompression Replace compress/gzip with github.com/klauspost/pgzip in: - internal/restore/extract.go - internal/restore/diagnose.go This enables multi-threaded gzip decompression for faster cluster backup extraction on multi-core systems.	2026-01-23 14:17:34 +01:00
Alexander Renz	5b75512bf8	v3.42.100: Fix dedup CIFS/NFS mkdir visibility lag The actual bug: MkdirAll returns success on CIFS/NFS but the directory isn't immediately visible for file operations. Fix: - Verify directory exists with os.Stat after MkdirAll - Retry loop (5 attempts, 20ms delay) until directory is visible - Add write retry loop with re-mkdir on failure - Keep rename retry as fallback	2026-01-23 13:27:36 +01:00
Alexander Renz	63b7b07da9	Remove internal docs (GARANTIE, LEGAL_DOCUMENTATION, OPENSOURCE_ALTERNATIVE)	2026-01-23 13:18:10 +01:00
Alexander Renz	17d447900f	v3.42.99: Fix dedup CIFS/SMB rename bug On network filesystems (CIFS/SMB), atomic renames can fail with 'no such file or directory' due to stale directory caches. Fix: - Add MkdirAll before rename to refresh directory cache - Retry rename up to 3 times with 10ms delay - Re-ensure directory exists on each retry attempt	2026-01-23 13:07:53 +01:00
Alexander Renz	46950cdcf6	Reorganize docs: move to docs/, clean up obsolete files Moved to docs/: - AZURE.md, GCS.md, CLOUD.md (cloud storage) - PITR.md, MYSQL_PITR.md (point-in-time recovery) - ENGINES.md, DOCKER.md, SYSTEMD.md (deployment) - RESTORE_PROFILES.md, LOCK_DEBUGGING.md (troubleshooting) - LEGAL_DOCUMENTATION.md, GARANTIE.md, OPENSOURCE_ALTERNATIVE.md Removed obsolete: - RELEASE_85_FALLBACK.md - release-notes-v3.42.77.md - CODE_FLOW_PROOF.md - RESTORE_PROGRESS_PROPOSAL.md - RELEASE_NOTES.md (superseded by CHANGELOG.md) Root now has only: README, QUICK, CHANGELOG, CONTRIBUTING, SECURITY, LICENSE	2026-01-23 13:00:36 +01:00
Alexander Renz	7703f35696	Remove email_infra_team.txt	2026-01-23 12:58:31 +01:00
Alexander Renz	85ee8b2783	Add QUICK.md - real-world examples cheat sheet	2026-01-23 12:57:15 +01:00
Alexander Renz	3934417d67	v3.42.98: Fix CGO/SQLite and MySQL db name bugs FIXES: - Switch from mattn/go-sqlite3 (CGO) to modernc.org/sqlite (pure Go) Binaries compiled with CGO_ENABLED=0 now work correctly - Fix MySQL positional database argument being ignored 'dbbackup backup single gitea --db-type mysql' now uses 'gitea' correctly	2026-01-23 12:11:30 +01:00
Alexander Renz	c82f1d8234	v3.42.97: Add bandwidth throttling for cloud uploads Feature requested by DBA: Limit upload/download speed during business hours. - New --bandwidth-limit flag for cloud operations (S3, GCS, Azure, MinIO, B2) - Supports human-readable formats: 10MB/s, 50MiB/s, 100Mbps, unlimited - Environment variable: DBBACKUP_BANDWIDTH_LIMIT - Token-bucket style throttling with 100ms windows for smooth limiting - Reduces multipart concurrency when throttled for better rate control - Unit tests for parsing and throttle behavior	2026-01-23 11:27:45 +01:00
Alexander Renz	4e2ea9c7b2	fix: FreeBSD build - int64/uint64 type mismatch in statfs - tmpfs.go: Convert stat.Blocks/Bavail/Bfree to int64 for cross-platform math - large_db_guard.go: Same fix for disk space calculation - FreeBSD uses int64 for these fields, Linux uses uint64	2026-01-23 11:15:58 +01:00
Alexander Renz	342cccecec	v3.42.96: Complete elimination of shell tar/gzip dependencies - Remove ALL remaining exec.Command tar/gzip/gunzip calls from internal code - diagnose.go: Replace 'tar -tzf' test with direct file open check - large_restore_check.go: Replace 'gzip -t' and 'gzip -l' with in-process pgzip verification - pitr/restore.go: Replace 'tar -xf' with in-process archive/tar extraction - All backup/restore operations now 100% in-process using github.com/klauspost/pgzip - Benefits: No external tool dependencies, 2-4x faster on multi-core, reliable error handling - Note: Docker drill container commands still use gunzip for in-container ops (intentional)	2026-01-23 10:44:52 +01:00
Alexander Renz	eeff783915	perf: use in-process pgzip for MySQL streaming backup - Add fs.NewParallelGzipWriter() for streaming compression - Replace shell gzip with pgzip in executeMySQLWithCompression() - Replace shell gzip with pgzip in executeMySQLWithProgressAndCompression() - No external gzip binary dependency for MySQL backups - 2-4x faster compression on multi-core systems	2026-01-23 10:30:18 +01:00
Alexander Renz	4210fd8c90	perf: use in-process parallel compression for backup - Add fs.CreateTarGzParallel() using pgzip for archive creation - Replace shell tar/pigz with in-process parallel compression - 2-4x faster compression on multi-core systems - No external process dependencies (tar, pigz not required) - Matches parallel extraction already in place - Both backup and restore now use pgzip for maximum performance	2026-01-23 10:24:48 +01:00
Alexander Renz	474293e9c5	refactor: use parallel tar.gz extraction everywhere - Replace shell 'tar -xzf' with fs.ExtractTarGzParallel() in engine.go - Replace shell 'tar -xzf' with fs.ExtractTarGzParallel() in diagnose.go - All extraction now uses pgzip with runtime.NumCPU() cores - 2-4x faster extraction on multi-core systems - Includes path traversal protection and secure permissions	2026-01-23 10:13:35 +01:00
Alexander Renz	e8175e9b3b	perf: parallel tar.gz extraction using pgzip (2-4x faster) - Added github.com/klauspost/pgzip for parallel gzip decompression - New fs.ExtractTarGzParallel() uses all CPU cores - Replaced shell 'tar -xzf' with pure Go parallel extraction - Security: path traversal protection, symlink validation - Secure permissions: 0700 for directories, 0600 for files - Progress callback for extraction monitoring Performance on multi-core systems: - 4 cores: ~2x faster than standard gzip - 8 cores: ~3x faster - 16 cores: ~4x faster Applied to: - Cluster restore (safety.go) - PITR restore (restore.go)	2026-01-23 10:06:56 +01:00
Alexander Renz	5af2d25856	feat: expert panel improvements - security, performance, reliability 🔴 HIGH PRIORITY FIXES: - Fix goroutine leak: semaphore acquisition now context-aware (prevents hang on cancel) - Incremental lock boosting: 2048→4096→8192→16384→32768→65536 based on BLOB count (no longer jumps straight to 65536 which uses too much shared memory) 🟡 MEDIUM PRIORITY: - Resume capability: RestoreCheckpoint tracks completed/failed DBs for --resume - Secure temp files: 0700 permissions prevent other users reading dump contents - SecureMkdirTemp() and SecureWriteFile() utilities in fs package 🟢 LOW PRIORITY: - PostgreSQL checkpoint tuning: checkpoint_timeout=30min, checkpoint_completion_target=0.9 - Added checkpoint_timeout and checkpoint_completion_target to RevertPostgresSettings() Security improvements: - Temp extraction directories now use 0700 (owner-only) - Checkpoint files use 0600 permissions	2026-01-23 09:58:52 +01:00
Alexander Renz	81472e464f	fix(lint): avoid copying mutex in GetSnapshot - use ProgressSnapshot struct - Created ProgressSnapshot struct without sync.RWMutex - GetSnapshot() now returns ProgressSnapshot instead of UnifiedClusterProgress - Fixes govet copylocks error	2026-01-23 09:48:27 +01:00
Alexander Renz	28e0bac13b	feat(tui): 3-way work directory toggle with clear visual indicators - Press 'w' cycles: SYSTEM → CONFIG → BACKUP → SYSTEM - Clear labels: [SYS] SYSTEM TEMP, [CFG] CONFIG, [BKP] BACKUP DIR - Shows actual path for each option - Warning only shown when using /tmp (space issues) - build_all.sh: reduced to 5 platforms (Linux/macOS only)	2026-01-23 09:44:33 +01:00
Alexander Renz	0afbdfb655	feat(progress): add UnifiedClusterProgress for combined backup/restore progress - Single unified progress tracker replaces 3 separate callbacks - Phase-based weighting: Extract(20%), Globals(5%), Databases(70%), Verify(5%) - Real-time ETA calculation based on completion rate - Per-database progress with byte-level tracking - Thread-safe with mutex protection - FormatStatus() and FormatBar() for display - GetSnapshot() for safe state copying - Full test coverage including thread safety Example output: [67%] DB 12/18: orders_db (2.4 GB / 3.1 GB) \| Elapsed: 34m12s ETA: 17m30s [██████████████████████████████░░░░░░░░░░░░] 67%	2026-01-23 09:31:48 +01:00
Alexander Renz	f1da65d099	fix(ci): add --db-type postgres --no-config to verify-locks test	2026-01-23 09:26:26 +01:00
Alexander Renz	3963a6eeba	feat: streaming BLOB detection + MySQL restore tuning (no memory explosion) Critical improvements: - StreamCountBLOBs() - streams pg_restore -l output line by line - StreamAnalyzeDump() - analyze dumps without loading into memory - detectLargeObjects() now uses streaming (was: cmd.Output() into memory) - TuneMySQLForRestore() - disable sync, constraints for fast restore - RevertMySQLSettings() - restore safe defaults after restore For 119GB restore: prevents OOM during dump analysis phase	2026-01-23 09:25:39 +01:00
Alexander Renz	2ddf3fa5ab	fix(ci): add --database testdb for MySQL connection	2026-01-23 09:17:17 +01:00
Alexander Renz	bdede4ae6f	fix(ci): add --port 3306 for MySQL test	2026-01-23 09:11:31 +01:00
Alexander Renz	0c9b44d313	fix(ci): add --allow-root for container environment	2026-01-23 09:06:20 +01:00
Alexander Renz	0418bbe70f	fix(ci): database name is positional arg, not --database flag - backup single testdb (positional) instead of --database testdb - Add --no-config to avoid loading stale .dbbackup.conf	2026-01-23 08:57:15 +01:00
Alexander Renz	1c5ed9c85e	fix: remove all hardcoded tmpfs paths - discover dynamically from /proc/mounts - discoverTmpfsMounts() reads /proc/mounts for ALL tmpfs/devtmpfs - No hardcoded /dev/shm, /tmp, /run paths - Recommend any writable tmpfs with enough space - Pick tmpfs with most free space	2026-01-23 08:50:09 +01:00
Alexander Renz	ed4719f156	feat(restore): add tmpfs detection for fast temp storage (no root needed) - Add TmpfsRecommendation to LargeDBGuard - CheckTmpfsAvailable() scans /dev/shm, /run/shm, /tmp for writable tmpfs - GetOptimalTempDir() returns best temp dir (tmpfs preferred) - Add internal/fs/tmpfs.go with TmpfsManager utility - All works without root - uses existing system tmpfs mounts For 119GB restore on 32GB RAM: - If /dev/shm has space, use it for faster temp files - Falls back to disk if tmpfs too small	2026-01-23 08:41:53 +01:00
Alexander Renz	ecf62118fa	fix(ci): use --backup-dir instead of non-existent --output flag	2026-01-23 08:38:02 +01:00
Alexander Renz	d835bef8d4	fix(prepare_system): Smart swap handling - check existing swap first - If already have 4GB+ swap, skip creation - Only add additional swap if needed - Target: 8GB total swap - Shows current vs new swap size	2026-01-23 08:33:44 +01:00
Alexander Renz	4944bee92e	refactor: Split into prepare_system.sh (root) and prepare_postgres.sh (postgres) prepare_system.sh (run as root): - Swap creation (auto-detects size) - OOM killer protection - Kernel tuning prepare_postgres.sh (run as postgres user): - PostgreSQL memory tuning - Lock limit increase - Disable parallel workers No more connection issues - each script runs as the right user && git push origin main	2026-01-23 08:28:46 +01:00
Alexander Renz	3fca383b85	fix(prepare_restore): Write directly to postgresql.auto.conf - no psql connection needed! New approach: 1. Find PostgreSQL data directory (checks common locations) 2. Write settings directly to postgresql.auto.conf file 3. Falls back to psql only if direct write fails 4. No environment variables, no passwords, no connection issues Supports: RHEL/CentOS, Debian/Ubuntu, multiple PostgreSQL versions	2026-01-23 08:26:34 +01:00
Alexander Renz	fbf21c4cfa	fix(prepare_restore): Prioritize sudo -u postgres when running as root When running as root, use 'sudo -u postgres psql' first (local socket). This is most reliable for ALTER SYSTEM commands on local PostgreSQL.	2026-01-23 08:24:31 +01:00
Alexander Renz	4e7b5726ee	fix(prepare_restore): Improve PostgreSQL connection handling - Try multiple connection methods (env vars, sudo, sockets) - Support PGHOST, PGPORT, PGUSER, PGPASSWORD environment variables - Try /var/run/postgresql and /tmp socket paths - Add connection info to --help output - Version bump to 1.1.0	2026-01-23 08:22:55 +01:00
Alexander Renz	ad5bd975d0	fix(prepare_restore): More aggressive swap size auto-detection - 4GB available → 3GB swap (was 1GB) - 6GB available → 4GB swap (was 2GB) - 12GB available → 8GB swap (was 4GB) - 20GB available → 16GB swap (was 8GB) - 40GB available → 32GB swap (was 16GB)	2026-01-23 08:18:50 +01:00
Alexander Renz	90c9603376	fix(ci): Use correct command syntax (backup single --db-type instead of backup --engine)	2026-01-23 08:17:16 +01:00
Alexander Renz	f2c6ae9cc2	fix(prepare_restore): Auto-detect swap size based on available disk space - --swap auto now detects optimal size based on available disk - --fix uses auto-detection instead of hardcoded 16G - Reduces swap size automatically if disk space is limited - Minimum 2GB buffer kept for system operations - Works with as little as 3GB free disk space (creates 1GB swap)	2026-01-23 08:15:24 +01:00
Alexander Renz	e31d03f5eb	fix(ci): Use service names instead of 127.0.0.1 for container networking In Gitea Actions with service containers, services must be accessed by their service name (postgres, mysql) not localhost/127.0.0.1	2026-01-23 08:10:01 +01:00
Alexander Renz	7d0601d023	refactor: Consolidate shell scripts into single prepare_restore.sh Removed obsolete/duplicate scripts: - DEPLOY_FIX.sh (old deployment script) - TEST_PROOF.sh (binary verification, no longer needed) - diagnose_postgres_memory.sh (merged into prepare_restore.sh) - diagnose_restore_oom.sh (merged into prepare_restore.sh) - fix_postgres_locks.sh (merged into prepare_restore.sh) - verify_postgres_locks.sh (merged into prepare_restore.sh) New comprehensive script: prepare_restore.sh - Full system diagnosis (memory, swap, PostgreSQL, disk, OOM) - Automatic swap creation with configurable size - PostgreSQL tuning for low-memory restores - OOM killer protection - Single command to apply all fixes: --fix Usage: ./prepare_restore.sh # Run diagnostics sudo ./prepare_restore.sh --fix # Apply all fixes sudo ./prepare_restore.sh --swap 32G # Create specific swap	2026-01-23 08:06:39 +01:00
Alexander Renz	f7bd655c66	feat(restore): Add OOM protection and memory checking for large database restores - Add CheckSystemMemory() to LargeDBGuard for pre-restore memory analysis - Add memory info parsing from /proc/meminfo - Add TunePostgresForRestore() and RevertPostgresSettings() SQL helpers - Integrate memory checking into restore engine with automatic low-memory mode - Add --oom-protection and --low-memory flags to cluster restore command - Add diagnose_restore_oom.sh emergency script for production OOM issues For 119GB+ backups on 32GB RAM systems: - Automatically detects insufficient memory and enables single-threaded mode - Recommends swap creation when backup size exceeds available memory - Provides PostgreSQL tuning recommendations (work_mem=64MB, disable parallel) - Estimates restore time based on backup size	2026-01-23 07:57:11 +01:00
Alexander Renz	25ef07ffc9	ci: trigger rebuild after verify_locks fix	2026-01-23 07:42:31 +01:00
Alexander Renz	6a2bd9198f	feat: add systematic verification tool for large database restores with BLOB support - Add LargeRestoreChecker for 100% reliable verification of restored databases - Support PostgreSQL large objects (lo) and bytea columns - Support MySQL BLOB columns (blob, mediumblob, longblob, etc.) - Streaming checksum calculation for very large files (64MB chunks) - Table integrity verification (row counts, checksums) - Database-level integrity checks (orphaned objects, invalid indexes) - Parallel verification for multiple databases - Source vs target database comparison - Backup file format detection and verification - New CLI command: dbbackup verify-restore - Comprehensive test coverage	2026-01-23 07:39:57 +01:00
Alexander Renz	e85388931b	ci: add comprehensive integration tests for PostgreSQL, MySQL and verify-locks	2026-01-23 07:32:05 +01:00
Alexander Renz	9657c045df	ci: restore exact working CI from release v3.42.85	2026-01-23 07:31:15 +01:00
Alexander Renz	afa4b4ca13	ci: restore robust, working pipeline and document release 85 fallback	2026-01-23 07:28:47 +01:00
Alexander Renz	019f195bf1	ci: trigger pipeline after checkout hardening	2026-01-23 07:21:12 +01:00
Alexander Renz	29efbe0203	ci(checkout): robustly fetch branch HEAD (fix typo)	2026-01-23 07:20:57 +01:00
Alexander Renz	53b8ada98b	ci(lint): run 'go mod download' and 'go build' before golangci-lint to catch typecheck/build errors	2026-01-23 07:17:22 +01:00
Alexander Renz	3d9d15d33b	ci: add main-only integration job 'integration-verify-locks' (smoke) + backup ci.yml	2026-01-23 07:07:29 +01:00

... 2 3 4 5 6 ...

578 Commits