homelab-infra

Author	SHA1	Message	Date
Micha	4b96d13510	security(authelia): borg-ui und code-server auf two_factor heben Beide UIs haben effektiv Host-/Backup-Zugriff (Borg-Restore-Scope inkl. /local/secrets, code-server mit Workspace-Mounts). Bisher liefen sie ueber die catch-all-Regel mit nur one_factor. Files und Scrutiny waren bereits two_factor; die Liste wird konsistent gezogen. Wirkung erst nach manuellem Host-Merge (Ausnahme laut docs/WORKFLOW.md): 1. /mnt/user/appdata/authelia/config/configuration.yml mergen 2. docker restart authelia 3. Smoke-Test auf einer der vier 2FA-Domains 4. services/authelia-diff.sh muss exit 0 liefern Audit-Restliste nachgezogen: Tier-1-Operator-2FA geschlossen, restliche geparkte Auth-Themen (OIDC, CrowdSec, Nextcloud-2FA) bewusst weiter offen mit aktualisierter Begruendung.	2026-06-03 15:03:15 +02:00
Micha	642eb88b40	docs(restore): traefik restore successful - 11 of 12 tests green Traefik-Restore am 2026-06-03 erfolgreich: dynamic/ (2 Files) + letsencrypt/acme.json (426K) aus Borg, File-Provider-Boot, /ping 200. Erster Versuch, kein shfs-Problem. 11 von 12 Restore-Tests sind jetzt gruen. Einzig Nextcloud bleibt blockiert durch Unraids shfs-chmod-Inkompatibilitaet. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 14:45:27 +02:00
Micha	dd494046ce	feat(restore): traefik restore smoke test Borg-Extract von dynamic/ und letsencrypt/, Traefik startet mit File-Provider gegen restaurierte Config, /ping Health antwortet. Bewusst kein docker.sock (wuerde produktive Container discovern), kein CF-Token (keine DNS-Challenge), keine produktiven Ports. acme.json-Existenz und -Groesse wird geprueft, TLS-Validitaet nicht. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 14:42:56 +02:00
Micha	16d3b8f2fa	docs(restore): mailarchiver restore successful, update matrix and backlog Mail-Archiver-Restore am 2026-06-03 erfolgreich: Data-Protection-Keys aus Borg + 645M pg_restore + HTTP 200. Erster Versuch, kein shfs-Problem. 10 von 12 Restore-Tests sind jetzt gruen. Verbleibend: Nextcloud (blockiert/shfs-chmod) und Traefik (komplex, niedrigere Prio). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 14:08:28 +02:00
Micha	a9b232195d	feat(restore): mail-archiver restore smoke test Borg-Extract der Data-Protection-Keys + pg_restore des 645M mailarchiver-Dumps in isoliertes Test-Postgres + Container-Boot + HTTP-Smoke. Wegwerf-DB-Connection und Auth-Password, kein produktiver Stack-ENV, kein Authelia-ForwardAuth im Smoke. Machbarkeit vorab verifiziert: Dump vorhanden, App-Image gepinnt, Data-Protection-Keys im Borg, .NET-App hat kein shfs-chmod-Problem. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 14:01:47 +02:00
Micha	5ee4a158d6	docs(restore): mealie restore successful, update matrix and backlog Mealie-Restore-Test am 2026-06-03 erfolgreich: Borg-Data + pg_restore + HTTP 200, 3 Rezepte im Test-DB-Check. Erster Versuch, kein shfs-Problem (Mealie startet als root, kein chmod auf User Shares). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 13:54:05 +02:00
Micha	86435d4091	feat(restore): mealie restore test + freshness check negativ-test fix Mealie-Restore-Test: Borg-Extract der App-Daten + pg_restore in isoliertes Test-Postgres + Mealie-Boot + HTTP /api/app/about Smoke. Machbarkeit vorab verifiziert (kein shfs-chmod-Problem, Mealie laeuft als root und switcht intern auf PUID 99). Freshness-Check: pg_header_ok() Docker-Fallback lieferte bei korruptem Dump return 2 (unchecked) statt return 1 (invalid). Negativ-Test am 2026-06-03 bewiesen: korrupter mealie.dump wird jetzt als DUMP_HEADER_INVALID erkannt (Critical, Exit 1). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 13:49:51 +02:00
Micha	5e52316fab	fix(restore): freshness check pg_header_ok returns 1 on corrupt dump Negativ-Test 2026-06-03: korrupter mealie.dump wurde nicht erkannt, weil der Docker-Fallback-Pfad nach gescheitertem pg_restore --list zu return 2 (unchecked) durchfiel statt return 1 (invalid). Fix: explizites if/else statt &&-Kette, damit fehlgeschlagene Header-Validierung return 1 liefert und als DUMP_HEADER_INVALID in den Critical-Zaehler geht. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 13:47:08 +02:00
Micha	8a4df239fa	fix(restore): pin komodo test mongo to 8.0.23 matching production Produktive Mongo ist 8.0.23, Test-Composes pinnten noch 7.0.32. Eliminiert die Cross-Version-Warnung beim mongorestore. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 13:44:58 +02:00
Micha	893b34a585	docs(restore): shared pg cluster drill successful, all 5 DBs restored Shared PostgreSQL 18 Cluster Restore Drill am 2026-06-03 erfolgreich: Globals + 5 per-DB Custom-Format-Dumps, 290 Tabellen gesamt, data_checksums=on. Alle P1-Backlog-Punkte sind damit erledigt. Ergebnis pro DB: - paperless: 72 Tabellen - mailarchiver: 1 Tabelle - authelia: 25 Tabellen - nextcloud: 126 Tabellen - mealie: 66 Tabellen Mailarchiver-Bootstrap-Rollenkonflikt wurde wie dokumentiert toleriert. Lauf dauerte ~14 Minuten (mailarchiver.dump = 645M). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 13:17:35 +02:00
Micha	d1f9491b24	feat(restore): shared postgresql 18 cluster restore drill Kompletter Restore-Drill fuer den Shared-PostgreSQL-18-Cluster: globals (Rollen) + 5 per-DB Custom-Format-Dumps (paperless, mailarchiver, authelia, nextcloud, mealie). Bekannter mailarchiver-Bootstrap-Rollenkonflikt wird toleriert. Authelia/Nextcloud/Mealie-Dumps als optional markiert. Tabellen-Count pro DB als fachlicher Sanity-Check. Machbarkeit vorab verifiziert: alle Dumps auf Host vorhanden, pg_restore im postgres:18.4-Image verfuegbar, Postgres auf shfs bewiesen durch bestehende Tests. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 13:02:16 +02:00
Micha	14de2f4801	docs(restore): komodo mongo restore successful, update matrix and backlog Komodo-Mongo-Daten-Restore am 2026-06-03 erfolgreich: mongorestore von komodo-mongo.archive.gz in Wegwerf-Mongo, 86904 Dokumente (inkl. 32 Stack-Definitionen). Damit ist die kanonische Quelle fuer KOMODO_*-Stack-ENV-Werte im DR-Fall als wiederherstellbar belegt. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 11:25:32 +02:00
Micha	90d1595285	fix(restore): komodo mongo restore own compose to avoid container name collision Zweiter Lauf scheiterte mit Auth-Failure weil der Container-Name restoretest-komodo-mongo mit dem alten Bootstrap-Test kollidierte (stale Datadir auf shfs mit anderen Credentials). Fix: eigenes Compose mit eigenem Container-Namen (restoretest-komodo-mongorestore) und eigenem Project-Name, damit keine Namenskollision mit dem bestehenden Bootstrap-Test entsteht. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 11:23:17 +02:00
Micha	c1985e177b	fix(restore): komodo mongorestore --noIndexRestore for auth compat Erstlauf 2026-06-03: 86904 Dokumente (inkl. 32 Stack-Dokumente) erfolgreich restored, aber Exit 1 weil der Index-Rebuild mit "Command createIndexes requires authentication" scheitert (Test-User hat keine dbAdmin-Rolle). Fix: --noIndexRestore. Fuer den Smoke-Zweck (Stack-Definitionen lesbar, KOMODO_*-ENV-Werte rekonstruierbar) reicht das. Indexe werden bei einem echten Komodo-Restart ohnehin neu aufgebaut. Nebenbefund: produktive Mongo ist 8.0.23, Test-Compose pinnt 7.0.32. Cross-Version-Warning ist fuer den Lesetest harmlos, aber der Bootstrap-Compose-Pin sollte separat auf 8.0 nachgezogen werden. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 11:20:53 +02:00
Micha	a244f2d677	feat(restore): komodo mongo data restore test Neuer Test: mongorestore von komodo-mongo.archive.gz in eine frische Wegwerf-Mongo. Beweist, dass die Stack-Definitionen und damit die KOMODO_*-Stack-ENV-Werte aus dem Dump rekonstruiert werden koennen (kanonische Quelle laut docs/DISASTER_RECOVERY.md 6.2.1). Machbarkeit vorab verifiziert: Dump 6.0M auf Host vorhanden, mongorestore im mongo:7.0.32-Image verfuegbar, shfs-Write funktioniert. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 11:18:39 +02:00
Micha	ef032f2dde	docs(restore): document nextcloud shfs-chmod blocker Nextcloud-Restore-Test Erstlauf 2026-06-03 nach 5 Iterationen als strukturell blockiert durch Unraid shfs/FUSE eingestuft. Ursache: Nextcloud 33 fuehrt zur Laufzeit chmod() auf Dateien unter /var/www/html aus (OC_Util.php#486). Auf Unraids FUSE/shfs User Shares ist chmod nicht moeglich - weder vom Host (chown ignoriert) noch aus dem Container (Operation not permitted), auch nicht ohne no-new-privileges. In Produktion funktioniert Nextcloud, weil die Daten dort auf einem Cache-Drive (XFS/BTRFS direkt) statt ueber shfs liegen. Scaffold (Skript + Compose) bleibt im Repo als Ausgangspunkt fuer die Loesung. Drei Optionen dokumentiert: a) Restore-Lab auf Cache-Drive b) Docker-Volumes statt Bind-Mounts c) tmpfs + rsync Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 11:14:39 +02:00
Micha	6fec64d0a1	fix(restore): nextcloud dump from host path instead of borg extract Erstlauf 2026-06-03: borg_extract fuer den Nextcloud-Dump scheiterte still (Pfad local/borg-dumps/latest/nextcloud.dump existiert im Archiv moeglicherweise unter einem anderen Prefix). Der Dump liegt taeglich frisch auf dem Host unter /mnt/user/backups/borg/dumps/latest/ und wird von dort in Borg gesichert - der Smoke-Wert ist identisch. HTML (App-Code + config) kommt weiterhin aus dem Borg-Archiv. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 11:03:45 +02:00
Micha	5d1ae68705	fix(restore): nextcloud permissions on unraid shfs (no-new-privileges removal) Zweiter Erstlauf 2026-06-03 scheiterte weiterhin mit 503, obwohl config.php korrekt gepatcht war. Ursache: Unraid's FUSE/shfs-Dateisystem auf User-Shares ignoriert chown -R 33:33 still — Dateien bleiben bei sshd:sshd. Der Nextcloud-Entrypoint versucht intern chmod/chown auf /var/www/html und /var/www/html/data, was mit no-new-privileges:true blockiert wird. Fix: - no-new-privileges vom restoretest-nextcloud Container entfernt, damit der Entrypoint Rechte im Container selbst setzen kann (Test-Postgres und Test-Redis behalten no-new-privileges) - Host-seitiger chown durch chmod a+rwX ersetzt (funktioniert auf shfs) - Vertretbar im isolierten Smoke-Kontext (127.0.0.1, Wegwerf-Daten, kein Traefik) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 10:55:56 +02:00
Micha	2913e1005f	fix(restore): nextcloud chown 33:33 for www-data after borg extract Erstlauf 2026-06-03 scheiterte mit dauerhaft 503. config.php-Patching (Redis-Host + trusted_domains) war korrekt, aber Nextcloud konnte die restaurierten Dateien nicht lesen/schreiben: "chmod(): Operation not permitted at OC_Util.php#486". Ursache: Borg-Extract ueber den borg-ui Container legt Dateien mit dem borg-ui-User (sshd o.ae.) an. Nextcloud im Container laeuft als www-data (UID 33). Mit no-new-privileges:true scheitert jeder chmod/ chown-Versuch im Container. Fix: chown -R 33:33 auf html/ und data/ nach dem Extract, bevor der Nextcloud-Container startet. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 10:44:12 +02:00
Micha	6f0e6f0d5a	fix(restore): nextcloud config.php patching for redis host and trusted_domains Erstlauf 2026-06-03 scheiterte mit 503: Redis-Host war noch auf dem produktiven 'nextcloud-redis' statt 'restoretest-nextcloud-redis', und trusted_domains enthielt kein 127.0.0.1 (Nextcloud blockt mit "Access through untrusted domain"). Ursache: das sed-Pattern fuer Redis versuchte den ganzen Array-Block einzeilig zu ersetzen, traf aber das PHP-Mehrzeilenformat nicht. Und das trusted_domains-sed fand das Schliessmuster nicht zuverlaessig. Fix: - Redis-Host separat per sed patchen (nur den 'host'-Wert im Block) - trusted_domains per PHP-CLI rewrite (robuster als sed auf PHP-Arrays) - Fallback auf sed fuer Hosts ohne php Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 10:34:30 +02:00
Micha	f473fbaa8b	feat(restore): nextcloud restore smoke test scaffold Nextcloud-Restore-Test nach dem Muster der anderen Restore-Smokes: - Borg-Extract von html (App-Code + config.php) und nextcloud.dump - pg_restore in isoliertes Test-Postgres (mit Retry-Schleife) - config.php wird im Restore-Lab auf Test-DB-Credentials gepatcht (produktive Secrets werden nicht gemountet) - Nextcloud startet gegen restaurierte Daten + Test-Redis - Smoke prueft HTTP /status.php und occ status (maintenance mode) - Produktive Nutzdaten unter /mnt/user/documents/nextcloud-data werden bewusst NICHT gemountet (zu gross fuer regelmaessigen Smoke) Erster Lauf steht aus und braucht Operator-Freigabe auf dem Host. Dispatcher und ntfy-Wrapper um Nextcloud erweitert. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 10:05:10 +02:00
Micha	c922d1f241	docs(restore): finalize audit - handbook update, reifegrad matrix, backlog Schliesst das Restore-Skills-Audit 2026-06-02/03 ab: - RESTORE_HANDBOOK.md auf Stand 2026-06-03: alle 6 verifizierten Tests (Vaultwarden, Gitea, Paperless, Immich, Authelia, Komodo-Bootstrap) dokumentiert, Frequenz-Tabelle aktualisiert, Betriebsmodus auf V1+ (mit ntfy), Schnellstart um Immich/Authelia/Komodo ergaenzt, Report-Aufbewahrungsregel dokumentiert, Ausbaustufen priorisiert. - RESTORE_MATRIX.md: neue Sektion "Restore-Test-Reifegrad" mit Uebersichtstabelle (pro Dienst: Tier, letzter Test, Typ, naechster Lauf) und priorisierter Kandidatenliste fuer fehlende Tests. - Gitea-Restore: SSH-Check im Report korrekt als "TCP connect only" benannt statt "SSH port open" (war Audit-Finding M3). - AUDIT_2026-05-25_TODO.md: Restore-Audit-Backlog ergaenzt mit den verbleibenden 8 offenen Punkten (Nextcloud, Shared PG18, Komodo-Mongo, Mailarchiver, Mealie, Traefik, Negativ-Test, E2E-DR-Drill). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 09:31:19 +02:00
Micha	ba3ef8fcfc	docs(restore): mark authelia smoke successful and schedule	2026-06-03 08:55:04 +02:00
Micha	52fc007123	fix(restore): authelia smoke without dump-restore, drop bogus env, disable ntp Erstlauf 2026-06-03 hat einen by-design-Konflikt offengelegt: pg_restore des produktiven postgresql17-authelia.dump in eine Test-Instanz mit Wegwerf AUTHELIA_STORAGE_ENCRYPTION_KEY scheitert im Authelia-Startup-Check mit "the configured encryption key does not appear to be valid for this database". Productive Storage-Werte werden mit dem produktiven Key verschluesselt; ein Wegwerf-Key kann sie nicht entschluesseln. Smoke ist deshalb explizit auf Config-Restore + Boot reduziert, nicht Daten-Decrypt. Zwei Nebenbefunde aus demselben Lauf: - AUTHELIA__SERVER__ADDRESS (Doppel-Underscore) wurde von Authelia 4.39 abgelehnt ("configuration environment variable not expected"). ENV entfernt; server.address kommt eh aus der generierten configuration.yml. - ntp-Startup-Check schlug fehl ("Could not determine the clock offset ... lookup time.cloudflare.com on 127.0.0.1:53: server misbehaving"), weil das isolierte Test-Compose-Netz keinen DNS-Resolver fuer NTP hat. Neuer Test-Config-Block setzt ntp.disable_startup_check: true. Doku nachgezogen (Plan + Runbook): Encryption-Key-Konflikt ist explizit als "nicht Teil dieses Smokes" dokumentiert; Fehler-Matrix hat Eintraege fuer Doppel-Underscore-ENV und NTP-Lookup. Frische des produktiven authelia-Dumps wird unveraendert ueber check-restore-freshness.sh ueberwacht; Daten-Decrypt-Drill ist eine eigene DR-Aufgabe mit kontrollierter Schluessel-Verwendung. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 08:27:40 +02:00
Micha	8d71dfb9ad	fix(restore): authelia smoke default_policy two_factor (rules-less) Authelia 4.39 verlangt: ohne access_control.rules muss default_policy 'two_factor' oder 'one_factor' sein. 'bypass' war nur historisch zulaessig, mit 4.39 schlaegt config validate fehl mit "'default_policy' option 'bypass' is invalid: when no rules are specified it must be 'two_factor' or 'one_factor'". /api/health ist public und laeuft nicht durch access_control - die Smoke-Semantik bleibt unveraendert. Beobachtet im Erstlauf 2026-06-03 nach Refactor auf Minimal-Testkonfig (Commits 541c7be..440000c). Mit diesem Fix sollte 'authelia config validate' durchlaufen; HTTP /api/health-Smoke ist der Folgeschritt. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 08:09:35 +02:00
Micha	440000c085	fix(restore): generate minimal authelia smoke config	2026-06-03 08:04:59 +02:00
Micha	cacf77bfb0	fix(restore): avoid authelia smtp env in smoke test	2026-06-03 08:01:10 +02:00
Micha	cd4dd178ed	fix(restore): isolate authelia runtime config mount	2026-06-03 07:57:57 +02:00
Micha	541c7be853	fix(restore): generate sanitized authelia test config	2026-06-03 07:43:57 +02:00
Micha	b1ae9f3c26	fix(restore): harden restore checks and add authelia smoke scaffold	2026-06-03 07:39:05 +02:00
Micha	e2624796f0	fix: set vaultwarden DNS resolvers	2026-06-02 20:05:55 +02:00
Micha	9f63e6e3bc	docs: archive rollback volumes after burn-in	2026-06-02 19:55:02 +02:00
Micha	8eb367f0b5	revert: remove social-to-mealie-plus stack	2026-06-02 19:44:35 +02:00
Micha	745761f518	feat: add social-to-mealie-plus stack	2026-06-02 19:17:59 +02:00
Micha	ac637d30fb	docs: record n8n encryption key restore source	2026-06-02 06:47:00 +02:00
Micha	b0a6244e21	apps: pin super-productivity and n8n image digests	2026-06-02 06:44:03 +02:00
Micha	4fb17a09e6	apps: add n8n + mail-to-gitea-issue workflow (n8n.kaleschke.info)	2026-06-02 06:28:01 +02:00
Micha	be5c68751f	apps: add super-productivity stack (sp.kaleschke.info, Authelia)	2026-06-02 06:27:00 +02:00
Micha	3bfd065326	Update Scrutiny image digest	2026-06-01 16:42:31 +02:00
Micha	eeebeec804	Switch Paperless GPT to OpenAI API	2026-06-01 16:18:58 +02:00
Micha	55fdb13532	Enable Vaultwarden SMTP invites	2026-06-01 15:52:31 +02:00
Micha	8709fe8239	Focus family onboarding on core apps	2026-06-01 15:25:48 +02:00
Micha	89114b1b12	Record append-only operator decision	2026-06-01 15:16:56 +02:00
Micha	3da19421d0	Document hetzner account hygiene	2026-06-01 15:09:37 +02:00
Micha	16e661be87	Document fritzbox config backup	2026-06-01 14:19:13 +02:00
Micha	12c05376d0	Close fritzbox service window docs	2026-06-01 13:02:03 +02:00
Micha	dfd0ccbb9a	Refine external IPv6 operator check	2026-06-01 12:51:16 +02:00
Micha	ae5d4aedfc	Prepare external operator checks	2026-06-01 12:48:00 +02:00
Micha	479eb291c4	Prepare final homelab cleanup gates	2026-06-01 12:19:17 +02:00
Micha	c3222e800b	Validate backup follow-up and harden nearline pull	2026-06-01 08:27:52 +02:00

1 2 3 4 5 ...

688 Commits