SoulSync

Commit Graph

Author	SHA1	Message	Date
nick2000713	bf5affd03c	resolve merge conflict in style.css	5 days ago
BoulderBadgeDad	fece771dd0	Security UI: show saved login password / recovery question state After saving a password or recovery question, a refresh made the section look unset (passwords are never echoed back to the browser), so it seemed like you had to redo it. Now the saved state is reflected: - "✓ A login password is set" appears when the admin has a password; the field becomes "Enter a new password to change it". - "✓ Recovery question saved: <question>" appears, the saved question is pre- selected (preset or custom), and the answer field becomes "Enter a new answer to change it". - Shown both on load (applyLoginSavedState from /api/profiles, which now includes recovery_question — not secret, already shown on the sign-in screen) and immediately after saving. 64 integrity tests pass.	6 days ago
BoulderBadgeDad	2bb9bc1357	Settings: reorganize Security into clear groups with visible prerequisites The security section had grown into a flat pile of toggles with hidden dependencies. Regrouped into three labelled cards so it reads top-to-bottom: - 🔑 Lock with a PIN — set PIN (Step 1) → Require PIN - 👤 User accounts (login) — Step 1 admin password → Step 2 recovery question → Step 3 Require login. The Step 3 toggle is now visually LOCKED (greyed + disabled + "set the admin password first" hint) until an admin password exists, so the anti-lockout rule is obvious instead of surfacing as a 400 on save. It unlocks the moment the password is saved. - 🌐 Reverse proxy & remote access — the proxy toggle, with the auth-proxy header nested under it (indented), plus WebSocket origins. - get_all_profiles/get_profile now expose has_password + has_recovery so the UI can reflect setup state; updateRequireLoginGate() drives the lock. - New .security-subgroup/.security-subhead/.security-nested/.security-locked CSS. All IDs + handlers preserved. Inert unless used; default install unaffected. 64 script-split integrity tests pass.	6 days ago
BoulderBadgeDad	613688a9ad	Login recovery (DB + backend): security question to reset a forgotten password Closes the forgot-login-password gap. A per-profile recovery question + answer lets a locked-out user reset their own password. - DB: additive recovery_question + recovery_answer_hash columns (idempotent migration). set/get-question/verify/has methods; answer is hashed (pbkdf2) and matched forgivingly (trim + lowercase + collapse whitespace). No recovery set → never verifies. - Endpoints (allowlisted in the login gate so they work pre-auth): GET /api/auth/recovery-question?username= (generic 404 when absent), POST /api/auth/recovery-reset {username, answer, new_password} — brute-force limited; a correct answer sets the new password + authenticates the session. POST /api/profiles/<id>/set-recovery (admin or self) to configure it. Tests: set/get/verify, forgiving match, hashed-not-plaintext, no-recovery-never- verifies, full reset flow (wrong answer rejected + password intact; correct answer resets), unknown-user 404. 25 tests pass. Next: the Settings + login-screen UI.	6 days ago
BoulderBadgeDad	8e1b678d6f	Native login (increment 1/3): per-profile password DB layer Opt-in username/password login — profiles become real accounts. This is the data layer: a per-profile login password, kept SEPARATE from the quick-switch PIN (different security purpose; a 4-digit PIN must not become the password guarding a public instance). - Additive migration: profiles.password_hash column (idempotent, metadata-flagged). - set_profile_password / verify_profile_password / profile_has_password / get_profile_by_name (the login username = profile name, unique + case-insensitive). - Security default: a profile with NO password is NOT loginable (verify returns False) — unlike the PIN where "no PIN = always valid". You can't authenticate to an account with no credential. Tests: migration adds the column; set/verify; no-password-never-loginable; clearing; name lookup; and password is fully independent of the PIN. 6 tests pass. Next: the login endpoint + require_login gate (increment 2).	6 days ago
BoulderBadgeDad	27d738e7b1	Fix: Find & Add library search buried exact matches (case-sensitive ordering) Reported via Find & Add (Billie Eilish "bad guy"): the track was in the library and on Plex, but never showed in the modal's 20 results. Root cause (proven against the real 307k-track DB): the search did `ORDER BY tracks.title`, which is case-SENSITIVE in SQLite (BINARY collation sorts 'B' before 'b'). Billie's title is lowercase "bad guy"; everyone else's is "Bad Guy", so all the capitalised ones sorted first, filled the LIMIT, and her exact match landed at ~#25 — cut off. - search_tracks now ranks by relevance: exact title match first (case-insensitive via unidecode_lower), then prefix, then alphabetical — so an exact match can't be sorted below the limit by a capital letter. Helps every caller. - Added a rank-only `rank_artist` hint (never filters): Find & Add already knows the source track's artist, so it now passes it and the exact title+artist match floats to #1. Filtering was deliberately avoided — if the track is tagged under a slightly different artist on the server, a filter would re-hide it. Verified on the real DB: title-only "bad guy" now surfaces Billie at #4 (was >#20); with the artist hint she's #1. Seam tests: lowercase exact title isn't buried; rank hint floats the match without filtering; exact title beats a superstring title. 10 tests pass.	6 days ago
dev	97b40cbd43	feat(verification): review queue — listen/compare/approve/delete unverified downloads - ⚠ Unverified filter rows gain actions: inline play (range-streamed from the history file path, server-side only), YouTube compare, Approve -> new human_verified status (tag + history + tracks; AcoustID scanner skips these entirely), Delete (file + entry) - API: /api/verification/<id>/stream\|approve\|delete (path only from DB row) - backfill: history rows with acoustid_result='fail' that exist at all were imported despite the failure = force_imported (covers pre-fix fallback imports like the user's 'My Ordinary Life') Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	6 days ago
dev	41536384c3	fix(verification): persist status on ALL pipeline success exits + history backfill The pipeline has three success exits (simple download, playlist folder mode, main) but only the main one persisted the verification status — force-imported playlist tracks got no tag, no history status, and never appeared in the Unverified filter. Extracted _persist_verification_status() and call it at every exit. One-time idempotent backfill derives status for existing history rows from their recorded acoustid_result (pass->verified, skip->unverified). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	6 days ago
dev	2a11dc961a	feat(verification): persist status into library_history, badge on Downloads completed list The persistent Completed list is built from library_history (not live tasks), so the badge never showed after a session ended. Column added (additive), written at import, passed through _build_history_download_item, rendered by _adlVerifBadge next to the status label. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	6 days ago
dev	8e6820dbdf	feat(verification): status vocabulary, DB column, SOULSYNC_VERIFICATION tag Also: evaluate() treats an empty expected artist as title-only comparison (old scanner behaviour — a missing DB artist is no evidence of a wrong file), and the thresholds are now defined once in the core and re-exported. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	6 days ago
BoulderBadgeDad	60b9fe10e9	Profiles: per-profile Tidal self-auth (playlists) — with a safe token-save redirect Second service. Each profile connects its own Tidal; its playlist reads use that account, everything else stays global. The gotcha vs Spotify: TidalClient loads AND saves tokens to one global slot (tidal_tokens), so a naive per-profile client would clobber the admin's tokens on refresh. - get_tidal_client_for_profile builds a dedicated TidalClient seeded with the profile's tokens, refreshed via the shared/global app creds, and OVERRIDES its _save_tokens to persist to the PROFILE row — never the global slot. Admin (profile 1) + unconnected profiles use the global client unchanged. Cached per profile + evicted on (dis)connect. - DB: set_profile_tidal_tokens / get_profile_tidal (encrypted); the OAuth callback now uses them + evicts the cached client. - Wired the Tidal playlist reads (list + tracks) to the per-profile client; the module import line left intact. - My Accounts: Tidal row (Connect via /auth/tidal?profile_id=, status, Disconnect). Connections API extended; disconnect made generic (/<service>/disconnect). Admin sees "managed in Settings" for every service. Tests: per-profile token refresh writes to the profile and leaves the global tidal_tokens untouched (the safety guarantee); connect status + disconnect; admin/unconnected → global client. 22 endpoint tests pass.	6 days ago
BoulderBadgeDad	daee96f814	Profiles Phase 0: service-credential-sets foundation (data + resolver, dormant) Groundwork for admin-created, per-profile-switchable credential sets ("pills") across auth services (Spotify/Tidal/Deezer/Qobuz/Plex/Jellyfin/Navidrome). Strictly additive and dormant — nothing reads it at runtime yet, so zero behaviour change for existing installs. - core/credentials/store.py: pure service registry + payload validation + stale-safe active-set selection (pick_active_credential falls back to None when a selected set was deleted, so a profile never breaks). - migration service_credentials_v1: two new tables — service_credentials (admin-created named sets; payload Fernet-encrypted at rest) and profile_service_credentials (each profile's selected set per service). - MusicDatabase CRUD: create/update/delete/list/get_service_credential (list never returns the payload; get decrypts for the resolver), plus set/get_profile_service_credential and resolve_profile_service_credential (returns the profile's active payload or None → caller uses global default). Tests: 12 — pure validation + stale-safe selection, and real-temp-DB storage proving encryption round-trips, payload never lists, dup(service,label) rejected, per-profile/per-service resolution, and delete clearing dangling selections to a clean fallback. 95 migration/DB tests still pass.	7 days ago
BoulderBadgeDad	9f12bdfef6	Watchlist: bespoke live scan deck + persistent per-run Scan History (#831 round 2) Boulder: the live display was a cramped ~600px box showing a fraction of the data the scan already tracks, with no animation and no history. Live scan deck (replaces the three-column box, full width): - Header: pulsing live dot, "x / y artists" progress text, and two live counter chips (found / added) that pop when they change. - Animated progress bar (artist index / total) with a shimmer sweep. - Stage: artist avatar with accent glow + name + readable phase line ("Checking album 2 of 5"), album art + album + current track. - "Added to wishlist this run" feed: taller, bigger art, slide-in animation that plays once per new track (feed re-renders only when it changes). - All data was already in scan_state (current_artist_index, total_artists, tracks_found/added_this_scan, current_phase) — just never displayed. The legacy fullscreen-modal markup shares element ids and lacks the new ones, so it keeps working untouched. Scan History (persistent): - New watchlist_scan_runs table — one row per run (status, timestamps, artists/found/added counts) + the full track ledger JSON. Saved at scan completion AND cancellation; idempotent on run_id; pruned to the last 100 runs. Wishlist rows erode as tracks download, so this is the durable record. - GET /api/watchlist/scan/history (runs) + /history/<run_id>/tracks (ledger). - New History button on the Watchlist page → modal in the origins/blocklist house style: run cards (date, cancelled chip, artists/found/added stats) expanding into the Added / Skipped track lists with art and badges. Tests: save+fetch with ledger, idempotent re-save, prune keeps newest, unknown-run empty, cancelled runs recorded. 398 watchlist/wishlist/history tests pass; JS syntax-checked; all rendered strings escaped.	7 days ago
BoulderBadgeDad	0939585620	Matcher: bracketed subtitles no longer read as different songs (#825 ) carlosjfcasero round 2 (manual-add fix didn't help — different path). His log pinned it: the mirrored sync auto-added 'Llamando a la tierra (Serenade From the Stars)' by M-Clan every run even though his library has the song (stored bare). Reproduced exactly: the subtitle restates no album context, so the #808 context strip keeps it, and the length-ratio penalty in _calculate_track_confidence crushes the pair to 0.142 (needs 0.7). Sync → "missing" → wishlist, forever; and the cleanup uses the SAME matcher, so it deterministically never removed it. Self-reinforcing. Fix at the matcher seam (benefits sync, cleanup, downloads, discography alike): core/text/title_match.strip_subtitle_qualifiers(title, other) strips a bracketed qualifier only when it (a) isn't restated in the other title, (b) contains no version-marker token (EN + ES: live/remix/acoustic/version/ dueto/directo/vivo/...), and (c) introduces no new digit token ('(Pt. 2)', '(2007)' stay different releases). Wired as a third comparison variant in _calculate_track_confidence with its own length guard. Verified against his log's other unmatched tracks: '(Live)' 0.15, '(Dueto 2007)' 0.179, '(Versión 1988)' 0.167 all still correctly blocked — version qualifiers keep their meaning; the M-Clan case goes 0.142 → 1.0 in both directions. Also: sync's check_track_exists call now passes album= (cleanup already did), enabling the album-aware fallback for multi-artist albums during sync. Tests: tests/test_subtitle_qualifier_match.py — the reported case verbatim (end-to-end through check_track_exists, both directions, batched candidate path included), EN+ES version qualifiers still blocked, numeric guard, '#769 Dani California' and '#808 OurVinyl' guards still hold. 1396 matcher/wishlist/sync tests pass.	1 week ago
BoulderBadgeDad	1d16ac7978	Downloads: reuse an album's existing folder so batches don't split it (#829 ) Tacobell444: when tracks land in an album across multiple batches (a wishlist run, the Album Completeness job, a missed track re-downloaded later), the folder is rebuilt from API metadata each time — so when $albumtype or $year come back blank/different on a later batch, the folder NAME changes and the album splits, forcing a Reorganize. Fix: build_final_path_for_track now checks whether the album already lives in a single folder on disk and, if so, drops the new track there instead of a freshly templated folder. Match (chosen): exact stored Spotify album id first, then a STRICT >=0.85 name+artist match (vs the 0.7 used elsewhere) — a wrong match here misplaces a file. New core/library/existing_album_folder.resolve_existing_album_folder holds the logic; always-on with template fallback. Safety rails: only returns a folder UNDER the transfer dir (never a read-only library/NAS mount), only when the album lives in EXACTLY ONE folder (multiple = disc subfolders, which DatabaseTrack can't disambiguate — those defer to the template), and any failure falls through to the template path. Added MusicDatabase.get_album_by_spotify_album_id for the id-first lookup. Tests: single-folder reuse, no-match, below-threshold, multi-folder defer, outside-transfer reject, id-first, missing transfer dir, no-files-on-disk. 8 tests; 1556 path/import/download tests pass (only the known soundcloud failures remain).	1 week ago
BoulderBadgeDad	a79816ad69	Full release dates: store + write yyyy-mm-dd end to end (#824 part 2) Part 1 stopped existing full dates being destroyed; this adds first-class support for full release dates so they can be set + persisted instead of truncated to a year at the DB layer. - Schema: new nullable `release_date TEXT` on the albums table (idempotent ALTER-ADD-COLUMN repair on startup + the live CREATE). NULL = year-only, every reader falls back to albums.year, so it ships safe/dormant. - Tag writer: write_tags_to_file + build_tag_diff prefer db_data['release_date'] (the full date) over the year int; _date_to_write writes the full date. When there's no release_date it's exactly Part-1 behavior (year, preserving an equally-specific existing file date). - Retag read path: SELECT al.release_date in the tag-preview/write queries and thread it into _build_library_tag_db_data. - Manual edit: release_date added to ALBUM_EDITABLE_FIELDS + a "Release Date" field (YYYY-MM-DD, validated client-side) in the album editor; the artist-album query returns it so existing values show. User-set dates are authoritative. - Enrichment: Spotify + iTunes workers store the source's full release_date (YYYY-MM / YYYY-MM-DD) when present, only when empty — never clobbering a manual value. Tests: writer uses release_date over year + overrides an existing file date; falls back to year when absent; diff compares the full date. Migration verified idempotent + enrichment no-clobber. 1435 tag/retag/db/library tests pass.	1 week ago
BoulderBadgeDad	696119d5ac	Expired Download Cleaner: retention-based cleanup of watchlist/playlist downloads (Boulder) A Library Maintenance job that cleans up downloads tracked by Download Origins once they pass a per-origin retention window — findings by default, opt-in auto-delete. A download is only ever proposed for deletion when ALL hold: older than its origin's retention, NOT still in an actively-mirrored playlist / watched artist, and played fewer than the keep-threshold (default 2 → "played more than once is kept"). Only touches downloads recorded from the Download Origins feature forward — never pre-existing or manual library. - core/library/expired_cleanup.py: pure decision core (retention_cutoff, is_expired, select_expired) — no DB/clock, fully tested. play_count is the reliable listen signal (last_played is often unpopulated, so recency isn't used). - ExpiredDownloadCleanerJob: gathers facts (play_count via a new get_origin_cleanup_candidates join; active-mirror via get_mirrored_playlists; watch via get_watchlist_artists) and either creates 'expired_download' findings or, with auto_delete on, deletes in-scan. Default OFF, both retentions default 'off'. Settings auto-render in the Library Maintenance panel (same as Cover Art / Lyrics / Re-tag). - delete_origin_download(): shared delete (resolve path → remove file → drop track row → drop history row); a file that won't delete keeps its row + reports. Used by auto mode AND the _fix_expired_download apply handler. - Frontend: type/action ('Delete')/result labels + finding detail render. Tests: 9 on the pure brain (windows, off, per-origin, protected, play-count threshold, bad age) + 7 on the job (no-op when off, findings, mirror/watch protection, auto-delete, delete helper missing/real file). 185 repair/origin tests pass.	1 week ago
BoulderBadgeDad	45badf588c	Blocklist Phase 2a: gate the download queue (playlist sync / album / discography) Phase 1 guarded the wishlist; Phase 2a closes the other auto-acquisition path. Playlist sync, album download, and discography backfill all flow through run_full_missing_tracks_process, which queues missing tracks at one point — right where the explicit-content filter already drops tracks. The blocklist filter slots in beside it: each missing track is checked and a banned artist/album/track is dropped before queueing (logged with a count), so a blocked item can't slip in via these flows. Same brain as Phase 1: the wishlist guard's matcher is generalized to db.blocklist_reason_for_track(profile_id, track_data, source=None) — the new `source` param lets the queue path supply the batch source, since an analysis track dict may not carry a 'provider' field (artists still match by name fallback regardless). One method, two callers (wishlist + queue), one cascade. Manual single-track downloads (/api/download, candidate picker, redownload) are deliberately NOT gated here — that's Phase 2b, pending a block-vs-warn-vs- override policy decision. Tests: source-fallback isolation (album id-only proves source drives the ID match; artist name still matches sourceless), and a queue-filter simulation mirroring master.py. 35 blocklist tests pass (the only failures in the download family are the pre-existing soundcloud /app ones).	1 week ago
BoulderBadgeDad	43c798a76e	Blocklist Phase 1 (backend): artist/album/track bans enforced at the wishlist chokepoint A proper artist/album/track blacklist (distinct from download_blacklist, which stays untouched). ID-keyed across metadata sources so a ban survives a source switch; profile-scoped; cascade artist→album→track. - core/blocklist/matching.py — pure decision core (no I/O): build an index from rows, candidate_block_reason() walks track→album→artist. Same-source ID match is primary; artist NAME is a fallback (covers the ID-backfill window); albums/tracks are ID-only (common titles like "Greatest Hits" must not false-positive across artists). Source-isolated so a numeric Deezer id can't collide with a numeric iTunes id of a different entity. - DB: new `blocklist` table (profile_id, entity_type, name, 4 source-id cols, match_status) + CRUD, match-row fetch, backfill-pending query, id-backfill update (COALESCE — fills NULLs only). - Guard: _wishlist_blocklist_reason at the top of add_to_wishlist — every auto-acquisition path funnels through it, so one check covers watchlist, discography backfill, repair, manual add. Fails OPEN (a guard error never blocks a legitimate add). - Discovery unified IN: legacy discovery_artist_blacklist is migrated into the blocklist on upgrade (replicated to every profile so no global ban silently stops working; idempotent; legacy table kept for rollback). Discovery reads (hero + personalized-playlist SQL) now union the blocklist, so a new-modal ban filters discovery too. Tests: 13 on the pure matcher (cascade, id-vs-name rules, source isolation, precedence) + 10 on the DB/guard (CRUD, profile isolation, dedup, backfill, end-to-end wishlist refusal + cascade + the discovery migration upgrade path). 50 blocklist/personalized tests pass.	1 week ago
BoulderBadgeDad	58df4632c4	Watchlist: repair iTunes ids that are actually Deezer ids (the `37725457` corruption, proven live) `37725457` fixed _match_to_itunes to use the real iTunes client and flagged the cross-source corruption as a possibility. Boulder's live DB proves it happened: 6 of his 9 watchlist "iTunes" ids EQUAL the artist's Deezer id (Taylor Swift's "iTunes" id was her Deezer id 12246; the real one is 159260351) — written back when the misnamed MetadataService.itunes slot held a DeezerClient. The June-4 batch (Green Day, SOAD, Vulfpeck, ...) got NULL instead because the slot now holds the Spotify primary. The fix alone can't heal those rows: the backfill only fills EMPTY ids, so a wrong non-empty id is permanent. New migration clears itunes_artist_id where it equals deezer_artist_id (the corruption signature — distinct id spaces, so a legitimate equal pair is effectively impossible, and the worst case is a NULL that re-matches correctly on the next scan). Idempotent by construction; similar_artists checked clean (its backfill always used the registry correctly). Tests: corrupted row cleared / legit + no-deezer rows kept / idempotent — via a real re-init with the per-process init memo cleared (an app restart).	1 week ago
BoulderBadgeDad	f250eaa228	#808 : album-context qualifiers stop blocking library-presence matching carlosjfcasero: 'Champagne Supernova (OurVinyl Sessions)' is in the library but the artist page shows it unowned and wishlist cleanup never removes it. Measured with the real catalogs: Deezer/iTunes title the TRACK with the qualifier while the library track is bare (the qualifier lives in the album title) — and _calculate_track_confidence crushed that pair to ~0.17: the "clean" titles keep parenthetical words, so the length-ratio penalty treats 'Champagne Supernova' vs 'Champagne Supernova (OurVinyl Sessions)' as different songs. (Also confirmed: the OurVinyl release is absent from Deezer's discography for the artist, so the standard page's 25-release list not showing it is the source catalog, not a bug.) Fix 1 — core.text.title_match.strip_redundant_context_qualifiers: a parenthetical qualifier whose text appears (word-bounded) in the db track's ALBUM title — or in the other title — restates release context and is stripped for a comparison variant scored with its own length guard. Genuine version markers keep their penalty: '(Live)' on a studio album appears in no context and still blocks; '(Live)' on 'Live at Wembley' correctly matches — owning the live album IS owning the live cut. Wired into _calculate_track_confidence, so every check_track_exists consumer (wishlist cleanup, discography dedup, repair jobs) benefits. Fix 2 — the artist-page ownership endpoint's album gate: when album-aware narrowing eliminates EVERY library candidate (the source's album naming just doesn't resemble the library's — 'Jillette Johnson \| OurVinyl Sessions' vs 'Champagne Supernova (OurVinyl Sessions)' ~0.5), fall back to artist-wide title matching instead of declaring everything unowned off a failed album-NAME comparison. Tests: 8 — the exact reported pair end-to-end through check_track_exists, word-boundary containment ('live' in 'alive' doesn't count), version-marker safety both ways, and prefix songs still blocked. 1125 matching/wishlist/ library tests pass.	1 week ago
BoulderBadgeDad	1f7834cc7b	Download Origins: see (and delete) exactly what watchlist + playlist syncs downloaded User ask: "a modal that lists the tracks downloaded via watchlist" — extended, as discussed, to playlists too. One modal, two tabs, opened from the Watchlist page (watchlist tab preselected) and the Sync page (playlists tab) — same shared-modal-different-entry-points UX as the rest of the app. The data: library_history recorded which SERVICE a file came from but never what TRIGGERED it. New origin/origin_context columns (migration + index) are written once at the import chokepoint via core/downloads/origin.py, a pure tested deriver that reads, in priority: an explicit _dl_origin stamp (set at batch-task creation for direct playlist batches, where the playlist context otherwise only survived in folder mode), the wishlist provenance already riding in track_info.source_info (watchlist_artist_name / playlist_name — watchlist_scanner has stamped these for ages), and the folder-mode playlist thread. Manual downloads stay unclassified by design. History starts from now — provenance can't be conjured retroactively. API: GET /api/download-origins?origin=watchlist\|playlist (paged) and POST /api/download-origins/delete — deletes the file on disk (resolved through the shared container/host path resolver), the matching library track row, and the history entries; a file that refuses deletion keeps its row and reports the error instead of lying. UI: webui/static/origin-history.js — tabbed modal in the revamp design language (accent light-edge, pill tabs, entry rows reusing the library-history-entry components), per-row delete + select-all bulk delete with honest result toasts, empty/loading states, per-tab totals. Tests: 8 — deriver priority/shapes (incl. the exact watchlist_scanner source_info shape and JSON-string survival), origin filtering + counts, row fetch/delete isolation between origins, delete-track-by-path.	1 week ago
BoulderBadgeDad	2d2ee34df8	#758 : a manual album match pins + locks the canonical version Users manually match an album to the regular edition, but enrichment/ repair keeps treating it as the deluxe (missing songs, renumbered tracks). Root cause: an album has TWO identities — the enrichment match (spotify_album_id, which manual-match sets and the worker already honors) and a SEPARATE canonical version pin (canonical_album_id, added by #777). The canonical pin is what track-number repair / reorganize / missing-track detection actually read, and library_manual_match never wrote it — so it was resolved independently and landed on the deluxe edition. (So #777 did NOT solve #758: it added canonical pinning, but manual matches didn't write the pin.) Fix: a manual ALBUM match on a canonical-recognised source now also pins AND locks the canonical version to the chosen release: - new canonical_locked column (same migration pattern as the other canonical cols). - set_album_canonical(..., locked=False) gains an atomic WHERE-clause guard: an auto write can't overwrite a locked pin; a manual write (locked=True) always wins. get_album_canonical exposes `locked`. - library_manual_match pins canonical for album matches via the pure should_pin_manual_canonical(entity_type, source). The auto resolve job already skips already-pinned albums, so the lock is protected on two fronts; the new guard also covers any future re-resolution. A new manual match still overrides. 18 tests: the pure gate (+ a sync-invariant test vs _ALBUM_ID_COLUMNS) and the DB lock seam (auto can't clobber a manual lock; manual overrides; auto-over-auto still works). Additive — locked defaults False, so the auto path is unchanged unless a manual lock exists. Full suite clean.	2 weeks ago
BoulderBadgeDad	83c1cd92aa	Auto-reconcile embedded IDs for new tracks on library scans Extends the manual "Import IDs from File Tags" backfill so newly-scanned files get their embedded provider IDs pulled into the DB automatically — no button press needed to keep up with new music. How it works: - insert_or_update_media_track now returns 'inserted' / 'updated' / False (truthy-compatible; existing `if track_success` callers unaffected) so the scan worker can tell a genuinely new row from an update. - DatabaseUpdateWorker collects the ids it newly INSERTED this run (self._new_track_ids) across all insert paths (Plex/Jellyfin/deep). - After run()/run_deep_scan(), web_server calls _reconcile_after_scan(), which gap-fills embedded IDs for just those new tracks. Runs as a post-scan pass (the scan loop itself is untouched/fast — the media server API never exposes these custom IDs, so the file must be read once regardless; batching at the end keeps it out of the hot loop and best-effort so it can never abort a scan). A progress phase ("Reading file tags for N new tracks…") surfaces the full-refresh tail. Shared engine: - New reconcile_library() in core does the paging + lazy parent-map loading (only loads albums/artists actually referenced — cheap when scoped to a few new tracks) + per-page commits. BOTH the manual button and the scan hook call it, so there's one tested orchestration, no duplication. The backfill job was refactored onto it. Same hardened safety: gap-fill only, atomically guarded against overwrite, schema-introspected, idempotent. Scoped to new arrivals for incremental/deep; full refresh re-inserts everything as new (recovering the IDs a full-refresh wipe destroys). +10 reconcile tests (reconcile_library scope/idempotency/progress/stop + the engine). Full suite clean (only pre-existing soundcloud /app env failures remain).	2 weeks ago
BoulderBadgeDad	55c9b52aee	Auto-repair duplicated source ids on startup (one-time migration) Ships the source-id cleanup to all users: a marker-gated one-time migration in MusicDatabase init clears any source id (deezer/spotify/itunes/musicbrainz/ discogs/audiodb/qobuz/tidal) shared across differently-named artists — the enrichment-corruption signature. Same-name cross-server duplicates are left untouched (DISTINCT-name check). Cleared rows re-derive correct ids on the next enrichment pass; the now name-guarded workers won't re-corrupt. Runs once (CREATE TABLE _source_id_dedupe_v1 marker), idempotent, per-column try/except so a missing column can't abort it. Test forces a re-run and asserts corruption is cleared while a legit same-name dup survives.	2 weeks ago
BoulderBadgeDad	3b155411c2	Fix #787 : Find & Add now records a durable manual match that survives a rescan Find & Add on the playlist-sync page only wrote sync_match_cache, which is DELETEd wholesale after every DB scan — so the source->library pairing (and the user's manual matches) reverted to 'extra'/red-dot on the next shallow scan. The three match stores (sync_match_cache, manual_library_track_matches, discovery extra_data) were disconnected and all pointed at tracks.id, which a rescan re-keys (esp. Jellyfin/Navidrome GUIDs). Unify the match so it's one durable fact, recorded once, honored everywhere: - Find & Add also writes a durable manual_library_track_matches row (one-way; the manual-match tool has no playlist to act on, so no reverse). Carries the library file path. - New library_file_path column (idempotent migration) + find_track_id_by_file_path: re-resolve a stale library_track_id after a rescan re-keys the track, and self-heal the row. - The sync compare display's override lookup now falls back to the durable manual match (resolve_durable_match_server_id) when sync_match_cache misses — so the pairing persists across a scan instead of reverting to a red dot. Purely additive: only adds matches when the cache returns nothing. Tests: durable resolver (valid / stale-reresolve+self-heal / no-match / not-in- playlist / missing-methods), file_path persistence + find_track_id_by_file_path.	2 weeks ago
BoulderBadgeDad	a977d28144	Fix #780 : Deezer/non-Spotify organize-by-playlist resolved the wrong row resolve_mirrored_playlist tried the mirrored-playlists primary key FIRST for any all-digit ref. Deezer upstream ids are all-numeric, so a Deezer playlist id was mistaken for the PK and the organize-by-playlist toggle resolved a wrong row (or nothing) — the toggle silently wouldn't save / 'Open in Mirrored' missed. Resolve by (source, source_playlist_id) first, fall back to PK only when the source lookup misses. Thread the batch/wishlist source through the download-path callers so numeric upstream ids resolve correctly there too. Spotify (base62 ids) is unaffected. Seam tests: numeric Deezer id resolves by source (not PK), spotify alphanumeric by source, PK fallback still works, profile-scoped, empty refs -> None.	2 weeks ago
BoulderBadgeDad	0353d365d6	Merge pull request #780 from kekkokk/feature/organize-by-playlist-library Fix organize-by-playlist: library registration, wishlist after failed downloads, and stale playlist cache	2 weeks ago
BoulderBadgeDad	f333607d76	Recommendations: explain WHICH of your artists drive each suggestion Adds get_recommendation_sources() — for each recommended similar artist it resolves the polymorphic similar_artists.source_artist_id back to the display names of the user's OWN artists (library + watchlist) that list it, by matching against every provider-id column on both tables. The /api/discover/similar-artists endpoint now attaches a 'because' array per recommendation so the UI can show 'because you have X, Y, Z' instead of just a count. Seam tests cover: library + watchlist resolution across different provider-id columns, dedup + name-sort, max_per cap, orphan source omission, profile scoping.	2 weeks ago
BoulderBadgeDad	89e3486e84	Similar Artists enrichment worker (MusicMap → match → store) for library artists Closes the gap where similar artists only existed for WATCHLIST artists: a new background worker populates them for the whole LIBRARY, slotting into the existing enrichment-worker pattern (bubble + Manage Enrichment Workers modal, status/pause/resume, matched/not_found/pending/errors). Per source-matched library artist → get_musicmap_similar_artists(name, 25) (the same matcher the artist-detail page uses: fetches MusicMap names, matches each to the user's source chain — primary + active fallbacks — returns only matched artists) → store via add_or_update_similar_artist keyed by the artist's metadata source id, the SAME key the watchlist scanner + artist map use, so the two cooperate (idempotent upsert + retry_days window). - core/similar_artists_worker.py: pure seams (pick_source_artist_id, map_payload_to_store_kwargs, process_artist) + the threaded worker; skips artists not yet source-matched; classifies not_found vs transient error (retry after 30d). - DB migration: similar_artists_match_status / _last_attempted on artists (mirrors every other source worker's tracking columns). - Registered in EnrichmentService + instantiated in web_server, DEFAULT-PAUSED (opt-in) like Amazon — MusicMap is scraped/outage-prone + this is library-wide. - SERVICE_ENTITY_SUPPORT['similar_artists']=('artist',) so the modal breakdown ('artists with / without similars') + Retry work; manual-match (inapplicable to a relationship) is gated out via relationship:true. - 10 seam tests; existing 80 enrichment tests still pass. Note: keys under profile 1 (single-profile setups); multi-profile is future work.	2 weeks ago
Francesco Durighetto	9ff2e7084a	Fix organize-by-playlist downloads: library entries, wishlist, and stale Spotify cache Persist organize_by_playlist on mirrored playlists and run playlist-folder downloads from the auto-sync pipeline instead of the global wishlist phase. Register SoulSync library rows after playlist-folder post-processing, route failed organize batches to the wishlist correctly, and skip sync-time unmatched wishlist only when organize download handles retries. Invalidate stale playlist track caches on refresh (Spotify and Deezer ARL), re-mirror on refetch, and improve standalone playlist modals (re-analysis, Open in Mirrored). Add filesystem missing-track detection and tests. Co-authored-by: Cursor <cursoragent@cursor.com>	2 weeks ago
BoulderBadgeDad	fc9a9f1c90	Enrichment manager v2: working retry + bulk retry-all-failed Fixes a correctness bug and adds bulk re-queuing. - Bug: per-row 'Retry' used clear-match, which sets an item to not_found with last_attempted=NULL. The worker only retries not_found items where last_attempted < (now - 30d), and 'NULL < cutoff' is false in SQLite, so those items were never re-queued. Fixed by resetting match_status to NULL (pending), which every worker's queue picks up on the next pass. - New POST /api/enrichment/<id>/retry with scope 'item' \| 'failed' (failed = re-queue every not_found item of an entity type), backed by a pure whitelisted build_reset_query + MusicDatabase.reset_enrichment(). - UI: per-row Retry now hits /retry; a 'Retry all failed' bulk button appears when the current entity has not-found items (confirm + count toast); a hint line explains retry/match/auto-retry behaviour. - 11 new tests (38 enrichment tests total, all green).	2 weeks ago
BoulderBadgeDad	0b3c3f656d	Add Manage Enrichment Workers modal (v1 + polish) Dashboard 'enrichment bubbles' could pause/hover but offered no way to manage a worker. This adds a full management modal opened from a new header button, covering all 11 enrichment sources. Backend (testable core helper + seam tests; no live-DB dependency): - core/enrichment/unmatched.py: pure, whitelisted SQL builders for the unmatched browser. service/entity validated against a support map (never interpolated raw); search + pagination bound as params; tracks join albums for artwork; limit capped at 200. - database/music_database.py: get_enrichment_unmatched() + get_enrichment_breakdown() (the breakdown splits matched/not_found/pending, which the existing get_stats().progress lumps together). - core/enrichment/api.py: GET /api/enrichment/<id>/{unmatched,breakdown} on the existing blueprint + a db_getter hook. - web_server.py: wire db_getter=get_database. - tests/enrichment/test_unmatched.py: 19 tests across builders, DB methods, and Flask routes. Frontend (vanilla, matches app conventions): - webui/static/enrichment-manager.js: worker rail with live status + coverage micro-bars, accent-themed detail panel (hero header, segmented matched/ not_found/pending stat cards, current item, pause/resume), and a searchable paginated unmatched browser with inline manual match (reusing search-service + manual-match) and retry (clear-match re-queues). - Polish: entrance/exit motion, scroll-lock, Escape, refresh control, flicker-free polling (in-place updates), skeleton loaders, relative timestamps, per-worker accent theming, real dashboard logos reused at runtime (with the same invert/circle treatment), responsive rail. - index.html: header button + script include. style.css: full styling. Reuses existing pause/resume, status, and manual search+assign endpoints. Backend tests green (19 new + 11 existing enrichment tests).	2 weeks ago
BoulderBadgeDad	f37bc34082	Canonical album version — Stage 2 (core): resolver + persistence (dormant) Turns the Stage-1 scorer into an end-to-end resolver + persists the result. Still DORMANT — no consumer reads it yet, so zero behavior change. - core/metadata/canonical_resolver.py — resolve_canonical_for_album(): builds candidate releases from the album's per-source IDs (in source-priority order), fetches each tracklist via an INJECTED fetch_tracklist (so it's unit-testable without live APIs), scores them with pick_canonical_release, and returns the best-fit {source, album_id, score}. Skips sources with no id / failed fetch; returns None when there are no files, no candidates, or nothing clears the confidence floor. - database/music_database.py — set_album_canonical() / get_album_canonical() write/read the Stage-1 columns. get returns None when unresolved, which every consumer will treat as "fall back to today's behavior". Tests: tests/test_canonical_resolver.py (7) — best-fit beats priority, priority breaks true ties, skips missing-id/failed-fetch sources, None on no-candidates/no-files/below-floor, score rounding. tests/test_canonical_db.py (4) — set/get round-trip incl. timestamp, unresolved -> None, overwrite, missing-album -> False. 34 canonical + DB-migration tests pass. Remaining for Stage 2 (the trigger): read on-disk file durations/titles for an album, gather its source IDs, call the resolver, store — wired via a backfill repair job + an enrichment hook. Then Stages 3-4 wire the Reorganizer and Track Number Repair to READ the pinned canonical.	2 weeks ago
BoulderBadgeDad	818c4f0bff	Canonical album version — Stage 1: schema + pure scorer (dormant) First stage of the canonical-album-version fix (#765 + #767-Bug2). Pins ONE canonical (source, album_id) per album, chosen by best-fit to the user's actual files, so the Reorganizer, Track Number Repair, and tagging stop re-resolving independently and contradicting each other. Ships DORMANT — nothing reads or writes the new data yet, so zero behavior change. Later stages populate (Stage 2) and consume (Stages 3-4) it. - core/metadata/canonical_version.py — pure scorer (the testable heart): score_release_against_files() rates a candidate release by track-count fit + duration alignment (greedy nearest within ±3s) + title overlap, dropping and renormalizing missing signals so it never crashes on sparse metadata. pick_canonical_release() takes candidates in source-priority order, picks the best fit, breaks ties toward the earlier (higher-priority) candidate so the choice is DETERMINISTIC — that determinism is what makes every tool agree (#765), while count/duration fit picks the right EDITION (#767-Bug2). A confidence floor (default 0.5) means a low-confidence guess is never pinned. - database/music_database.py — additive, nullable columns on albums (canonical_source / canonical_album_id / canonical_score / canonical_resolved_at), guarded by the existing PRAGMA-table_info pattern. NULL = unresolved = every consumer falls back to today's behavior. Tests: tests/test_canonical_version.py (11) — edition discrimination (11 files -> standard, 17 -> deluxe), deterministic priority tiebreak, duration disambiguation on count ties, graceful degradation (no durations / counts only / fuzzy titles), confidence floor, empty-input safety. tests/test_canonical_ columns_migration.py (4) — fresh DB has the columns, they're nullable w/ NULL default, migration is idempotent, and it ALTERs them onto an old albums table. 60 DB/schema regression tests still pass.	2 weeks ago
BoulderBadgeDad	174513d351	Fix #769 : playlist sync matched wrong same-artist track with high confidence Tracks NOT in the library were matched to a DIFFERENT song by the SAME artist and reported with high confidence instead of as missing — e.g. "Dani California" -> "Californication" (Red Hot Chili Peppers), "Under The Bridge" -> "Around the World". Root cause: _calculate_track_confidence scores 0.5title + 0.5artist. A same-artist comparison always yields artist = 1.0, so the title score is the only thing that can tell two of an artist's songs apart — but that score is a SequenceMatcher CHARACTER ratio, which over-credits unrelated titles that share a long substring ("californi…" = 0.67) or just a stopword ("the" = 0.62). With the flat 0.5 artist term, anything clearing the weak 0.6 char floor lands at ~0.81-0.83, well over the 0.7 sync threshold. Reproduced on dev: both reported pairs score 0.81/0.83. Fix: new core/text/title_match.py:titles_plausibly_same, called in _calculate_track_confidence right before the floor. It accepts a pair only when it's near-identical char-wise (>=0.85, so typos / punctuation / casing like "Beleive"->"Believe", "HUMBLE."->"Humble" still match) OR the titles share at least one significant (non-stopword) word. Two different songs by the same artist share no content word, so they're rejected and the real track is correctly reported missing. ("the" is a stopword — that's what leaked "Under The Bridge"/"Around the World".) Scoped deliberately: the word-overlap test fires ONLY when at least one side has 2+ content words. For single-word titles there is no other word to share, so it defers to the existing char floor — otherwise legitimate stylized spellings ("Grey"/"Gray", "Tonite"/"Tonight", "4ever"/"Forever") would become new false-negatives. Verified those still match. The few single-word variants that do score low (Ok/Okay, Thru/Through) were already rejected by the pre-existing length-ratio penalty, not by this gate. Both reported false positives now score 0.33/0.31 -> missing. Does NOT address the harder case of two different same-artist songs that DO share a content word (e.g. "Believe"/"Believer") — pre-existing and unworsened. Any residual error fails safe: a false-missing is re-downloaded/wishlisted, vs the old behavior which silently substituted the wrong song. Tests: tests/test_title_match_guard.py (14) — pure-guard unit tests + a 13-pair battery driving the REAL _calculate_track_confidence (genuine matches stay >=0.7, same-artist different songs drop below), plus an explicit no-regression test for stylized single-word spellings. 292 matching/sync tests pass.	2 weeks ago
BoulderBadgeDad	efe3895d5d	Fix: metadata cache tables silently missing after DB recovery (stale migration marker) Nothing was landing in the metadata cache browser because the metadata_cache_entities / metadata_cache_searches tables did not exist, so every cache write no-op-ed. Root cause: _add_metadata_cache_tables short-circuited on a marker-only guard (if the metadata_cache_v1 marker row exists, return). After a DB corruption-recovery the small metadata table (with the marker) survived but the large cache tables did not, so the stale marker permanently blocked the idempotent CREATE TABLE IF NOT EXISTS and the cache was dead forever. Guard now skips only when the marker is set AND the tables actually exist, so a stale marker self-heals: the tables are re-created on the next init. Tests: marker present but tables dropped -> re-created; marker + tables present -> no-op (idempotent).	2 weeks ago
BoulderBadgeDad	ce9ec3f6f4	Manual library match: accept non-numeric library track ids (#754 ) The save endpoint coerced library_track_id with int(), which rejected every non-numeric id with "Invalid library track id". Library ids are str(ratingKey) — numeric for Plex but GUIDs/hashes for Navidrome, Jellyfin, and other Subsonic servers — and are stored in the TEXT tracks.id column, so the coercion broke manual matching on every non-Plex server. Replace the int() coercion with a normalize_library_track_id() helper that trims and rejects only empty input, passing the opaque string id straight through. Plex numeric ids are unaffected (SQLite INTEGER affinity still stores a clean numeric string as an int, so existing matches are byte-identical) and no schema migration is needed (the INTEGER column already stores non-numeric ids as text). Tests: pure-helper cases (numeric/GUID/whitespace/empty) plus a real-DB round-trip proving a GUID id saves, reads back unchanged, and enriches.	2 weeks ago
BoulderBadgeDad	bf2a2ca928	Player: log SoulSync web-player plays (recently-played + smart-radio recency) listening_history was populated ONLY from the media server; the web player recorded nothing. Now a play heard ~10s logs to listening_history AND bumps tracks.play_count/last_played — so the existing 'recently played' query reflects actual SoulSync listening, and the Phase-2 smart-radio recency signal gets real data. - core/playback/play_log.build_play_event(): pure, DB-agnostic normalizer from player payload -> listening_history event shape. Caller supplies the timestamp (stays pure). Composite/streamed ids never become the int db_track_id; bool ids rejected; missing title -> skip. 9 unit tests. - MusicDatabase.record_web_player_play(): inserts the history row + increments play_count/last_played for the library track in one call. - /api/library/log-play: thin endpoint, server-side timestamp, best-effort (logging failure never 500s / never affects playback). - Frontend: npMaybeLogPlay on timeupdate fires once per track at the 10s threshold (flag reset in setTrackInfo, set-before-fetch so it can't double-fire), fully fire-and-forget. Pure builder is unit-tested; the DB write can't run in-sandbox (real DB throws) so it's a thin straightforward insert+update. JS + web_server parse clean.	2 weeks ago
BoulderBadgeDad	c3aea58b03	Player revamp Phase 2: smart radio ranking (play-count + popularity) Replaces radio's pure ORDER BY RANDOM() with weighted ranking. Each tier now fetches a generous random POOL (4x the needed count, floored) and core/radio/selection ranks it before the collector keeps the best: score_candidate = play_count(log-damped, w=1.0) + lastfm_playcount(log-damped, w=0.5) - recently_played penalty(w=2.0) + stable per-id jitter(w=1.0, hash-derived so runs vary but tests stay reproducible) Modest weights so popularity guides without burying lesser-played tracks, and jitter keeps radio from being identical every run. All intelligence is in pure functions (rank_candidates / score_candidate) so it's tunable + unit-testable without SQL. Defensive: the DB method probes PRAGMA table_info(tracks) and omits play_count/lastfm_playcount from the SELECT when absent (older DBs predating the listening-history migration) — the scorer treats missing signals as 0, so radio degrades to jitter-only instead of crashing on 'no such column'. Tests (tests/radio/, 43 total): - score_candidate / rank_candidates: deterministic unit coverage (popularity ordering, lastfm contribution, recency penalty, garbage→0, stable jitter). These CANNOT pass against pre-Phase-2 code. - DB end-to-end: ranking surfaces the heavily-played track first out of a decoy pool (wiring proof — probabilistic vs old random, documented honestly); plus a no-rank-columns DB proving the defensive degrade path. - All Phase-0a behavioral/refactor-equivalence tests still green. 60 radio + adjacent-DB tests pass; ruff clean.	2 weeks ago
BoulderBadgeDad	cbc001e283	Player revamp Phase 0a: extract radio selection into testable core/radio/ First step of the stream/player/radio revamp (see revamp_plan.md). The radio algorithm lived inline inside database.music_database.get_radio_tracks as raw SQL tangled with selection logic — untestable without a live DB (which also throws in the dev sandbox). Lifted the pure DECISIONS into core/radio/selection.py: - parse_tags / merge_tags — JSON-or-CSV tag fields → ordered deduped list - same_artist_cap — tier-1 30%-floored-at-5 cap - build_like_conditions — OR-of-LIKEs SQL fragment + params per tier - RadioCollector — dedup + cap + exclude-set + NOT-IN placeholder/value tracking The DB method keeps the cursor work and now delegates every decision to these helpers. Faithful extraction, not a rewrite — behavior unchanged. This is the kettui foundation move: radio is now unit-testable, so Phase 2 (smart ranking — play-count / recency / feature seeding) becomes 'evolve a tested function' instead of 'rewrite SQL and pray'. Tests (tests/radio/): - test_selection.py (22): unit coverage of every extracted helper - test_get_radio_tracks_db.py (7): drive the REAL get_radio_tracks against in-memory sqlite — tier fallback, dedup, exclude, file_path filter. Behavior-pinned: these 7 pass against BOTH old inline and new extracted code (refactor-equivalence proof). 52 adjacent DB+radio tests green.	2 weeks ago
BoulderBadgeDad	b55faff54b	DB: add schema_migrations ledger + PRAGMA user_version backstop Migration state was scattered across PRAGMA-table_info guards, sentinel marker tables (_genius_search_fix_applied, ...) and metadata-flag rows (id_columns_migrated, ...), with no single source of truth and no schema version — so a half-migrated DB was undetectable. Add a non-gating backstop: a schema_migrations(name, applied_at) ledger plus a _sync_migration_ledger pass (runs last in init) that back-fills the ledger from the existing signals and stamps PRAGMA user_version. ADDITIVE only — existing migrations keep their own idempotency gates; nothing decides whether a migration runs based on the ledger or the version. New one-time migrations call _record_migration (the genres migration already does). Tests: tests/test_db_migration_ledger.py — table exists, user_version stamped, record idempotent, genres recorded on fresh init, backfill from flag + marker, absent signals not recorded.	3 weeks ago
BoulderBadgeDad	c5b02c0026	DB: normalize legacy comma-separated genres to canonical JSON artists.genres / albums.genres stored EITHER a JSON array (new writes) OR a legacy comma-separated string (old writes), forcing every reader to try-JSON-then-split. Add a marker-gated one-time migration (_normalize_genres_to_json) that rewrites legacy rows to JSON in place, mirroring the readers' exact parse (JSON list, else comma-split/strip/ drop-empties) so genre VALUES are unchanged — only the storage format. Per-row diffed (already-canonical rows untouched, no churn) and non-fatal on error, consistent with the other migrations. Readers still tolerate both formats, so this breaks nothing; it just removes the dual-format debt. Tests: tests/test_db_genres_json_normalization.py — CSV->JSON, JSON-unchanged, whitespace/empties dropped, albums table, legacy-reader-equivalence, idempotent re-run, marker set on fresh init.	3 weeks ago
BoulderBadgeDad	2bb935b9d7	DB: stop watchlist_artists rebuilds from dropping amazon_artist_id amazon_artist_id is added to watchlist_artists via ALTER (music_database.py ~1732), but both table-rebuild migrations — the spotify_id-nullable fix (_fix_watchlist_spotify_id_nullable, two CREATE variants) and the profile-scoped UNIQUE rebuild — recreated the table from a hardcoded column list that omitted amazon_artist_id. Because shared_cols filters new_cols against the old table, the column and any stored Amazon artist IDs were silently dropped on every init (fresh OR upgraded), so Amazon watchlist IDs never persisted at all. Fix: add amazon_artist_id to all three rebuild CREATE schemas, both rebuild new_cols lists, and the base CREATE TABLE (so fresh installs are consistent and don't rely on the ALTER). Purely additive, column-named inserts + Row factory mean column position is irrelevant. Tests (tests/test_db_watchlist_amazon_id_migration.py): drive the real migrations via MusicDatabase() against a seeded pre-migration temp DB and assert the column + data survive; differential-proven to FAIL pre-fix.	3 weeks ago
BoulderBadgeDad	f7ed41867d	Fix: enhanced artist view 404s for library artists opened via source ID Opening a library artist from a non-library search result (e.g. a MusicBrainz hit) leaves the artist-detail page holding the source ID — the MBID — not the integer library PK. The standard /api/artist-detail route resolves that via find_library_artist_for_source, but the enhanced-view (`/api/library/artist/<id>/enhanced`) and quality-analysis endpoints call get_artist_full_detail directly with whatever ID the page holds. Its lookup was `WHERE id = ?` only, so it 404'd ("Artist with ID <mbid> not found") and the enhanced view failed to load. When the direct PK lookup misses, fall back to matching any per-service ID column, reusing SOURCE_ID_FIELD as the single source of truth so the resolution covers every source (MusicBrainz, Spotify, Deezer, iTunes, Discogs, Hydrabase, Amazon), not just MusicBrainz. Adds 4 isolated DB-method tests: direct PK still works, resolves by MBID, resolves by Spotify ID, and unknown IDs still 404.	3 weeks ago
Broque Thomas	96e6ba0ed7	Preserve Navidrome album cover art Expose Navidrome album coverArt as a Subsonic getCoverArt thumbnail so library refreshes keep a real album-art URL. Preserve existing album thumb_url when an incoming server album has no thumbnail, preventing manual or server-corrected covers from being cleared and later replaced by loose missing-cover searches. Add regression tests for Navidrome album thumbnails and DB thumb preservation.	3 weeks ago
Broque Thomas	dfdc6c6277	Restyle Auto-Sync manager and fix loading regressions Three problems wrapped into one pass on the Playlist Auto-Sync surface: 1. Visual: the manager modal had its own vibe (radial gradient, pill tabs, sky-blue chrome) that didn't line up with the rest of the app. Reworked the modal shell, KPI summary, live pipeline monitor, tab bar, schedule board sidebar, and column cards to use the standard SoulSync patterns — gradient `#1a1a1a → #121212`, accent-tinted 1px border, 20px radius, underline tabs, dense dark card pattern that Automations + Library pages already use. Modal now uses near-full screen so there's room for the schedule board without horizontal scroll pain. Run history cards followed the same path: slim horizontal row mirroring `.automation-card` plus an expanded detail that mirrors the Automations run-history modal (stats-grid + facts row + result pills + log section). 2. Hang: the previous SQL fix for the run-history "in library" count added `COLLATE NOCASE` on the join columns of `tracks` and `artists`. SQLite can't use `idx_artists_name` or `idx_tracks_title` when the comparison collation doesn't match the column collation, so the join did a full table scan per mirrored playlist track. ~18s per playlist × 30 playlists = `/api/mirrored-playlists` hung indefinitely and the modal stayed at "Loading schedule…" forever. Switched the join back to case-sensitive equality (~6ms per playlist, 3000× faster). Spotify names canonicalize to the same form as library imports so the recall loss is in the rounding error of pure case-only mismatches. 3. Slowness: even after the hang fix, each modal open spent ~1.5s gathering per-playlist status counts. The endpoint looped `get_mirrored_playlist_status_counts(playlist_id)` per row, which opened a fresh SQLite connection + PRAGMA setup each time. Added `get_all_mirrored_playlist_status_counts(profile_id)` which returns counts for every mirrored playlist owned by the active profile in 4 batched `GROUP BY` queries over a single connection. Modal load dropped to ~280ms. Also fixed: `tracks.artist` reference in `get_mirrored_playlist_status_counts` that never worked since the schema went relational — the query threw "no such column", got swallowed by the try/except, and the in-library count silently defaulted to 0 on every playlist. Rewired to join through `artists`. `get_mirrored_playlist_status_counts` (single-playlist) kept for callers that still want it, but the modal endpoint uses the batched version.	3 weeks ago
Broque Thomas	efdcde1892	Add playlist auto-sync run history Persist per-playlist pipeline run snapshots from the shared playlist pipeline, expose a history API, and upgrade the Auto-Sync modal with live pipeline monitoring, Run now controls, and a runs-style history tab.	3 weeks ago
Broque Thomas	9b086c5a65	Add owned_by column for Auto-Sync schedule ownership The Auto-Sync schedule board was detecting its own automations by checking `group_name === 'Playlist Auto-Sync' \|\| name.startsWith('Auto-Sync:')`. That's fragile — renaming the row from the Automations page silently hands ownership back to the read-only Automation Pipelines tab and the board stops managing it. This commit replaces the string convention with an explicit `automations.owned_by` TEXT column: - Migration `_add_automation_owned_by_column` adds the column and backfills `'auto_sync'` for existing rows that match the legacy `group_name`/`name`-prefix pattern, so users running the migration don't lose their schedules. - `database.create_automation` and `database.update_automation` accept `owned_by` (the latter via its `allowed` kwarg set). - `core/automation/api.py` forwards `owned_by` on both POST and PUT. Missing field is left as None, preserving today's behavior for every caller that doesn't opt in. - The Auto-Sync schedule board posts `owned_by: 'auto_sync'` and the detection helper now prefers that signal, falling back to the legacy name/group convention so any hand-rolled rows still show up. Tests: three new cases in `tests/automation/test_automation_api.py` covering create-with-owned-by, create-without (defaults to None), and update set/clear. The fake DB grew the matching kwarg.	3 weeks ago
Broque Thomas	feb6778af4	Address Cin review: extract helpers, indexed pool fetch, tidy nits Three changes folded into one perf+cleanup pass: 1. Indexed fast path for the per-artist pool fetch. The previous `search_tracks(artist=name)` call hit `unidecode_lower(artists.name) LIKE ?`, a function-in-WHERE that can't use `idx_artists_name`. New `MusicDatabase.get_artist_tracks_indexed` does a two-step lookup: exact-name match (indexed) plus a case-insensitive fallback, then `tracks WHERE artist_id IN (...)` via `idx_tracks_artist_id`. Drops per-artist fetch from seconds to milliseconds for the common case. The sync helper falls back to the old LIKE-based `search_tracks` only when the indexed lookup finds nothing, preserving diacritic recall and `tracks.track_artist` feature-artist matches with zero regression. 2. Public text-normalization helper. Lifted the body of `MusicDatabase._normalize_for_comparison` into `core/text/normalize.py:normalize_for_comparison` so callers outside the database layer (matching engine, sync pool, future import-side comparisons) don't reach across the module boundary into a leading-underscore "private" method. The DB method now delegates, so existing internal call sites stay untouched. Sync's lazy pool now imports the public helper. 3. Artist-name walker extracted. `_artist_name` at module level in `services/sync_service.py` replaces two near-identical inline str-or-dict-or-fallback walkers (one in `sync_playlist`, one in `_find_track_in_media_server`). Returns `''` for None instead of the literal string `'None'`. Plus three small tidies from the same review: - `_POOL_FETCH_LIMIT = 10000` constant in place of the literal at the pool-fetch call site. - Trimmed the verbose docstring + comment block on the pool helper. - Set-intersection predicate for the trigger-shape reset in `core/automation/api.py` instead of a two-line `or` chain. Also removed the duplicate `_get_active_media_client()` call at sync_service.py:212/214 — pre-existing wart that was sitting in the same block I was editing. Tests: 21 new tests across `tests/database/`, `tests/sync/`, and `tests/text/`, plus updates to the existing pool tests to cover the new fast/fallback split. Full suite stays green (3953 passing).	3 weeks ago

1 2 3 4 5 ...

320 Commits (31ebe96f76ada26847bf0bf4c8adabc70bb04f49)