DOI of the published article https://ieeexplore.ieee.org/document/11465217#:~:text=10.1109/ICIPTM69057.2026.11465217
MixSense: AI Optimization for Contiguous Music Segmentation at Scale
DOI: https://doi.org/10.31224/6162
Abstract
This paper casts long-form music stream segmentation as an AI optimization problem over a self-similarity manifold, unifying evolutionary search for parameter discovery with globally optimal dynamic-programming inference to recover contiguous boundaries consistent with a track-count prior or a data-driven estimate. Starting from Fourier-derived spectral embeddings, the method constructs cosine self-similarity and time-aware cost surfaces that encode symmetry, contiguity, and evolutionary stability, then solves for the minimum-cost partition without heuristic change-point thresholds. The pipeline is learning-free yet intelligent, leveraging search and global reasoning instead of supervised labels, and is stress-tested on a hand-annotated corpus exceeding 640 hours, with human-variance analysis to contextualize error and tolerance around true boundaries. Results show robust, scalable segmentation under both known and estimated segment counts, highlighting AI-style optimization as a powerful alternative to local novelty detectors and ad hoc rules in music structure recovery.
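To make the core idea concrete, the following is a minimal sketch (not the paper's implementation) of contiguous segmentation by dynamic programming over a cosine self-similarity matrix, given a known segment count k. The function names, the within-segment dissimilarity cost, and its length normalization are assumptions chosen for illustration; the time-aware cost surfaces and the evolutionary parameter search described in the abstract are not reproduced here.

```python
import numpy as np


def cosine_self_similarity(X):
    """Cosine similarity between every pair of frames in X (frames x dims)."""
    norms = np.linalg.norm(X, axis=1, keepdims=True)
    Xn = X / np.maximum(norms, 1e-12)  # guard against all-zero frames
    return Xn @ Xn.T


def segment_boundaries(S, k):
    """Split frames 0..n-1 into k contiguous segments of minimum total cost.

    Assumed cost of a segment [i, j): the sum of pairwise dissimilarities
    (1 - S) inside the segment, divided by its length. Returns the k - 1
    interior boundary indices (the start frame of segments 2..k).
    """
    n = S.shape[0]
    D = 1.0 - S

    # 2-D prefix sums so that any square block sum is an O(1) lookup:
    # P[a, b] = sum of D[:a, :b].
    P = np.zeros((n + 1, n + 1))
    P[1:, 1:] = D.cumsum(axis=0).cumsum(axis=1)

    def cost(i, j):
        block = P[j, j] - P[i, j] - P[j, i] + P[i, i]
        return block / (j - i)

    # dp[s, j] = minimum cost of covering frames [0, j) with s segments.
    dp = np.full((k + 1, n + 1), np.inf)
    back = np.zeros((k + 1, n + 1), dtype=int)
    dp[0, 0] = 0.0
    for s in range(1, k + 1):
        for j in range(s, n + 1):
            for i in range(s - 1, j):
                c = dp[s - 1, i] + cost(i, j)
                if c < dp[s, j]:
                    dp[s, j] = c
                    back[s, j] = i

    # Walk the back-pointers from the end to recover segment start frames.
    starts = []
    j = n
    for s in range(k, 0, -1):
        j = int(back[s, j])
        starts.append(j)
    return sorted(starts)[1:]  # drop the leading 0
```

This triple loop costs O(k n^2) time, so on long streams it would only be practical on downsampled embeddings. Because the optimum is taken over all contiguous partitions, no change-point threshold is needed, which is the property the abstract emphasizes.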
License
Copyright (c) 2026 Vipul Razdan

This work is licensed under a Creative Commons Attribution 4.0 International License.