On the number of MUSs crossing a position

📅 2025-08-22
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper investigates the maximum number of minimal unique substrings (MUSs) crossing a specified position $i$ in a string. For a string $T$ of length $n$, we establish the first tight asymptotic bound $Theta(sqrt{n})$ on $|{ m MUS}(T,i)|$, resolving a long-standing gap between known upper and lower bounds. Our approach combines combinatorial string analysis with suffix array properties and a novel boundary construction technique. First, we prove that any position $i$ in any length-$n$ string is crossed by at most $O(sqrt{n})$ MUSs. Second, we explicitly construct an infinite family of strings in which some position is crossed by $Omega(sqrt{n})$ MUSs. The matching upper and lower bounds are both constructive and asymptotically tight, thereby determining the theoretical optimum for this problem.

Technology Category

Application Category

📝 Abstract
A string $w$ is said to be a minimal unique substring (MUS) of a string $T$ if $w$ occurs exactly once in $T$, and any proper substring of $w$ occurs at least twice in $T$. It is known that the number of MUSs in a string $T$ of length $n$ is at most $n$, and that the set $MUS(T)$ of all MUSs in $T$ can be computed in $O(n)$ time [Ilie and Smyth, 2011]. Let $MUS(T,i)$ denote the set of MUSs that contain a position $i$ in a string $T$. In this short paper, we present matching $Θ(sqrt{n})$ upper and lower bounds for the number $|MUS(T,i)|$ of MUSs containing a position $i$ in a string $T$ of length $n$.
Problem

Research questions and friction points this paper is trying to address.

Count MUSs crossing a specific string position
Establish tight bounds for MUSs at position i
Prove Theta(sqrt(n)) bounds for MUSs per index
Innovation

Methods, ideas, or system contributions that make the work stand out.

Establishes tight bounds for MUS count
Proves Theta(sqrt(n)) upper and lower bounds
Analyzes minimal unique substrings crossing positions
🔎 Similar Papers
No similar papers found.