🤖 AI Summary
This study investigates the universality and structural complexity of punctuation distribution in modern Chinese prose. Using corpora from three contemporary Chinese novels, we apply statistical modeling, discrete Weibull distribution fitting, Zipf’s law validation, and multifractal analysis. We first demonstrate that inter-punctuation distances for non-terminal punctuation strictly follow a discrete Weibull distribution—supporting cross-linguistic universality of punctuation patterns. In contrast, terminal-punctuation intervals significantly deviate from this distribution, reflecting the high syntactic variability and structural complexity inherent in Chinese sentence architecture. Furthermore, Gao Xingjian’s *Soul Mountain* exhibits pronounced multifractal characteristics, corroborating the role of elevated narrative freedom in shaping hierarchical textual structure. Collectively, these findings provide quantifiable, punctuation-based empirical evidence for syntactic complexity in written Chinese, advancing formal linguistic analysis of discourse-level structure.
📝 Abstract
Recent research shows that punctuation patterns in texts exhibit universal features across languages. Analysis of Western classical literature reveals that the distribution of spaces between punctuation marks aligns with a discrete Weibull distribution, typically used in survival analysis. By extending this analysis to Chinese literature represented here by three notable contemporary works, it is shown that Zipf's law applies to Chinese texts similarly to Western texts, where punctuation patterns also improve adherence to the law. Additionally, the distance distribution between punctuation marks in Chinese texts follows the Weibull model, though larger spacing is less frequent than in English translations. Sentence-ending punctuation, representing sentence length, diverges more from this pattern, reflecting greater flexibility in sentence length. This variability supports the formation of complex, multifractal sentence structures, particularly evident in Gao Xingjian's Soul Mountain. These findings demonstrate that both Chinese and Western texts share universal punctuation and word distribution patterns, underscoring their broad applicability across languages.