Purpose - Akihito Kimura

「音程」ではなく「倍音」を聴いていた – 物理音響分析で解けた、私の耳の正体

私は「音程」を聴いているつもりで、実はずっと「倍音」を聴いていた。

この記事は、自分の耳に自信が持てないクリエイターや、感性と理論のズレに悩む方へ届いてほしい記録です。

昨年から「音を視る」シリーズや書籍執筆のため、自分の楽曲の物理音響データを分析してみると、私の耳は厳密な意味での音程ではなく、倍音も含めた認識が強いということに気づきました。

どの楽曲も明確な調性を持ちながら、12音の分布に大きな偏りがない…

私はこれまで音楽仲間たちに「半音なんだ」「微分音なんだ」と、ずっと主張してきましたが、あまり理解されませんでした。

ですが、ようやく腑に落ちました！！

「厳密な意味での音程ではなく、倍音も含めた認識」

これは、感覚的な言い回しではなく、物理音響的にも筋が通ります。

では、なぜ「音程」より「倍音」を聴いていると言えるのか？

ZCRが低い
（ZCR：音のザラつきやノイズ成分の指標）

→ ノイズ的なエッジより、周期性（＝倍音構造）を重視した音。
→ 音の輪郭よりも、中身の響きが安定している音。

Chromaが12音すべてに偏りなく分布
（Chroma：どの音階（ドレミ…）がどれくらい含まれているかの分布）

→ 特定の音高に「居座らない」。
→ でも調性は崩れていない。

つまり、「この音はCだ」「これは半音上だ」という点的な音程認識ではなくて、「この倍音の雲が、今どこに重心を持っているか」という場（フィールド）としての認識であることを意味します。

だから「半音」「微分音」の話が伝わらなかったのかもしれません。

音楽仲間の言語：

12平均律、ピアノロール、点としての「音程」。つまり「鍵盤座標」。

私の言語：

倍音の重なり。音色によるピッチ知覚のズレ。微妙な重心移動。

つまり、どうやら私が伝えようとしていたのは、音程の話に見せかけた「スペクトルの話」だったようです。

これは理解力の問題ではなく、そもそも「言語が違っていた」ということなのでしょう。

「明確な調性 × 12音の非偏重」は、かなり特殊で、普通は両立しません。これは、データを見たときに、私自身が一番驚いた点でもありました。

調性が明確

→ 主音・属音にエネルギーが集まりやすい

12音が均等

→ 無調・浮遊感になりやすい

私の楽曲は、調性は「方向」として存在しても、音高は「占拠」しない。確かに、Ⅰにリディアンを適用し11thを半音拡げたり、エオリアンの3rdを隠してオルタード化をする等、調性を曖昧にさせる私の意図的なスケールの拡大解釈は教科書通りではありません。

これは和声感ではなく、あたかも倍音場による重力で音楽を作っている感覚です。

私の楽曲が主音に居座らないにも関わらず、調性が維持される理由は、特定の杭（音程）を打つのではなく、倍音の密度によって、音楽全体に緩やかな傾斜（重力）がついている状態だからです。

これは、クラシックで言えば、

ドビュッシー後期
メシアンの一部
スペクトル楽派の入口

電子音楽的に言えば、

ピッチよりもスペクトル・モーフィング
音程＝結果、という発想

そして、今ようやく「自分の耳の正体」を言語化できたこと。これは、かなり大きな前進です。

これまで私は、感覚では確信があるのに、説明すると浮いてしまう。そんな「伝わらない」経験が積み重なる状態でした。

けれど今は、音程よりも倍音構造とスペクトル重心で音を捉えているのだと、自分で自分を誤解しなくて済む言葉を手に入れました。これは安心感が全く違います。

これまで私の楽曲は、周囲から「音が綺麗だ」と評価されることが多くありました。音楽の先生からは「シンセの音作りが上手いからだね」と言われてきました。ですが、自分の中では「良い感じのパッチを選んでいるだけなのに」という戸惑いがありました。

しかし、物理音響分析を経て気づいたのは、音色パッチを選ぶという行為は、単なる「手抜き」ではなく、膨大な選択肢から「理想的な倍音構造（スペクトル）」を持つ素材を射抜く「目利き」だったのかもしれません。

シンセのオシレーターやパラメーターをいじることは、いわば「食材を育てること」と言えるでしょう。一方、私の創作は、差し出された最高の食材（プリセット）をどう組み合わせれば、耳に心地よい「倍音のフルコース」になるかを設計すること。

「12音が大きな偏りもなく分布しているにも関わらず調性が崩れない」という私の楽曲の特徴は、12音体系の音高に最適化された、この「盛り付け（配置）」の絶妙なバランスによって成立していたのかもしれません。

点のメロディではなく、音色同士の倍音がカチッと噛み合う瞬間を探す作業。それは、スペクトルのパズルであり、音響的審美眼とも言えるでしょう。

ですから「イチから音を作れない」と悩む必要はなく、シンセの開発（土壌作り）にまで遡らなくても、目の前の音をどう響かせるかという「設計図」が頭にあるならば、それは立派なオリジナリティであると私は考えます。

今後、私が作る音そのものが変わるわけではありません。これからも、この「倍音の雲」を眺め、その重心を動かしながら、私のままで音を紡ぎ続けていくでしょう。

ただ、これからは自分の音を説明するために無理に「鍵盤の言葉」に翻訳する必要はないのかもしれません。私の耳が捉えている世界を、私自身が信じられるようになった。そのことが、これからの創作を、より迷いのない、自由なものに変えてくれる気がしています。

I Thought I Was Hearing Pitch – But I Was Really Hearing Overtones

Discovering the true nature of my ears through physical acoustic analysis

All this time, I believed I was listening to pitch.
In reality, I had been listening to overtones.

This is a personal record I hope will reach creators who lack confidence in their ears, or who struggle with the gap between intuition and theory.

While analyzing the physical acoustic data of my own music over the past year — for my “Visualizing Sound” series and a book project — I discovered something surprising: my ear is not strongly oriented toward pitch in the strict sense, but toward a perception that includes overtone structure.

Every piece clearly has tonality, yet the distribution of the 12 chromatic tones shows no strong bias.

For years I told fellow musicians, “It’s about semitones,” “It’s microtones,” but I was rarely understood.

Now, finally, it makes sense.

What I was experiencing wasn’t just a vague sensory metaphor — it is physically and acoustically consistent to describe it as perceiving sound through overtone structure rather than precise pitch positions.

Why I Can Say I’m Hearing Overtones Rather Than Pitch

Low ZCR (Zero Crossing Rate)
A measure related to noisiness or roughness in sound

→ My sounds prioritize periodicity (overtone structure) over noisy edges
→ The internal resonance of the sound is stable, rather than its outer contour

Chroma spread evenly across all 12 tones
A distribution showing how much each pitch class is present

→ The music doesn’t “sit” on specific pitch centers
→ Yet tonality does not collapse

This suggests that my perception isn’t point-based — not “this note is C” or “this is a semitone up.”
Instead, it’s more like sensing where the center of gravity lies within a cloud of overtones — a field rather than coordinates.

That may be why my talk of semitones and microtones never quite landed.

A Difference in Musical Language

My peers’ language:
12-tone equal temperament, piano rolls, pitch as discrete points — keyboard coordinates

My language:
Overtone overlap, timbre-induced pitch shifts, subtle movements of spectral center of gravity

In other words, what I was trying to describe as pitch was actually spectral behavior.

This wasn’t a failure of understanding — we were simply speaking different languages from the start.

“Clear Tonality × Even 12-Tone Distribution” – A Rare Combination

This was the most surprising discovery, even to me.

Normally:

Clear tonality
→ Energy concentrates on tonic and dominant

Even 12-tone distribution
→ Leads toward atonality or floating ambiguity

But in my music, tonality exists as a direction, while pitch classes do not become fixed territories.

Yes, I deliberately stretch scale interpretation beyond textbook rules — applying Lydian to the tonic and sharpening the 11th, hiding the Aeolian third to create altered colors, and so on. These choices blur tonal clarity on the surface.

Yet this doesn’t feel like harmony in the traditional sense.
It feels more like composing through gravitational forces within an overtone field.

Tonality is preserved not by hammering in pitch “stakes,” but by creating a gentle global tilt — a gravitational pull shaped by overtone density across the whole spectrum.

Musical and Sonic Lineage

In classical terms, this approach resonates with:

Late Debussy
Certain works of Messiaen
Early spectral music

In electronic music terms:

Spectral morphing over fixed pitch
Pitch as a result, not a starting point

Finally Being Able to Name My Own Ear

Being able to articulate this now feels like a major step forward.

For years, I had a strong intuitive certainty, yet whenever I tried to explain it, I felt out of place. The experience of “not being understood” kept piling up.

But now I have words that don’t misrepresent my own perception:
I understand that I hear sound through overtone structures and spectral centers of gravity more than through pitch positions.

The sense of relief is enormous.

About “Beautiful Sound” and Presets

People often describe my music as having “beautiful sound.”
Teachers used to say, “You’re good at synth sound design.”

But inside, I always felt confused — I’m just picking patches that feel right.

After analyzing the acoustics, I see it differently.

Choosing a timbre patch isn’t laziness — it may actually be the ability to identify, among countless possibilities, the material with the ideal overtone structure.

If tweaking oscillators and parameters is like growing ingredients, then my process is more like designing a full-course meal from the finest ingredients placed before me.

The defining feature of my music — balanced distribution across 12 tones without losing tonality — may come from this delicate “plating”: how sounds are arranged so their overtone structures interlock perfectly.

This isn’t melody as points in space.
It’s searching for moments where the overtones of different timbres click into place — a spectral puzzle, an acoustic sense of beauty.

So there’s no need to worry about not building sounds from scratch.
Even without going back to the “soil” of synth development, if you have a blueprint in your mind for how sounds should resonate together, that is a genuine form of originality.

Moving Forward

The sounds I create won’t suddenly change.

I will keep gazing at these clouds of overtones, gently shifting their centers of gravity, and weaving sound in my own way.

But I may no longer need to force my explanations into the “language of the keyboard.”
I’ve started to trust the world my ears actually perceive.

And that trust feels like it will make my future work freer, clearer, and far less hesitant.

Akihito Kimura (木村哲人)