Today, I will visualize “Samurai (momiji mix)”—a track from my 2019 album Ethereal 2—through three distinct layers: Sonic Texture (Cells), Tonal Framework (Bones), and Temporal Structure (Narrative).

Akihito Kimura / Samurai (momiji mix) A4=438Hz, Key of Am, 123BPM

Chapter 1: Sonic Texture (Cells)

Spectral / MFCC / RMS / ZCR

■ Timbre & Brightness

Spectral Centroid: ≈ 1,600 Hz

This index represents the “brightness” of the sound. A value of 1,600 Hz indicates a composed, well-rounded resonance with a rich midrange—neither muffled nor piercingly shrill.

Spectral Flatness: 25.1

This relatively high value suggests a texture rich in noise components and complex overtones (such as synth leads and thick backing tracks) rather than simple pure tones.

■ Mid-High Range Definition (Spectral Contrast)

A standout feature is the Spectral_Contrast_Mean_5 value of 52.95, which is exceptionally high compared to other bands (≈ 18–20).

  • Analysis: This signifies an extreme contrast between “active sound” and “silence” in the frequencies around 4kHz.
  • Interpretation: It suggests that high-frequency elements—such as sharp percussion or the aggressive attack of the Koto (plucked string)—stand out clearly without being buried in reverb. This points to a high-resolution mix with well-organized sonic “gaps.”

■ The Sharpness of the Koto “Tsume” and Spectral Contrast

The abnormally high Spectral_Contrast_Mean_5 (52.95) is likely driven by the “crisp plucking sound” of the main Koto riff.

  • Analysis: High contrast in this band indicates a sharp distinction between the moment a note is struck and the moment it vanishes.
  • Interpretation: Within the 123 BPM rhythm, the attack of the Koto “tsume” (pick) striking the strings remains prominent, never washed away by muddy reverb. By separating “silence” from “sound” in this range while maintaining a thick low end, the mix achieves a focused core without any blurring.

■ A “Grounded” Acoustic Balance (MFCC Analysis)

The MFCC_Mean_2 shows a high positive value of 88.80, indicating a spectral slope that leans toward the lower frequencies. The high end is not overbearing; instead, the foundation of lows and low-mids (bass and drum resonance) provides the track’s persuasive power.

  • Analysis: While the Koto is typically a “bright” instrument in the mid-high range, this data shows the track’s overall center of gravity is quite low.
  • Interpretation: Combined with the solemnity of the A minor key and the calm 438Hz tuning, the grounded bassline and kick drum prevent the Koto’s brilliance from feeling “flighty.” This imbues the “Samurai” theme with weight and resolve rather than mere lightheartedness.

■ Energy & Dynamics

RMS (Sound Pressure) Composition: RMS_Harmonic_Mean (0.18) exceeds RMS_Percussive_Mean (0.06). While percussion creates the “onsets” (triggers) of the rhythm, the actual foundation of the track’s energy is sustained by harmonic components like bass, pads, and melodies.

■ The Aesthetics of Decay (RMS & Zero Crossing Rate)

The Zero_Crossing_Rate_Mean is notably low at 0.073.

  • Analysis: This indicates a very clean waveform with minimal distortion or noise components.
  • Interpretation: This reflects a focus on the “beauty of resonance” as the Koto notes fade away. It’s a luxurious balance of “Japanese Minimalism”—utilizing space rather than filling it with noise—coexisting with a bass-heavy mix.

Chapter 2: Tonal Framework (Bones)

Key Strength / Tonnetz / Am–F Duality

■ Harmony & Chroma Features

Scale IndexPitch ClassMean IntensityVariation (Std)Interpretation
Chroma 1C0.5710.302Minor 3rd of Am / C Major component
Chroma 2C#0.3030.206
Chroma 3D0.2800.244
Chroma 4D#0.3180.228
Chroma 5E0.5530.304Perfect 5th of Am
Chroma 6F0.4470.306
Chroma 7F#0.4290.315
Chroma 8G0.5130.2897th of Am (Natural Minor)
Chroma 9G#0.4170.254
Chroma 10A0.4980.330Root Note
Chroma 11A#0.3900.248
Chroma 12B0.6010.2962nd of Am / Adds tension

The Chroma vector reveals strong presence in C / B, E, and G, indicating a chord progression with transparency and stability based in A minor.

■ The “Depth” of the 438Hz Setting

The tuning, slightly lower than the standard 440Hz, is reflected in the variance of the Chroma data.

  • Analysis: The values for Chroma 12 (B) and Chroma 1 (C) are nearly equally high.
  • Interpretation: By tuning to 438Hz, the notes “lean” toward the adjacent semitone below in standard pitch detection. This creates a data-level “ambiguous shimmer.” This choice isn’t just a preference; it is etched into the data as a subtle fluctuation, likely evoking the “archaic weight” and “melancholy” associated with the Samurai theme.

■ Absolute Tonal Stability (Key Strength Analysis)

The average minor key strength across the entire duration identifies A minor (Am) as the absolute axis.

Estimated Key (Minor)Average StrengthMusical Interpretation
A minor21.35The absolute axis that dominates the composition.
E minor20.84Dominant of Am. Reinforces the Koto melody and harmonic components.
D minor20.68Subdominant of Am. A component that provides expansiveness to the track.
  • A minor (21.35): The dominant pillar of the track.
  • E minor (20.84): The dominant of Am; reinforces Koto melodies and harmonies.
  • D minor (20.68): The subdominant; adds expansiveness.

The narrow margin between A, E, and D signifies that the Pentatonic sound (specifically “La-Re-Mi”), common in Koto music, is clearly captured in the data.

■ Global Key Analysis: Coexistence of Duality

In the overall key estimation, Am and F Major are neck-and-neck (0.97 vs 0.96).

RankEstimated KeyStrengthMusical Interpretation
1A4 minor0.978The soul of the track. Stoic and quiet resolve.
2F4 Major0.962The color of the track. Sentimentality and expansiveness (The “Momiji” scenery).
3C4 Major0.958Sense of stability. Resonates as the relative major of Am.

The tension between Am (Minor) and F (Major) speaks to the emotional depth of the composition.

■ A Dialogue between “Silent Am” and “Expansive F Major”

The section analysis shows a structure that frequently oscillates between Am and F Major.

  • Am (Tonic): Stability, evoking the discipline and quiet determination of a warrior.
  • F Major (Submediant): A moment where the field of vision suddenly opens up, and emotions overflow. This loop creates a dramatic, emotional progression rather than a simple “dark” track.

■ Tonnetz Analysis: The “Distance” between Tradition and Modernity

The Tonnetz data (geometric relationships between notes) shows how “close” or “far” harmonic changes are.

  • Analysis: Tonnetz Dim 0 & 1 (Circle of Fifths relationships) remain stable.
  • Interpretation: Shifts between Am, F Major, and G minor are “closely related” in music theory, making the transitions very smooth. Despite “edgy” elements like the 438Hz tuning and sharp attacks, the track maintains a dignified, accessible harmonic sense.

■ “Decisiveness” in the Tonnetz

The standard deviation of the tonnetz_dim_0 to 5 values is a very low 0.10.

  • Analysis: Harmonic movement stays within a specific geometric range. Even when it sounds like a modulation, it is geometrically stationary. The story is driven by internal tension rather than flashy key changes.
  • Interpretation: This visualizes a “stoic spirituality,” building a narrative through the addition and subtraction of timbre and rhythm within the single universe of A minor.

Chapter 3: Temporal Narrative (Story)

Section Structure / Similarity / Harmonic Change × Onset

■ Rhythm & Tempo

  • The Fluctuation in Tempo Detection: Interestingly, while the DAW was set to 123 BPM, the analysis detected 125 BPM.
  • Interpretation: The algorithm likely sensed a “driving” feel (playing ahead of the beat) or the abundance of 16th-note grace notes and ghost notes. It reflects the contrast between “stillness and motion”—moving instantly from a standstill, much like a Samurai.
  • Percussive Momentum: Onset_Percussive_Mean (1.92) is significantly higher than Harmonic_Mean (0.99), suggesting that drums and high-attack percussion drive the rhythm more than melodic transitions.

■ Harmonic Density and Narrative Flux

KeyAverage Harmonic DensityRole Inference
E minor / G minor3.0 – 3.1 (High)Tension, conflict, bridge sections.
A minor / F Major2.7 – 2.9 (Standard)Main theme, stability.
  • E minor / G minor (Density 3.0–3.1): High density. Represents tension, conflict, or “bridge” sections.
  • A minor / F Major (Density 2.7–2.9): Standard density. Main theme and stability.
  • Interpretation: The high-density sections likely depict “hesitation” or “intense conflict,” acting as bridges that build the listener’s anticipation for a resolution.

■ Scene Transitions via Harmonic Change

By identifying points where harmonic_change spikes (e.g., the shift to G minor at 62s), we can visualize where the “scenes” change.

  • Interpretation: These spikes represent “the moment the sword is drawn” or “the moment the landscape shifts from autumn to winter.” Combined with onset_strength, it proves that rhythm and harmony are perfectly synchronized to keep the listener engaged.

■ Synchronization of Rhythm and Harmony

There is a tight correlation between onset_strength (rhythmic impact) and harmonic_change.

  • Interpretation: Rather than mechanical MIDI coldness, the “humanized” recreation of a string instrument’s raw attack drives the groove of the track.

■ A 6-Minute 14-Second “Long Story”

  • Start to Middle: Onset_strength increases and decreases periodically, like waves crashing.
  • Toward the End: The amplitude of harmonic_change shifts, suggesting a movement toward either “silence” or a “tense finale.”

■ Section Cluster Analysis (Summary)

Estimated Clusters: 10

The story shifts from “The Blade” (Am) to “Maple Leaves” (F Major) to “Shadows” (Gm), a progression mathematically supported by key selection and density.

Section Definitions (Characteristics of Each Cluster)

SectionKeyStrengthHarmonic Density
0F4 Major0.9207952.434925
1F4 Major0.9457883.125001
2F4 Major0.9649873.304865
3A4 minor0.9446913.040811
4A#4 Major0.9199842.797080
5A4 minor0.9433702.467320
6G4 minor0.9754483.100542
7G4 minor0.9041422.865894
8E4 minor0.9300073.155316
9E4 minor0.9408683.164314

Ten clusters (S0–S9) each fulfill a distinct role within the track.

SectionPrimary KeyDensityRole Inference
S5 / S3Am2.4 – 3.0“Stillness / The Blade”: A sharp, stoic part where the main Koto riff stands out.
S0 / S1 / S2F Major2.4 – 3.3“Motion / Maple Leaves”: A part where harmonic expansiveness increases and the scenery unfolds.
S6 / S7Gm2.8 – 3.1“Tension / Shadows”: A slightly unsettling and dramatic development, as if shadows are momentarily falling.
S8 / S9Em3.15“Convergence toward the Finale”: An E minor resonance acting as the dominant of Am. Increase in density toward the end.
  • S5 / S3 (Am): “Stillness / The Blade.” Stoic parts where the Koto riff shines.
  • S0 / S1 / S2 (F Major): “Motion / Maple Leaves.” Expansive harmony and scenery.
  • S6 / S7 (Gm): “Tension / Shadows.” Dramatic, slightly ominous developments.

Section Time Ranges (Chronology)

No.Section (Cluster)StartTime (s)EndTime (s)
050.00000023.440544
1023.44054431.254059
2531.25405939.055964
3039.05596448.320726
4548.32072654.659773
5054.65977362.461678
6662.46167869.288345
7569.28834569.775964
8669.77596477.101859
9277.10185977.589478
10677.58947878.077098
11578.07709886.378231
12486.37823193.680907
13693.680907108.808707
147108.808707109.284717
155109.284717117.098231
160117.098231124.900136
175124.900136132.702041
180132.702041140.503946
192140.503946140.991565
203140.991565148.305850
211148.305850156.119365
222156.119365156.606984
233156.606984163.921270
241163.921270171.723175
252171.723175202.454785
261202.454785202.942404
272202.942404203.430023
283203.430023210.744308
291210.744308218.557823
302218.557823219.045442
313219.045442226.359728
321226.359728232.211156
332232.211156232.698776
349232.698776233.186395
358233.186395233.674014
362233.674014234.649252
373234.649252241.975147
381241.975147249.777052
392249.777052250.264671
403250.264671257.578957
411257.578957265.380862
422265.380862296.112472
431296.112472296.600091
443296.600091304.413605
451304.413605312.215510
462312.215510312.703129
473312.703129320.017415
481320.017415327.819320
492327.819320343.434739
505343.434739374.634127

The data density vividly illustrates the track’s dramatic arc, particularly through the rapid developments around the 140s mark (No. 19–33) and the extended outro starting at 343s (No. 50).

■ Structural Timeline Analysis

Within the 374-second journey, we observe a very compelling progression:

  • 0s ~ 62s (Opening): The track alternates between S5 (Am) and S0 (F Major). This serves as an introduction that begins in silence and gradually reveals the landscape.
  • 62s ~ 108s (Mid-track Turbulence): S6 (Gm) and S4 (A# Major) suddenly appear. Given the high Similarity (Sim) of 0.8–0.9, this suggests an evolution where new layers are added to existing sounds to heighten tension, rather than an abrupt change.
  • 140s ~ 327s (Mid-to-Late Loop Development): S1, S2, and S3 repeat with an extremely high Similarity of 0.966. This can be considered the “peak groove zone,” where subtle additions and subtractions (dub-style arrangements or improvisational Koto variations) occur within a perfected groove.
  • 343s ~ End (Conclusion): The track returns to S5 (Am), concluding with a sense of resolute stability.
No.TransitionSimilarity (Sim)
1S5 → S00.737
2S0 → S50.737
3S5 → S00.737
4S0 → S50.737
5S5 → S00.737
6S0 → S60.793
7S6 → S50.836
8S5 → S60.836
9S6 → S20.961
10S2 → S60.961
11S6 → S50.836
12S5 → S40.803
13S4 → S60.887
14S6 → S70.908
15S7 → S50.875
16S5 → S00.737
17S0 → S50.737
18S5 → S00.737
19S0 → S20.886
20S2 → S30.916
21S3 → S10.827
22S1 → S20.966
23S2 → S30.916
24S3 → S10.827
25S1 → S20.966
26S2 → S10.966
27S1 → S20.966
28S2 → S30.916
29S3 → S10.827
30S1 → S20.966
31S2 → S30.916
32S3 → S10.827
33S1 → S20.966
34S2 → S90.934
35S9 → S80.999
36S8 → S20.934
37S2 → S30.916
38S3 → S10.827
39S1 → S20.966
40S2 → S30.916
41S3 → S10.827
42S1 → S20.966
43S2 → S10.966
44S1 → S30.827
45S3 → S10.827
46S1 → S20.966
47S2 → S30.916
48S3 → S10.827
49S1 → S20.966
50S2 → S50.869

The consecutive high values in the latter half (from No. 21 onwards), such as 0.966 and 0.999 (S9→S8), confirm a structure with extremely high harmonic consistency. This allows for an engaging experience where listeners can enjoy the subtle timbral shifts and rhythmic nuances while the core foundation remains rock-solid.


Summary: The Track Profile Revealed by Analysis

“Samurai (momiji mix)” achieves a listener-friendly texture that avoids over-emphasized high frequencies while maintaining a pleasant 123 BPM beat. While the percussion drives the rhythm, the core is filled with rich harmonies (synths and instrumental resonances). The analysis suggests a track that conveys a “robust core strength” rather than mere flashiness.

The main Koto riff exhibits a unique combination of a “sharp attack” and “elegant decay.” The deliberate choice to lean the mix toward the low end manifests in the data in a very compelling way. The track essentially bonds “modern sub-bass power” with the “traditional delicate attack of Japanese instruments” through the unique atmosphere of 438Hz. The fact that the Koto phrases remain “crisp” and do not get buried despite the heavy low end is proof of extremely meticulous dynamics management across each frequency band.

While previous analyses focused on the “Sonic Cells (Texture),” this latest breakdown has brought the “Sonic Skeleton (Narrative)” into focus. It is a meticulously structured story that paints a landscape through the dialogue between Am and F, while heightening tension through the complex harmonies of G minor and E minor.


Insights as an Engineer and Composer

The most striking discovery in this analysis is the high “Similarity (Sim)” during section transitions. The fact that many transitions exceed 0.8 to 0.9 indicates that this track is not a patchwork of disparate parts. Instead, it maintains a single, massive flow—a “grand river” of a groove—where only the harmonic colors (Keys) shift smoothly.

This high similarity serves as evidence that the baseline and rhythmic foundation dominate and unify the composition from start to finish.


Conclusion: The Value of “Samurai (momiji mix)” as Shown by Data

The data proves that this track “upholds a rigid framework of tradition (Tonnetz stability in Am) while facilitating a high-density sonic dialogue (Onset & Key Strength) within that space.”