DanT: Kapow

dan.tilley · June 8, 2026, 10:36pm

July 2026 - v2.7.1 - Adds drag-and-drop, QoL features, Memory Banks
June 2026 - v2.7.0 - initial version of Kapow added

Kapow

Overview

Kapow is a polyphonic additive spectral resynthesis voice. Instead of playing back raw audio files directly, it tries to reconstruct sounds by layering individual sine wave oscillators and a basic residual noise generator.

What is Resynthesis

Resynthesis is the process of recreating an existing sound using basic acoustic building blocks. According to acoustic theory, any complex audio signal can be broken down into a combination of simple sine waves layered together at different frequencies and volumes.

To recreate a sample, Kapow first analyses the original audio to find these frequencies. It then sets up a bank of sine wave oscillators—each tuned to a specific component of the original sound—and plays them back together. By layering these sine waves alongside a bit of filtered noise for the unpitched parts of the sound, the module attempts to synthesise a recognisable recreation of the original audio.

Resynthesis vs Sampling

A traditional sampler plays back the raw PCM (Pulse Code Modulation) audio samples in a linear fashion. To change the pitch, it has to speed up or slow down the playback speed, which unfortunately links pitch and time together: transposing a sound up makes it play faster and shorter, while transposing it down makes it play slower and longer.

Kapow does not play back raw audio samples.

Instead of playing back the recorded waveform, Kapow uses a Short-Time Fourier Transform (STFT) during its offline analysis phase. It slices the audio into 5-millisecond frames (a 200 Hz analysis rate). In each frame, it measures:

The overall volume envelope of the sound.
The frequency ratios and amplitudes of up to 256 individual sine wave partials.
The residual noise level and its spectral tilt (whether the noise is low-frequency rumble or high-frequency hiss).

When triggered, Kapow reads this structural data from memory and instructs its internal oscillators to recreate the sound from scratch.

Pros of Resynthesis

Because the sound is built from scratch in real time, it opens up some musical options that traditional samplers struggle to achieve:

Independent Pitch and Time: You can stretch the playback speed to extreme values—from completely frozen (a single spectral snapshot) to 10x speed—without altering the pitch. You can also modulate the pitch across octaves via CV without changing the playback speed.
Envelope Warping: You can shape the playback velocity non-linearly. For example, you can compress the attack stage to make it punchy, or stretch the decay into an ambient wash.
Smooth Reverse Playback: Reversing playback simply means reading the analysis frames backward.
Fidelity Scaling: You can select how many active sine waves to use. Lower counts create lo-fi, chiptune, or vintage additive textures, while higher counts offer a clearer reconstruction of the original sound.
Transient and Noise Shading: You can independently boost or cut the attack transient, reshape the volume fade-out curve, or adjust the residual noise balance.

Consequences of Resynthesis

While this approach is very flexible, it certainly has some quirks and trade-offs:

Offline Analysis Needed: You cannot just drag-and-drop audio for instant playback. Every sample has to be pre-processed once through the analyser to create the .kapow file.
Heavy CPU Footprint: Running up to 256 sine wave oscillators per voice in polyphonic patches with up to 16 channels can be extremely CPU demanding.
Pitch Tracking Quirks: Resynthesis depends on finding the fundamental frequency to scale its sine wave ratios. The analyser uses the Harmonic Product Spectrum (HPS) pitch-detection algorithm to automate this, though complex samples can sometimes confuse it.

Legal & Copyright Considerations

Disclaimer: I am not a lawyer, and this is not legal advice.

From a legal perspective, resynthesis might be a bit of a grey area. Because .kapow files do not store raw PCM audio data, sharing a .kapow file is not quite the same as distributing a copyrighted audio sample. Instead, it is simply a mathematical list of sine wave ratios, amplitudes, and noise shapes.

This makes .kapow files an interesting option for users who want to share samples. However, please be responsible and respect original copyright laws and the work of other artists when analysing and synthesising source material.

Basic Operation

Setting up Kapow is relatively simple. Here is the basic workflow to get a sound up and running:

Load and Analyse Audio: You can select a file via Audio File Processing → Select Audio File… or simply drag-and-drop any supported audio file (.wav, .mp3, .flac) or pre-analyzed file (.kapow) directly onto the module display panel.
Background Analysis: When loading raw audio, a dark status overlay displays progress. You can bypass the simulated stage delay using Simulate Stage Delay in the context menu, or have the overlay close automatically once finished by checking Auto Close Analysis Overlay.
Save the Kapow File: Click Save Data on the overlay (or right-click → Save Analysis Data As…) to export a .kapow file. If you prefer not to manage external files, turn on Embed Kapow data in patch in the context menu to save analysis data directly inside the patch file.
Connect the output to your mixer or audio system.
Tuning: Double-click both the Fundamental V/Oct coarse knob and the Fine Tune knob. These knobs will snap to the default values calculated by the analyser, matching the detected base pitch of your original sample.
Trigger vs. Gate Playback:

Clicking the trigger button sends a short trigger, which plays the sound once from start to finish.
Holding the trigger button sends a gate, which plays the sound forward, and if you keep holding it, the voice will transition into a sustain loop centred around the Gate Hold Position (which the analyser tries to place at the most stable tonal region of the sample).

Preparing Audio for Analysis

To get the best possible resynthesis results, it helps to prepare your audio files before running them through the analyser:

10-Second Analysis Limit: The analyser is designed with a strict limit of 10 seconds. If you select an audio file that is longer than 10 seconds, it will be automatically truncated to the first 10 seconds.
Trim and Clean: A clean, well-trimmed audio sample will always yield a much better resynthesis than a messy recording with background noise. While Kapow does its best to dynamically compensate for low volumes and detect fundamental frequencies, background hum, room rumble, or tape hiss can easily confuse the HPS pitch tracking and partial tracking, leading to metallic ringing or pitch instability.
Normalise and Noise Removal: For the best results, it is a good idea to normalise your sample, trim silence from both ends, and clean up any background noise in an external editor before running the analysis.

Module Controls

Pitch & Playback Controls

Fundamental V/Oct (Coarse): Adjusts the base pitch of the resynthesised voice over a range of -5.0V to +5.0V. Double-clicking snaps it to the frequency the analyser detected.
Fine Tune: Tweak the pitch by -100 to +100 cents. Double-clicking snaps to the detected cents offset.
VOCT Input: Standard V/Oct pitch control input. Polyphonic.
Time Scale: Controls the playback speed (-10.0x to +10.0x). Uses exponential scaling: positive values speed up playback, while negative values slow it down.
Reverse Playback: A toggle switch to reverse the playhead direction (Forward/Reverse).

Envelope & Shape Modifiers

Envelope Warp: This control compresses the envelope shape (-100% to +100%) towards either the start or the end of the sample.
Fade Out Strength: Sets the depth (0% to 100%) of a volume fade applied to the final 20% region of the sound.
Fade Out Shape: Adjusts the curve of the fade-out from log through linear to exp.

Transient & Loop Controls

Transient Position: Sets the position in the sample where the transient strength control affects the envelope. Snaps on double-click to the frame where the analyser found the most prominent transient.
Transient Strength: Adjusts the envelope at the transient position. Positive values boost the attack, while negative values soften it.
Gate Hold Position: Sets the position in the sample of the sustain loop, activated when holding a gate high. Snaps to the detected stable tonal frame on double-click.

Timbre & Fidelity Controls

Sines (Resynth Sine Wave Count): Represents a power-of-2 count from 2 to 256 active sine waves. Fewer partials create a lo-fi vintage chiptune vibe and save a lot of CPU, while higher counts offer a clearer reconstruction.
Noise Amount: Adjusts the level of the residual noise component.
Noise Tilt: Balances low-frequency vs. high-frequency noise using a simple Tilt Filter.

Triggering & Outputs

Trigger/Gate Button & Input: Manually gates/triggers the voice or accepts CV trigger/gate signals.
Reset Button & Input: Triggers a soft reset. You can choose which parameters are reset in the right-click menu.
Activate Memory Bank input: Polyphonic CV trigger port to dynamically recall memory banks. Triggers map to polyphonic channels (Channel 1 maps to Bank 1, and so on) using Schmitt triggers. If a bank is unset, triggers on that channel are safely ignored.
Kapow Output: Resynthesised audio output (scaled to standard Eurorack +/-5V, hard-clipped at +/-10V). It uses a basic lowpass decay filter to smoothly ramp the volume to zero when playback ends to prevent popping. Polyphonic.
EOR (End of Resynthesis) Output: Emits a 10V trigger pulse (1ms duration) when playback reaches the end of the sample (or the start when in reverse). Polyphonic.

Interactive Visualiser Display

The display provides a simple real-time guide to help you see what is happening to the sound:

Log-mapped Spectrogram: A background plot of the top 32 loudest partials, mapped logarithmically from 20 Hz to 20 kHz.
Dynamic Envelope Overlay: A faint yellow line shows the original analysed volume envelope, while a solid yellow line represents your warped or modified volume envelope.
Interactive Markers:
An orange dot shows the Transient Position.
A green dot shows the Gate Hold Position.
A red curve shows the Fade Out effect.
Multi-Playhead Tracker: Shows where each active voice playhead is currently positioned.
Top left displays a poly badge when the module is in polyphonic mode.
Bottom left displays the original sample’s detected pitch.
Bottom right displays the length of playback in seconds. There is a horizontal axis along the bottom of the display that also represents playback time.

Right-Click Context Menu

Right-clicking the panel gives you access to several additional settings:

Retriggers Enabled: When checked, sending a new trigger or gate while the voice is active immediately ends that playback and starts a new playback. When unchecked, any new triggers or gates are ignored until the voice finishes playing.
Polyphony Enabled: Toggles polyphonic support.
Reset Behaviour Submenu: Lets you select which knobs and parameters are affected when you click the Reset button or send a Reset CV pulse.
Reset on successful analysis: When checked, automatically triggers a parameter and voice reset the moment a background audio file analysis finishes successfully.
Reset for Kapow drag-and-drop: When checked, automatically triggers a parameter and voice reset when you drag-and-drop a file onto the module.
Embed Kapow data in patch: When checked, automatically saves the main analysis data and all 16 memory bank files directly inside the VCV Rack patch storage folder/bundle (making your patch completely self-contained). When unchecked, the module references the local file path.

NOTE: Disabling this option on a patch that contains embedded data will prompt you to save all embedded analysis files locally back onto your disk to prevent data loss.

Memory Banks Submenu: Accesses the 16 memory preset slots.
- Store Current State: Saves the currently loaded analysis data and all 13 knob parameters to this bank.
- Load Bank State: Recalls the bank’s stored parameters and analysis data (swaps cleanly with a click-free 10ms crossfade, caching incoming triggers during the transition).
- Clear Bank: Deletes all stored data from the bank.

Audio File Processing Submenu:
- Select Audio File…: Open a dialog to analyse a WAV, MP3, or FLAC file.
- Load Analysis Data…: Load a pre-saved .kapow file.
- Save Analysis Data As…: Export the active analysis data to a .kapow file.
- Open Presets folder…: Access the folder where Kapow presets are saved by default.
- Trim Pre-Silence: When checked, the analyser automatically discards silence at the very start of your audio file.
- Silence Threshold (dB): Sets the volume threshold used by the silence-trimmer (defaults to -60 dB).
- Simulate Stage Delay: When checked, simulates stage-by-stage delay on the status overlay for visual feedback. Unchecking it bypasses the delay for near-instant analysis.
- Auto Close Analysis Overlay: When checked, automatically closes the dark progress status overlay immediately upon successful analysis completion.

Polyphony

Right-clicking the panel and checking Polyphony Enabled enables up to 16 voice channels. The module automatically manages the active channel count based on the number of channels connected to the VOCT and TRIGGER inputs.

CPU Saving Tips

Because running hundreds of oscillators in real time is computationally expensive, polyphonic patches can be very CPU heavy, so here are a few tips to reduce the weight:

Reduce Sines Count: Lowering the Sines knob (e.g., to 16 or 32 sines) dramatically reduces the work the CPU has to do, and depending on the original sample, can often sound as good as higher counts.
Minimise Channel Count: Try to only use as many polyphonic channels as your patch actually needs.
Bypass the Tilt Filter: Setting the Noise Amount to 0% completely bypasses the Tilt Filter code.
Reduce Envelope Warp: Keeping Envelope Warp closer to 0% avoids complex power-curve calculations.

Creative Sound Design Ideas

Because Kapow synthesises sound from scratch and completely decouples pitch and speed, it has some non-obvious creative use cases. Here are just a few creative ideas:

Endless Drone Generator: Patch a slow LFO into the Reverse Playback CV input. Trigger playback once. As the LFO toggles playback back and forth, the voice will continually read the analysis data forward and backward without ever stopping. Combine this with manual tweaks or CV modulation of the Time Scale (to stretch the sound) and Fundamental V/Oct controls to create endless, shifting spectral drones. You can also combine the LFO with an offset and then use the offset amount to move the directional loop backwards and forwards through the sample.
Granular Style Glitches: Turn up the time control so that the sample playback is quite fast. Patch a random jumping CV to the Gate Hold Position. Patch the EOR output to the Trigger input so that when the playback ends it immediatley retriggers a new playback. Patch an additional manual gate to the Trigger input, now when you hold a gate the playback quickly reaches the sustain loop and jumps around based on the random CV.
Frequency Modulation (FM) Chaos: Try patching an audio-rate signal from a standard VCO into the VOCT pitch input. Depending on the harmonic complexity of your original sample, this frequency modulation can generate beautifully metallic, chaotic, or completely unearthly sideband timbres.
Resculpting Non-Drum Sounds into Percussion: Load a short, non-percussive sample (such as a spoken word, a vocal snippet, or a string pluck). Turn the Sines count down to simplify the harmonics, boost the Transient Strength to its maximum positive value for a sharp clicky attack, and use the Noise Amount and Noise Tilt to sculpt the decay. With a bit of experimentation, you can turn almost any sound into unique electronic hi-hats, metallic snares, or clicky woodblocks.

`.kapow` Files

The .kapow file is a proprietary binary data format designed just for the Kapow resynthesis engine.

No PCM Audio: Unlike standard audio files, .kapow files do not contain any raw audio waveforms. Instead, they store structured lists of analysed parameters (envelope values, sine wave frequency ratios, partial amplitudes, and noise shapes).
Efficient Loading: The file is structured to match the internal SIMD layout of the engine. This lets Kapow load files quickly and perform real-time memory interpolation.

dan.tilley · June 8, 2026, 11:04pm

rsmus7 · June 9, 2026, 6:03am

Great, this is a really interesting new module

I will try it asap

Alphagem-O · June 9, 2026, 8:35am

This looks like a lot of fun! Thanks for the module!

gabtiorbi · June 9, 2026, 9:03am

After a few quick tests, the verdict is undeniable: this module is extraordinary! I think it will advantageously replace all the sampler modules in my library. Thank you!

codyge · June 9, 2026, 10:30am

Cool! Reminds me a bit of the Panharmonium eurorack module. Like a lo-fi mp3 representation of a sample that you can then mess around with in FFT space and just go mad with.

connorevansguitar · June 9, 2026, 4:08pm

Absolutely filthy module! Instantly fell in love.

dan.tilley · June 9, 2026, 9:55pm

dan.tilley · June 10, 2026, 8:24pm

jackokring · June 10, 2026, 9:30pm

This module is interesting. I need samples though. Magix-ed away my free loops …

dan.tilley · June 15, 2026, 9:54pm

Alphagem-O · June 17, 2026, 12:52pm

Kapov provides endless opportunities for sound design .

Would it be possible and what would you think of the idea of adding an audio input and a small record function, so own audio snippets could be used on the fly?

The way I envision it, Kapov would record the audio input during the period between two instances of pressing a record button (or start and stop), then auto-analyze and play the recording through the engine.

Similar to NYSTHI’s Simpliciter, the module would still play back the previous snippet while recording is in progress, then replace playback at the end of recording and analysis.

Just an idea, might be too complex or CPU heavy to implement, but would be quite cool.

dan.tilley · June 17, 2026, 5:52pm

There was a similar request on BlueSky…

As I said there, I’ll consider it. I may have some alternative solutions once my next module is functional, but just the analysis stage that is required makes an audio input for Kapow feel awkward. Would you want stereo inputs as well? The analysis stage mixes down to mono.

Lets try to think of some alternative workflows that might be better.

What about some sort of context menu option that will run through an entire directory of audio files and create .kapow files for all of them?

Or perhaps I can create a standalone executable of the analysis part that you can use outside of Rack?

Alphagem-O · June 17, 2026, 7:42pm

Thank you for considering a recording feature! I don’t feel strongly about anything I propose here, but would prefer a live input rather than advanced sample processing. My least favorite would be sample preparation outside VCV Rack.

Stereo would be great, although with my dinosaur PC, I may have to be concerned about CPU drag, so mono is fine also if stereo becomes too ‘expensive’.

For me, waiting a couple of seconds to complete processing actually wouldn’t seem to feel so awkward. I don’t perceive a need for a dry (input)/wet (playback) control, so I’d just let the buffer play until analysis is completed in the background. This would then be playing the recording from the previous cycle or silence (on the first or cycle), and the user would hardly notice the analysis phase, except maybe for a slight delay to commence.

Another question, independent of the one above: Are the .kapov files actually saved with the .vcv file, or is only the link maintained? If the .kapov is kept separately, it might be worth to have an option in the context to save it with the patch to make sharing more straightforward. Again, just an idea.

dan.tilley · June 17, 2026, 10:37pm

The patch saves which kapow file is loaded into the module, it doesn’t save the actual kapow file itself.

If you load a patch with a Kapow module that tries to load a kapow file that doesn’t exist, then the module will default to kapow.kapow, and if that doesn’t exist for some reason then the module will default to empty and play no sound.

Stereo would be great, although with my dinosaur PC, I may have to be concerned about CPU drag, so mono is fine also

My point was more along the lines of, the analysis is mono, analysing a stereo vs mono doesn’t make much difference, if there was a single input, stacking multiple input cables on it would have the same effect, they get mixed down.

For me, waiting a couple of seconds to complete processing actually wouldn’t seem to feel so awkward.

Actually, the delay in the analysis is fake, for most of my testing the analysis ran too fast to even see the stage readout, so i added a small delay between each stage. I could add a context menu to disable the delay and make the analysis as fast as possible.

What I mean by awkward is that the input would have live audio playing, in order to convert this into a kapow file, first it has to save the audio into either a file or a buffer, and then it has to pass this to the analyser. The analyser cannot work in real time (in its current form), it has to have the whole of the audio.

There are a bunch of technical questions around this process that I would have to answer, and some UX questions as well.

If we first save an audio file (and not just keep the audio in a buffer), then we need to account for the time it takes to write the file, which will be longer than the time it takes to write a kapow file. We also need to handle all the failure cases.

When you load an audio file, the max length is 10 seconds. Would the module have a start and stop record function, or does it just always record 10 seconds. Does it need to detect the volume of the input and not record silence? Does it attempt to normalise the audio once it is recorded?

It is already very easy to record audio files from Rack, and a small task to pick them for analysis in the Kapow module. I am not yet sure if live audio recording is an enhancement that is worth the effort.

Potentially there are other things I can do that will improve the workflow, like auto analysing files in a watched path, or looking into if I can enable drag and drop of audio files onto the module to analyse them etc…

Perhaps a memory slot system, where you can use CV to load different kapow files (would need to test to see if loading is fast enough).

Its still in consideration, just know that analysis of live audio to kapow file is probably not going to happen.

I was also thinking about how i use the module and wondered if making a slimmed down version might be good, one that uses less sines, no noise, and no envelope warping, but can load a different kapow file on each channel.

Alphagem-O · June 18, 2026, 5:07am

Thank you so much for your detailed answer!

That’s what I thought. So, when sharing a patch, the receiver is likely to not reproduce this part. I have the Sickozell samplers in mind, which I really like. They provide an option in the context menu to save the actual sample inside the .vcv. With Kapov, could this be done with the .kapov file, to make it self-contained?

I would prefer stereo then.

I suppose you wouldn’t have chosen the delay if there wasn’t some advantage to it. Why isn’t faster better?

Buffer would be fine with me, because the .kapov would get saved a few moments thereafter.

These are all very good questions! I’d say to keep the recording feature as simple as possible. Standard 10 sec would be fine. Or alternatively have the option to select a time (e.g. 1, 2, 3, 5, 10 sec fixed) in the context menu.

I understand. Still appreciate that you gave it a thought.

Path analysis, drag-drop and/or memory slots would be nice, too. Especially being able to control replacements by CV to add variation while performing. I also see good use for a slim version, although I would not cut down on mangling options like warping.

Thanks again for all your thoughts! Kapov definitely is very fun and useful addition to the repertoire of modules.

jaibhimadevi · July 13, 2026, 12:25am

Very cool module- though based on the name, I kinda had been hoping it was somehow based on this

dan.tilley · July 13, 2026, 9:54am

Interesting, will give that a look, might inspire new ideas, thanks

The Kapow name basically came from when I was implementing the transient strength modifier, you can take any sound and make it a drum like sample by increasing the transient strength at the start of the sample.

The default kapow file when you first use the module is a sample from a fight from the 60s Batman TV show

dan.tilley · July 13, 2026, 10:25am

DanTModules v2.7.1 is now live, bringing some upgrades to Kapow! I’ve also updated the original post at the top of this thread to integrate the new controls and settings into the user manual.

Here is a summary of the new features:

Drag-and-Drop Loading: Drag audio or .kapow files directly onto the panel. This works amazingly well with the new Siren module from Stoermelder & Omri. (Demo videos coming when I get time).
16 Memory Presets: Store different Kapow files and parameter shapes in 16 slots.
Polyphonic Memory Bank Recall: Switch memory banks dynamically using polyphonic triggers.
- Click-Free Swaps: A fast crossfade should prevent pops and clicks when changing banks.
- Trigger Caching: Triggers received during bank loads are queued and fired immediately after loading completes. (Only tested this on my powerful machine, hopefully it also does a good job on lower powered machines).
Patch Embedding: Embed your analysis files directly in the patch for 100% portability.
Quality of Life: Bypass simulated analysis delays, auto-close the progress status overlay, and auto-reset parameters on drop or successful analysis.