visualizeAudio()
Part of the @remotion/media-utils
package of helper functions.
This function takes in AudioData
(preferably fetched by the useAudioData()
hook) and processes it in a way that makes visualizing the audio that is playing at the current frame easy.
Arguments
Takes an object containing the following values:
audioData
AudioData
An object containing audio data. You can fetch this object using useAudioData()
or getAudioData()
.
frame
number
The time of the track that you want to get the audio information for. The frame
always refers to the position in the audio track - if you have shifted or trimmed the audio in your timeline, the frame returned by useCurrentFrame
must also be tweaked before you pass it into this function.
fps
number
The frame rate of the composition. This helps the function understand the meaning of the frame
input.
numberOfSamples
number
Must be a power of two, such as 32
, 64
, 128
, etc. This parameter controls the length of the output array. A lower number will simplify the spectrum and is useful if you want to animate elements roughly based on the level of lows, mids and highs. A higher number will give the spectrum in more detail, which is useful for displaying a bar chart or waveform-style visualization of the audio.
smoothing
boolean
When set to true
the returned values will be an average of the current, previous and next frames. The result is a smoother transition for quickly changing values. Default value is true
.
optimizeFor?
v4.0.83
"accuracy" | "speed"
Default "accuracy"
. When set to "speed"
, a faster Fast Fourier transform is used. Recommended for Remotion Lambda and when using a high number of samples. Read user experiences here.
Return value
number[]
An array of values describing the amplitude of each frequency range. Each value is between 0 and 1. The array is of length defined by the numberOfSamples
parameter.
The values on the left of the array are low frequencies (for example bass) and as we move towards the right, we go through the mid and high frequencies like drums and vocals.
Usually the values on left side of the array can become much larger than the values on the right. This is not a mistake, but for some visualizations you might have to apply some postprocessing to it, you can flatten the curve by for example mapping each value to a logarithm of the original function.
Example
In this example, we render a bar chart visualizing the audio spectrum of an audio file we imported using useAudioData()
and visualizeAudio()
.
tsx
import {useAudioData ,visualizeAudio } from "@remotion/media-utils";import {Audio ,staticFile ,useCurrentFrame ,useVideoConfig } from "remotion";export constMyComponent :React .FC = () => {constframe =useCurrentFrame ();const {width ,height ,fps } =useVideoConfig ();constaudioData =useAudioData (staticFile ("music.mp3"));if (!audioData ) {return null;}constvisualization =visualizeAudio ({fps ,frame ,audioData ,numberOfSamples : 16,}); // [0.22, 0.1, 0.01, 0.01, 0.01, 0.02, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0]// Render a bar chart for each frequency, the higher the amplitude,// the longer the barreturn (<div ><Audio src ={staticFile ("music.mp3")} />{visualization .map ((v ) => {return (<div style ={{width : 1000 *v ,height : 15,backgroundColor : "blue" }}/>);})}</div >);};
Postprocessing example
A logarithmic representation of the audio will look more appealing than a linear one. Below is an example of a postprocessing step that looks prettier than the default one.
tsx
/*** This postprocessing step will match the values with what you'd* get from WebAudio's `AnalyserNode.getByteFrequencyData()`.** MDN: https://developer.mozilla.org/en-US/docs/Web/API/AnalyserNode/getByteFrequencyData* W3C Spec: https://www.w3.org/TR/webaudio/#AnalyserNode-methods*/// get the frequency dataconstfrequencyData =visualizeAudio (params );// default scaling factors from the W3C spec for getByteFrequencyDataconstminDb = -100;constmaxDb = -30;constamplitudes =frequencyData .map ((value ) => {// convert to decibels (will be in the range `-Infinity` to `0`)constdb = 20 *Math .log10 (value );// scale to fit between min and maxconstscaled = (db -minDb ) / (maxDb -minDb );returnscaled ;});