How to split an mp3 file by detecting silent parts?

Question

I have a big mp3 file including more songs. Is there any way to split the file by detecting silent parts? I would prefer a command line tool (e.g. ffmpeg).

Tried:

youtube-dl --ignore-errors --format bestaudio --extract-audio --audio-format mp3 --audio-quality 160K --output "%(title)s.%(ext)s" --yes-playlist

show us what you have tried so far using ffmpeg or others
– Scott Stensland
Commented Aug 5, 2020 at 14:51 — Scott Stensland, Commented Aug 5, 2020 at 14:51

llogan · Accepted Answer · 2020-08-05 18:43:33Z

You can do this manually with ffmpeg, and of course it can all be scripted if you prefer.

Get silent timestamps

Get silence with the silencedetect filter:

ffmpeg -i input.mp3 -af silencedetect -f null -

Note the default minimum length for silence is set to 2 seconds, but it can be adjusted. See ffmpeg -h filter=silencedetect.
There is also a silenceremove filter.

Example output using awk:

$ ffmpeg -i input.mp3 -af silencedetect=d=0.5 -f null - |& awk '/silencedetect/ {print $4,$5}'
silence_start: 1.20837
silence_end: 1.92546
silence_start: 3.51778
silence_end: 4.0881
silence_start: 6.40315
silence_end: 7.7922

Split

There are a few methods to split.

segment muxer

Example of splitting with the segment muxer:

ffmpeg -i input.mp3 -f segment -segment_times 1.20837,1.92546,3.51778,4.0881,6.40315,7.7922 -reset_timestamps 1 -map 0:a -c:a copy output_%03d.mp3

You will need to delete the silent segments. You could perform a Bash loop on the output files, use silencedetect to find these segments, then delete/move them if you want to script that process.
Note the use of -c:a copy which enables stream copy mode so your MP3 does not get re-encoded to avoid generation loss.

`-ss` and `-t` or `-to`

Using these options will omit the silent segments but is more work to make the commands:

ffmpeg -i input.mp3 -to 1.20837 -c copy output_01.mp3
ffmpeg -i input.mp3 -ss 1.92546 -to 3.51778 -c copy output_02.mp3

…and so on.

Or do it in one command:

ffmpeg -i input.mp3 -to 1.20837 -c copy output_01.mp3 -ss 1.92546 -to 3.51778 -c copy output_02.mp3

As in the segment muxer command this also uses stream copy.

I really like your answer, dude! One question, though: if you're going to use awk to print out the silent segments, why not pipe that right back into sed so you can rewrite this into the command you finish with? Then you would have a one-liner that works for everyone without any manual rework, and can even be saved as a script that does this task for anyone else who comes across this page. — Crates, Commented Aug 7, 2020 at 1:36
@Crates The awk example was a lazy afterthought. I almost didn't even include it. The main purpose of my answer was only to provide the step-by-step ffmpeg commands. It didn't seem like the question asker was looking for a turnkey script so I didn't spend the extra time making one. — llogan, Commented Aug 7, 2020 at 3:21
To get just hh:mm:ss timestamps of silence_end: ffmpeg -i input.mp3 -af silencedetect=d=0.1 -f null - |& awk '/silence_end/ {print $4,$5}' | awk '{S=$2;printf "%d:%02d:%02d\n",S/(60*60),S%(60*60)/60,S%60}' (this to get the album) — Pablo Bianchi, Commented Apr 3, 2021 at 3:48

Pablo Bianchi · Accepted Answer · 2021-04-03 03:18:30Z

You will likely not get a ready-to-go solution but will need to create a script yourself. pydub is made for this (it uses ffmpeg or libav internally) and it has a functions called split_on_silence() with some settings like keep_silence=100.

Some references from stackoverflow that have examples: 37725416 and 45526996 And from the creator of pydub.

From the 1st link the interesting part:

# Load your audio.
song = AudioSegment.from_mp3("your_audio.mp3")

chunks = split_on_silence (
   # Use the loaded audio. 
   song, 
   # Specify that a silent chunk must be at least 2 seconds or 2000 ms long.
   min_silence_len = 2000,
   # Consider a chunk silent if it's quieter than -16 dBFS.
   # (You may want to adjust this parameter.)
   silence_thresh = -16
)

Pablo Bianchi · Accepted Answer · 2021-04-03 03:28:53Z

15

mp3splt is a command for splitting mp3s and I believe mp3splt has a silence detector (among other methods of detection)... without re-encoding. To install:

sudo apt install mp3splt

Usage using Silence mode (-s):

mp3splt -s your.mp3

Mp3splt will try to automatically detect splitpoints with silence detection and will split all tracks found with default parameters.

Or

mp3splt -s -p th=-50,nt=10 your.mp3

passing desired parameters, splitting 10 tracks (or less if too much) with the most probable silence points at a threshold of -50 dB.

edited Apr 3, 2021 at 3:28

Pablo Bianchi

16.2k4 gold badges81 silver badges122 bronze badges

answered Aug 5, 2020 at 19:21

WU-TANG

3,1211 gold badge10 silver badges19 bronze badges

Please include an example usage of the command showing this silence detection.
– muru
Commented Aug 6, 2020 at 5:39

Add a comment |

muru · Accepted Answer · 2020-08-06 05:38:14Z

1

Get the timings by playing clip that you want to cut, I generally do long video clips that I want to trim or to cut out parts of the clip. It can be trimed by using video editing software, but these will typically also recompress the file on export. This reduces the quality of the video, and it also takes a long time. a solution is to perform zero loss cutting is there in ubuntu, in ubuntu there is a way to do this with FFmpeg :

example :

ffmpeg -i Myclip.mp4 -ss 00:15:10 -to 00:09:10 -c:v copy -c:a copy output.mp4

above will cut clip from about 0h10min10s to 0h09min0s, and this takes few seconds to complete.

edited Aug 6, 2020 at 5:38

muru

201k55 gold badges496 silver badges750 bronze badges

answered Aug 5, 2020 at 15:16

Vijay Joshi

213 bronze badges

It's a valid point about avoiding re-encoding, though the problem is actually often exaggerated and zero-conversion cutting has its own problem, namely that you can only split at block boundaries – which may result in the cuts being not quite in silence. Seeing as the input files are in the outdated MP3 format, I'd actually rather exact-cut them in plain PCM (what ffmpeg does by default, baring -c copy) and then re-compress in a more modern&efficient codec such as OPUS, which could give significantly smaller output files without making the quality notably worse than the original MP3 input.
– leftaroundabout
Commented Aug 6, 2020 at 9:55

Add a comment |

MDeBusk · Accepted Answer · 2020-08-11 16:43:11Z

0

I know you'd prefer a command-line tool, and I can't blame you. At the same time, if you're doing what I think you might be doing, Audacity is a better choice. It will let you split the file with greater precision and with far less trial-and-error.

(I've tried using ffmpeg to split such files and I will sometimes get a split where I don't want one.)

Here's the relevant part of the Audacity manual.

answered Aug 11, 2020 at 16:43

MDeBusk

1,2102 gold badges9 silver badges21 bronze badges

Add a comment |

Stack Exchange Network

How to split an mp3 file by detecting silent parts?

5 Answers 5

Get silent timestamps

Split

segment muxer

`-ss` and `-t` or `-to`

You must log in to answer this question.

Not the answer you're looking for? Browse other questions tagged
ffmpeg
mp3
audio-recording
.

Linked

Hot Network Questions

How to split an mp3 file by detecting silent parts?

5 Answers 5

Get silent timestamps

Split

segment muxer

-ss and -t or -to

You must log in to answer this question.

Not the answer you're looking for? Browse other questions tagged ffmpegmp3audio-recording.

Linked

Related

Hot Network Questions

`-ss` and `-t` or `-to`

Not the answer you're looking for? Browse other questions tagged
ffmpeg
mp3
audio-recording
.