Which audio encoders in FFmpeg support 8 kHz?

Question

I have an old video (made by a Casio Exilim EX-Z40, if it matters), whose audio stream ffprobe reports as pcm_u8, 8000 Hz, mono, u8.

I would like to transcode it into something modern.

Transcoding with FFmpeg defaults fails:

libfaac doesn't support this output format!

So presumably libfaac doesn't support 8 kHz, because -c:a copy works.

Which encoders support an 8 kHz sampling rate? The list found here barely mentions sampling rates at all.

Can I script something that tries every installed codec, from…

ffmpeg -codecs | grep EA`

…to see directly which ones work?

Do note that there's absolutely no reason you can't just pick a modern format with a higher sample rate, so that it will be playable by modern devices which expect a higher one. The compression will go a long ways from letting the file sizes get much larger than if you encoded at 8khz. It's not like a 32khz file will be four times larger, in other words. — trlkly, Commented Oct 15, 2019 at 8:03

llogan · Accepted Answer · 2019-10-14 19:19:40Z

10

The native FFmpeg AAC encoder (-c:a aac) supports 8000 Hz sample rate:

ffmpeg -h encoder=aac
...
Supported sample rates: 96000 88200 64000 48000 44100 32000 24000 22050 16000 12000 11025 8000 7350

It will automatically choose the sample rate the most closely matches the input, so you don't need to declare -ar:

ffmpeg -i input.mov -c:a aac output.m4a

Which audio encoders in FFmpeg support 8 kHz?

aac, aptx, aptx_hd, dca, flac, g723_1, libfdk_aac, libmp3lame, libopus, libspeex, libvorbis, real_144, wavpack, many pcm variants.

There are probably others, but reporting of supported_samplerates is inconsistent.

I would like to transcode it into something modern.

libfaac has been removed from FFmpeg for years and is not considered to be a modern AAC encoder. Your ffmpeg must be ancient. Update and use the native FFmpeg AAC encoder, or compile and use libfdk_aac.

If you want the most modern use libopus.

But when I tried [aac], compared to the original, the file size increased and some high frequencies were attenuated.

Since I suspect your ffmpeg is very old you are likely missing the major quality updates to the encoder aac. Upgrade and quality will likely improve.

edited Oct 14, 2019 at 19:19

answered Oct 14, 2019 at 17:58

llogan

60.6k17 gold badges130 silver badges152 bronze badges

1

Yes, that ffmpeg was years old. So I compiled afresh from today's git snapshot. Still, even at 8 kHz, ac3 and alac make files bigger than legacy pcm_u8; flac has problems playing in my 2 year old vlc; mp3 has horrible artifacts. But at least aac works and is slightly smaller, so I'll use that.
– Camille Goudeseune
Commented Oct 14, 2019 at 19:21
5

FFMpeg's AAC compressor supports 8KHz, but many players don't. So by creating an 8KHz AAC you risk it being unplayable on many devices.
– Eugen Rieck
Commented Oct 14, 2019 at 19:29
1

@CamilleGoudeseune Depending on where you need it to play, consider one of the high-efficiency codecs such as HE-AAC v1/v2. Or Opus as llogan suggested.
– Bob
Commented Oct 15, 2019 at 7:10
Opus would be a better suggestion, where the top-quality encoder is the free open-source one, and it's good at low bitrate, especially for speech. FFmpeg's native aac encoder is apparently ok, and sometimes beats libfdk_aac though. trac.ffmpeg.org/wiki/Encode/HighQualityAudio says that as of 2017, aac is good, Oh, but @CamilleGoudeseune is using an old FFmpeg, and the native aac encoder was much worse in older FFmpeg.
– Peter Cordes
Commented Oct 15, 2019 at 15:15
Added an answer recommending Opus.
– Peter Cordes
Commented Oct 15, 2019 at 16:14

Add a comment |

Eugen Rieck · Accepted Answer · 2019-10-14 17:26:49Z

6

Sampling rate and codec are different parameters. Most likely you want something along the lines of

-ar 48000 -c:a aac

To upsample from 8KHz to 48KHz and the compress to AAC

answered Oct 14, 2019 at 17:26

Eugen Rieck

20.3k5 gold badges53 silver badges48 bronze badges

Good idea. But when I tried it, compared to the original, the file size increased and some high frequencies were attenuated. So, worse than -c:a copy.
– Camille Goudeseune
Commented Oct 14, 2019 at 17:32
3

No modern compressor is optimized for 8KHz sample rate, so everything modern will have difficulties with 8KHz audio. Depending on the output container format you want, you might or might not have the choice between different codecs. Please append your OQ or comment, then I can help you find the best possible version.
– Eugen Rieck
Commented Oct 14, 2019 at 17:55
Yes, brute force testing of a bunch of modern compressors confirms that they suck at 8 kHz.
– Camille Goudeseune
Commented Oct 14, 2019 at 19:23

Add a comment |

Rup · Accepted Answer · 2019-10-15 13:09:18Z

8 KHz is fairly standard for speech, known as 'narrow band'. If this is speech then you should have plenty of options, although not that many are supported by FFmpeg out-of-the-box. Probably the best options are

AMR - you can compile libopencode-amrnb into FFmpeg for support
Opus, which will use the Vorbis CELT speech codec

However 8KHz 8-bit PCM isn't a very good source in the first place: most encoders will expect / hope for better input, e.g. 8-bit G.711 mu-law which is effectively 12-bit data encoded as 8-bit floating point. They may not do well with pure 8-bit PCM input as it might not fit speech patterns they're modelled for.

It's also a fairly small file already, and it's possible that your video container won't support more complicated codecs. So I think this is more trouble than it's worth, and I'd leave the audio as-is.

Peter Cordes · Accepted Answer · 2019-10-15 22:03:18Z

Opus is generally considered the best low-bitrate codec available, and doesn't have problems with an 8kHz input sample rate. The resulting opus stream can still be decoded to whatever sample rate is convenient for the decoder. (Like other lossy codecs, it compresses based on frequency bands after doing an FFT. But some other codecs apparently only want to decode to the same sample rate as the input. As other answers point out, you can get FFmpeg to resample the input before giving it to the codec, but you don't need that for Opus.

Try ffmpeg -c:a libopus -b:a 24k -frame_duration 120 for 24 kbit/s Opus.

Perhaps worth trying: -application voip to tune for "improved speech intelligibility" instead of the default audio profile.

Setting -frame_duration to the highest value reduces overhead, I think. You don't care about encoder / decoder latency because you just have files, not real-time 2-way voice chat. So you can let it buffer 120ms of audio and pack together multiple CELT or SILK frames to reduce redundancy of frame headers.

The best available Opus encoder is the free and open source libopus (https://opus-codec.org) so FFmpeg can just use it, unlike with AAC where the best encoders are closed-source.

Opus has special modes for very low bitrate speech (like 16kb/s), detecting speech and even switching over to a speech-specific encoder (SILK) at low bitrates.

Opus's low-bitrate coding tools are similar to what HE-AACv2 can do, see the wikipedia article.

But when I tried it, compared to the original, the file size increased ...

Part of the point of lossy compression is that you can choose the output bitrate, trading off against quality. Most codecs can use -b:a 32k for example to choose an audio bitrate of 32 kbit/s.

(For video, you can also trade off CPU time spent encoding, e.g. -preset veryslow vs. -preset medium. But compressing audio is cheap enough that most codecs don't have a lot of options for spending more CPU time to improve the bitrate vs. quality tradeoff.)

Mono 8-bit 8kHz PCM has a bitrate of 64 kbit/s = 8 * 8000 so you're aiming for lower than that, otherwise you might as well keep your original files. PCM is just raw samples so bitrate is just a product of sample rate and sample width. Like the audio equivalent of a .bmp bitmap image. That's highly inefficient, and the reason better codecs were invented. (And as you know from listening, saving bitrate for PCM comes at a massive cost to quality and frequency range because bitrate is tied 1:1 with sample rate. That's not the case when you quantize in the frequency domain with a lossy codec.)

and some high frequencies were attenuated. So, worse than -c:a copy

FFmpeg's native AAC encoder -c:a aac used to be pretty bad, and you were using an old FFmpeg. https://trac.ffmpeg.org/wiki/Encode/HighQualityAudio says that as of 2017, aac is sometimes better than libfdk_aac for AAC-LC (low-complexity high bitrate). It doesn't mention HE-AAC, though, and that's what you want for low bitrate AAC.

libfdk_aac used to be the best open-source AAC encoder available, and maybe still is for HE-AAC. AFAIK, neither of them are as good as the best non-free AAC encoders, though.

For low-bitrate AAC, you really want HE-AAC which adds more coding tools https://en.wikipedia.org/wiki/High-Efficiency_Advanced_Audio_Coding. I'm not sure if -c:a aac can do that.

https://trac.ffmpeg.org/wiki/Encode/HighQualityAudio lists some recommended settings and ranges of useful bitrates for various encoders.

But you probably want Opus, or possibly AMR-NB (narrowband) for bitrates like 4 kbit/s. I don't know how old the quality vs. bitrate plot on the Opus wiki article is, but it shows AMR-NB at higher quality than Opus down below 8kb/s.

With that few bits, you might be able to understand speech but it won't sound nice. It's just a question of which codec is least horrible.

Stack Exchange Network

Which audio encoders in FFmpeg support 8 kHz?

4 Answers 4

You must log in to answer this question.

Not the answer you're looking for? Browse other questions tagged
ffmpeg
audio-conversion
.

Hot Network Questions

Which audio encoders in FFmpeg support 8 kHz?

4 Answers 4

You must log in to answer this question.

Not the answer you're looking for? Browse other questions tagged ffmpegaudio-conversion.

Related

Hot Network Questions

Not the answer you're looking for? Browse other questions tagged
ffmpeg
audio-conversion
.