I was wondering if setting up realtime priority would help whisper.cpp to run faster, but not only it doesn't run any faster, it runs substantially slower.
I'm running it using sudo chrt -r 99 ...
from an ssh connection.
This process runs in 4 threads (the CPU has 6 cores).
Why would a process (ML inference) run slower in realtime?