Build Essentia in release #295

kmod-midori · 2022-12-31T09:58:44Z

Related to #221.

This builds a working essentia_streaming_extractor_music binary in /usr/bin that can be used to extract music features. It is not a static build and uses the ffmpeg library that are built in the same process.

Along with other dependencies, this change makes the final image much larger at 50 MiB on my local system.

libsamplerate is built manually since Alpine does not package it, but it is present in Debian and Arch. Apart from this, you should be able to copy my changes to other Dockerfile and call it a day (this will make CI slower, so proceed with caution).

Essentia can become problematic when Debian moves to ffmpeg 5.x (the current stable is fine, but testing and sid are already on 5.x).

BTW, why aren't you doing make -j$(nproc) when building these libraries?

epoupon · 2022-12-31T12:54:14Z

Thanks for building essentia! That's something I definitely wanted to try, specially to get an idea of how big it is.
50MiB is really huge, I really want to keep LMS lighweight, and the ffmpeg5 issue is really annoying too.
I am reading up on the recommendation/similarity topics, and since I want to get rid of ffmpeg binary calls and directly decode/encode using ffmpeg's libs, it may be not that hard to analyse the songs and extract ourselves the few low level features we want.

epoupon · 2022-12-31T12:57:25Z

BTW, why aren't you doing make -j$(nproc) when building these libraries?

Yes indeed we can restore this
Edit: ah actually I just remember
Dockerfile-build- files do use make -j$(nproc) because they are run via github actions. But Dockerfile-release is run locally using dockerx, and there are enough archs to fill in all the available cores)

kmod-midori · 2022-12-31T16:48:23Z

I have removed some large algorithms that is not really used by the extractor that we are using.

Found the problem: building the image before applying the changes in this PR produces a 34.8 MB image on my machine, much larger than what you have on Docker Hub.

Currently the image on my machine after these changes is 44.2 MB, so a 10 MB gain for this feature.

kmod-midori · 2022-12-31T16:54:13Z

it may be not that hard to analyse the songs and extract ourselves the few low level features we want.

If memory serves me right, we are using almost all the low-level features?

epoupon · 2022-12-31T18:44:25Z

it may be not that hard to analyse the songs and extract ourselves the few low level features we want.

If memory serves me right, we are using almost all the low-level features?

Not exactly, we get them all from AB, but only some of them are useful for clustering based on similarities. I tried a genetic algo to select the best features but I really lacked some time to get a proper training set and to optimize the SOM training part. And this is still in my todo list.
The current results I got is here:

lms/src/libs/services/recommendation/impl/features/FeaturesEngine.cpp

Line 51 in 7746ed7

{ "lowlevel.spectral_energyband_high.mean", {1}},

(Only five low level entries used, but we should have more to keep experiment on these. I found a thesis with valuable info on the useful features, will try to get a link)

epoupon · 2022-12-31T18:46:05Z

And btw images are compressed on docker hub. If you fetch the image you can see locally its real size

kmod-midori · 2023-01-01T14:12:24Z

If we can decide which features to use, it is possible to create a minimized version of essentia that only contains required algorithms. Some of the algorithms are really large (looking at object sizes).

However, I do not have a large library available, so I can not really help.

Danoloan10 · 2023-01-06T16:32:04Z

Benchmarking by running the extractor for a FLAC file, these were the algos that were used:

MusicExtractor,MetadataReader,AudioLoader,StereoDemuxer,StereoMuxer,Resample,StereoTrimmer,LoudnessEBUR128,FrameCutter,NoiseAdder,LoudnessEBUR128Filter,IIR,UnaryOperatorStream,BinaryOperatorStream,Mean,EqloudLoader,MonoLoader,MonoMixer,Trimmer,Scale,EqualLoudness,ReplayGain,InstantPower,EasyLoader,Windowing,Spectrum,FFT,Magnitude,SilenceRate,ZeroCrossingRate,MFCC,MelBands,TriangularBands,DCT,CentralMoments,DistributionShape,FlatnessDB,Flatness,GeometricMean,Crest,GFCC,ERBBands,BarkBands,FrequencyBands,UnaryOperator,Decrease,RollOff,Energy,RMS,EnergyBand,HFC,Flux,StrongPeak,SpectralComplexity,SpectralPeaks,PeakDetection,PitchSalience,AutoCorrelation,Centroid,Dissonance,Entropy,SpectralContrast,Loudness,DynamicComplexity,RhythmExtractor2013,BeatTrackerMultiFeature,CartesianToPolar,OnsetDetection,TempoTapDegara,MovingAverage,OnsetDetectionGlobal,TempoTapMaxAgreement,BeatTrackerDegara,BpmHistogramDescriptors,OnsetRate,Onsets,Danceability,TuningFrequency,BeatsLoudness,Slicer,SingleBeatLoudness,EnergyBandRatio,HPCP,Key,ChordsDetection,ChordsDescriptors,HighResolutionFeatures,PoolAggregator,SingleGaussian,YamlOutput

Not sure whether different formats make use of different algos.

The essentia library with these is 4.5MiB in x86_64. I'm guessing that's the bare minimum to use the MusicExtractor.

Danoloan10 · 2023-01-06T16:35:38Z

It worked for MP3, WAV and Opus files too. Ogg seems to be unsupported.

kmod-midori · 2023-01-14T12:58:24Z

Do we have that thesis available? I wonder if simple kNN would just work as well...

Ogg seems to be unsupported.

Essentia directly uses libav for decoding, so ogg should be supported as long as libav/ffmpeg has support for that. No idea why that does not work in your environment though.

Danoloan10 · 2023-01-14T13:36:53Z

The extractor complains about the algo that it can't find so I just made a simple bash loop that ran the extractor on a FLAC file and then captured the missing algo and recompiled essentia adding it to the list. Essentia is an impressively good piece of code as each recompilation only builds two new files, the new algo and the algo index, so the loop doesn't take that long to finish.

I can look for the loop and share the FLAC file if you'd like

Build essentia in release

1285898

kmod-midori added 2 commits December 31, 2022 23:20

Remove fftw

e6f3647

Remove more algorithms

4f62669

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Build Essentia in release #295

Build Essentia in release #295

kmod-midori commented Dec 31, 2022

epoupon commented Dec 31, 2022

epoupon commented Dec 31, 2022 •

edited

Loading

kmod-midori commented Dec 31, 2022

kmod-midori commented Dec 31, 2022

epoupon commented Dec 31, 2022

epoupon commented Dec 31, 2022

kmod-midori commented Jan 1, 2023

Danoloan10 commented Jan 6, 2023 •

edited

Loading

Danoloan10 commented Jan 6, 2023

kmod-midori commented Jan 14, 2023 •

edited

Loading

Danoloan10 commented Jan 14, 2023 •

edited

Loading

Build Essentia in release #295

Are you sure you want to change the base?

Build Essentia in release #295

Conversation

kmod-midori commented Dec 31, 2022

epoupon commented Dec 31, 2022

epoupon commented Dec 31, 2022 • edited Loading

kmod-midori commented Dec 31, 2022

kmod-midori commented Dec 31, 2022

epoupon commented Dec 31, 2022

epoupon commented Dec 31, 2022

kmod-midori commented Jan 1, 2023

Danoloan10 commented Jan 6, 2023 • edited Loading

Danoloan10 commented Jan 6, 2023

kmod-midori commented Jan 14, 2023 • edited Loading

Danoloan10 commented Jan 14, 2023 • edited Loading

epoupon commented Dec 31, 2022 •

edited

Loading

Danoloan10 commented Jan 6, 2023 •

edited

Loading

kmod-midori commented Jan 14, 2023 •

edited

Loading

Danoloan10 commented Jan 14, 2023 •

edited

Loading