-
Notifications
You must be signed in to change notification settings - Fork 9
Experiments
Marco Fossati edited this page May 9, 2019
·
4 revisions
Applies to all experiments:
- stratified 5-fold cross validation over training/test splits;
- mean performance scores over the folds.
https://github.com/Wikidata/soweego/issues/285
- run: May 3 2019;
- output folder:
soweego-2.eqiad.wmflabs:/srv/dev/20190503/
; - head commit: d0d390e622f2782a49a1bd0ebfc64478ed34aa0c;
- command:
python -m soweego linker evaluate slp ${Dataset} ${Entity} optimizer=${Optimizer}
.
Optimizer | Precision | Recall | F-score |
---|---|---|---|
sgd | .782 | .945 | .856 |
rmsprop | .801 | .930 | .860 |
nadam | .805 | .925 | .861 |
adamax | .795 | .938 | .861 |
adam | .800 | .929 | .860 |
adagrad | .802 | .927 | .859 |
adadelta | .799 | .934 | .861 |
Optimizer | Precision | Recall | F-score |
---|---|---|---|
sgd | .815 | .985 | .892 |
rmsprop | .816 | .985 | .893 |
nadam | .816 | .986 | .893 |
adamax | .817 | .985 | .893 |
adam | .816 | .985 | .893 |
adagrad | .816 | .986 | .893 |
adadelta | .815 | .986 | .892 |
Optimizer | Precision | Recall | F-score |
---|---|---|---|
sgd | .918 | .954 | .936 |
rmsprop | .895 | .954 | .923 |
nadam | .908 | .954 | .930 |
adamax | .907 | .955 | .930 |
adam | .909 | .953 | .931 |
adagrad | .867 | .950 | .907 |
adadelta | .902 | .954 | .927 |
Optimizer | Precision | Recall | F-score |
---|---|---|---|
sgd | .912 | .927 | .920 |
rmsprop | .913 | .929 | .921 |
nadam | .913 | .929 | .921 |
adamax | .913 | .928 | .921 |
adam | .913 | .928 | .921 |
adagrad | .873 | .860 | .866 |
adadelta | .913 | .928 | .921 |
Optimizer | Precision | Recall | F-score |
---|---|---|---|
sgd | .917 | .942 | .929 |
rmsprop | .916 | .938 | .927 |
nadam | .916 | .938 | .927 |
adamax | .916 | .940 | .928 |
adam | .916 | .938 | .927 |
adagrad | .852 | .684 | .756 |
adadelta | .916 | .939 | .928 |
Optimizer | Precision | Recall | F-score |
---|---|---|---|
sgd | .929 | .943 | .936 |
rmsprop | .927 | .940 | .934 |
nadam | .930 | .940 | .935 |
adamax | .930 | .941 | .935 |
adam | .930 | .940 | .935 |
adagrad | .872 | .923 | .896 |
adadelta | .931 | .941 | .936 |
Optimizer | Precision | Recall | F-score |
---|---|---|---|
sgd | .952 | .869 | .909 |
rmsprop | .949 | .875 | .911 |
nadam | .949 | .877 | .911 |
adamax | .952 | .871 | .910 |
adam | .951 | .875 | .911 |
adagrad | .932 | .886 | .909 |
adadelta | .952 | .874 | .911 |
Optimizer | Precision | Recall | F-score |
---|---|---|---|
sgd | .942 | .957 | .949 |
rmsprop | .941 | .958 | .949 |
nadam | .941 | .958 | .949 |
adamax | .941 | .958 | .949 |
adam | .941 | .958 | .949 |
adagrad | .946 | .953 | .950 |
adadelta | .941 | .958 | .950 |
- All optimizers seem to do a similar job;
- no specific impact on the performance.
https://github.com/Wikidata/soweego/issues/176
- run: May 7 2019;
- output folder:
soweego-2.eqiad.wmflabs:/srv/dev/20190507/
; - head commit: ddd5d719793ea217267413a52d1d2e5b90c341a7;
- command:
python -m soweego linker evaluate ${Algorithm} ${Dataset} ${Entity}
.
Algorithm | Precision | Recall | F-score |
---|---|---|---|
nb max | .787 | .955 | .863 |
nb avg | .789 | .941 | .859 |
lsvm max | .780 | .960 | .861 |
lsvm avg | .785 | .946 | .858 |
svm max | .777 | .963 | .860 |
svm avg | .777 | .963 | .860 |
slp max | .784 | .954 | .861 |
slp avg | .776 | .956 | .857 |
mlp max | .822 | .925 | .870 |
Algorithm | Precision | Recall | F-score |
---|---|---|---|
nb max | .831 | .975 | .897 |
nb avg | .836 | .958 | .893 |
lsvm max | .818 | .985 | .894 |
lsvm avg | .814 | .986 | .892 |
svm max | .815 | .985 | .892 |
svm avg | .815 | .985 | .892 |
slp max | .821 | .983 | .895 |
slp avg | .815 | .985 | .892 |
mlp max | .852 | .963 | .904 |
Algorithm | Precision | Recall | F-score |
---|---|---|---|
nb max | .896 | .971 | .932 |
nb avg | .897 | .971 | .932 |
lsvm max | .919 | .943 | .931 |
lsvm avg | .919 | .942 | .930 |
svm max | .911 | .950 | .930 |
svm avg | .908 | .958 | .932 |
slp max | .917 | .953 | .935 |
slp avg | .867 | .953 | .908 |
mlp max | .913 | .964 | .938 |
Algorithm | Precision | Recall | F-score |
---|---|---|---|
nb max | .889 | .962 | .924 |
nb avg | .891 | .960 | .924 |
lsvm max | .917 | .938 | .927 |
lsvm avg | .917 | .937 | .927 |
svm max | .904 | .944 | .924 |
svm avg | .908 | .942 | .924 |
slp max | .924 | .929 | .926 |
slp avg | .922 | .914 | .918 |
mlp max | .912 | .951 | .931 |
Algorithm | Precision | Recall | F-score |
---|---|---|---|
nb max | .870 | .971 | .918 |
nb avg | .871 | .970 | .918 |
lsvm max | .920 | .940 | .930 |
lsvm avg | .920 | .938 | .929 |
svm max | .923 | .927 | .925 |
svm avg | .923 | .926 | .925 |
slp max | .914 | .940 | .927 |
slp avg | .862 | .914 | .883 |
mlp max | .911 | .956 | .933 |
Algorithm | Precision | Recall | F-score |
---|---|---|---|
nb max | .904 | .975 | .938 |
nb avg | .910 | .961 | .935 |
lsvm max | .936 | .949 | .943 |
lsvm avg | .936 | .948 | .942 |
svm max | .932 | .954 | .943 |
svm avg | .932 | .954 | .943 |
slp max | .938 | .946 | .942 |
slp avg | .903 | .955 | .928 |
mlp max | .930 | .963 | .946 |
Algorithm | Precision | Recall | F-score |
---|---|---|---|
nb max | .821 | .987 | .896 |
nb avg | .822 | .985 | .896 |
lsvm max | .944 | .879 | .910 |
lsvm avg | .943 | .888 | .914 |
svm max | .930 | .891 | .910 |
svm avg | .939 | .893 | .915 |
slp max | .953 | .865 | .907 |
slp avg | .930 | .885 | .907 |
mlp max | .906 | .918 | .911 |
Algorithm | Precision | Recall | F-score |
---|---|---|---|
nb max | .955 | .936 | .946 |
nb avg | .955 | .936 | .946 |
lsvm max | .941 | .963 | .952 |
lsvm avg | .941 | .962 | .952 |
svm max | .951 | .938 | .944 |
svm avg | .950 | .938 | .944 |
slp max | .942 | .957 | .949 |
slp avg | .943 | .956 | .949 |
mlp max | .939 | .970 | .954 |
Max Levenshtein has the following impact:
- NB is always improved or left untouched;
- LSVM is always improved, left untouched for IMDb director, but worsens for MusicBrainz band;
- SVM is often left untouched, but worsens for IMDb director and MusicBrainz band;
- SLP is always improved with the highest impact, left untouched for MusicBrainz;
- conclusion: max Levenshtein should replace the average one.
https://github.com/Wikidata/soweego/issues/174
- run: May 8 2019;
- output folder:
soweego-2.eqiad.wmflabs:/srv/dev/20190508/
; - head commit: 0c5137fc4fe446abdb6df6dbde277b7aa15881c5;
- command:
python -m soweego linker evaluate ${Algorithm} ${Dataset} ${Entity}
.
Algorithm | Precision | Recall | F-score |
---|---|---|---|
nb +sk | .788 | .942 | .859 |
nb | .789 | .941 | .859 |
lsvm +sk | .785 | .946 | .858 |
lsvm | .785 | .946 | .858 |
svm +sk | .778 | .963 | .861 |
svm | .777 | .963 | .860 |
slp +sk | .783 | .947 | .857 |
slp | .776 | .956 | .857 |
mlp +sk | .848 | .913 | .879 |
Algorithm | Precision | Recall | F-score |
---|---|---|---|
nb +sk | .836 | .958 | .893 |
nb | .836 | .958 | .893 |
lsvm +sk | .816 | .985 | .892 |
lsvm | .814 | .986 | .892 |
svm +sk | .815 | .985 | .892 |
svm | .815 | .985 | .892 |
slp +sk | .820 | .978 | .892 |
slp | .815 | .985 | .892 |
mlp +sk | .868 | .948 | .906 |
Algorithm | Precision | Recall | F-score |
---|---|---|---|
nb +sk | .897 | .971 | .932 |
nb | .897 | .971 | .932 |
lsvm +sk | .923 | .949 | .935 |
lsvm | .919 | .942 | .930 |
svm +sk | .914 | .950 | .931 |
svm | .908 | .958 | .932 |
slp +sk | .918 | .955 | .936 |
slp | .867 | .953 | .908 |
mlp +sk | .918 | .964 | .941 |
Algorithm | Precision | Recall | F-score |
---|---|---|---|
nb +sk | .891 | .961 | .924 |
nb | .891 | .960 | .924 |
lsvm +sk | .922 | .941 | .931 |
lsvm | .917 | .937 | .927 |
svm +sk | .910 | .949 | .929 |
svm | .908 | .942 | .924 |
slp +sk | .922 | .934 | .928 |
slp | .922 | .914 | .918 |
mlp +sk | .914 | .958 | .935 |
Algorithm | Precision | Recall | F-score |
---|---|---|---|
nb +sk | .871 | .970 | .918 |
nb | .871 | .970 | .918 |
lsvm +sk | .921 | .943 | .932 |
lsvm | .920 | .938 | .929 |
svm +sk | .923 | .927 | .925 |
svm | .923 | .926 | .925 |
slp +sk | .916 | .942 | .929 |
slp | .862 | .914 | .883 |
mlp +sk | .912 | .959 | .935 |
Algorithm | Precision | Recall | F-score |
---|---|---|---|
nb +sk | .910 | .961 | .935 |
nb | .910 | .961 | .935 |
lsvm +sk | .938 | .953 | .945 |
lsvm | .936 | .948 | .942 |
svm +sk | .933 | .957 | .945 |
svm | .932 | .954 | .943 |
slp +sk | .939 | .948 | .943 |
slp | .903 | .955 | .928 |
mlp +sk | .931 | .968 | .949 |
Algorithm | Precision | Recall | F-score |
---|---|---|---|
nb +sk | .821 | .985 | .896 |
nb | .822 | .985 | .896 |
lsvm +sk | .940 | .895 | .917 |
lsvm | .943 | .888 | .914 |
svm +sk | .937 | .899 | .918 |
svm | .939 | .893 | .915 |
slp +sk | .952 | .873 | .911 |
slp | .930 | .885 | .907 |
mlp +sk | .937 | .904 | .920 |
Algorithm | Precision | Recall | F-score |
---|---|---|---|
nb +sk | .955 | .936 | .946 |
nb | .955 | .936 | .946 |
lsvm +sk | .938 | .965 | .951 |
lsvm | .941 | .962 | .952 |
svm +sk | .951 | .938 | .944 |
svm | .950 | .938 | .944 |
slp +sk | .941 | .958 | .950 |
slp | .943 | .956 | .949 |
mlp +sk | .939 | .972 | .955 |
The string kernel feature:
- has the most positive impact on SLP;
- slightly improves performance in most cases, but sligthly worsens:
- precision in 1 case, i.e., NB for MusicBrainz band;
- recall in 3 cases, i.e., SLP for Discogs band, LSVM & SLP for Discogs musician;
- f-score in 2 cases, i.e., SVM for IMDb director, LSVM for MusicBrainz musician.
- conclusion: the string kernel feature should be added.