
Feat/bayesian ridge #247

Merged: 90 commits into elixir-nx:main, Apr 21, 2024

Conversation

@JoaquinIglesiasTurina (Contributor) commented Apr 3, 2024

While I still have work to do, namely:

  • documentation
  • handling the case with more features than samples
  • fixing the internal variable naming of coefficients and the like,

I have non-passing tests for each of the missing features.
However, the lion's share of the Bayesian Ridge is done, and I wanted to open this PR so code review can get started, and to ask a couple of questions.

Scores

Given that the computation is done inside a defn, I am not sure it is possible to have it be optional, as it is in scikit-learn.
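
As an illustration of what might still work, here is a minimal sketch (the module name and the stand-in math are hypothetical, not this PR's code): defn options are compile-time constants, so an if on the option decides at compilation time whether the scores are traced at all.

defmodule ScoresSketch do
  import Nx.Defn

  defn fit(x, y, opts \\ []) do
    opts = keyword!(opts, compute_scores: false)

    # stand-in for the actual coefficient computation
    coefficients =
      Nx.LinAlg.solve(Nx.dot(Nx.transpose(x), x), Nx.dot(Nx.transpose(x), y))

    scores =
      if opts[:compute_scores] do
        # stand-in for the log marginal likelihood; only traced when requested
        -Nx.sum(Nx.pow(y - Nx.dot(x, coefficients), 2))
      else
        Nx.Constants.nan()
      end

    {coefficients, scores}
  end
end

Calling ScoresSketch.fit(x, y, compute_scores: true) compiles a variant that includes the score computation, so the option itself never enters the numerical code path.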

I am afraid the testing code is not very idiomatic.
Also, I've had to make a precision compromise: I could not get exactly the same score with both methods, but the scores are within 5% of each other.
Please compare with the scikit-learn test.
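
For concreteness, a minimal sketch of such a tolerance check inside an ExUnit test (the values are hypothetical placeholders): instead of exact equality with the scikit-learn score, it allows a 5% relative difference.

score = Nx.tensor(-75.0)
expected = Nx.tensor(-77.0)
assert Nx.to_number(Nx.all_close(score, expected, rtol: 0.05)) == 1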

Diabetes data

I've brought in the diabetes data to replicate the scikit-learn tests.
Is this data open source?
Can I just include it?
Is keeping it as a .csv in its current location a good idea?

Matrix inversion times out

I've had some issues with matrix inversion on the diabetes data; mainly, it makes the tests time out.
I've tried running a linear regression with the diabetes data, and that also times out, so I'm fairly sure the problem is not specific to my implementation of this algorithm.
From what I've gathered, this problem is related to this issue. Is that understanding correct?

To avoid the timeout I've limited the data and checked that I get results similar to those from NumPy and scikit-learn. However, that is not an automated test.
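
A minimal sketch of that workaround (x, y, and the row count are hypothetical placeholders): slice the diabetes data down to its first rows so the matrix inversion stays tractable.

x_small = Nx.slice_along_axis(x, 0, 50, axis: 0)
y_small = Nx.slice_along_axis(y, 0, 50, axis: 0)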

Module name

I've named the module BayesianRidgeRegression; maybe it should just be called BayesianRidge.

Closes #244.

JoaquinIglesiasTurina and others added 30 commits, March 21, 2024 22:37
@JoaquinIglesiasTurina (Contributor Author)

Thank you all for the code review. I think I've addressed all the points brought up.

Right now, only the documentation is missing. Once it is done, I will push everything at once, to avoid triggering the CI pipeline more than necessary.

@krstopro (Member) commented Apr 6, 2024

I think sample_weights_flag can be removed by setting the default sample_weights to a tensor of ones, i.e.

sample_weights = Nx.broadcast(Nx.as_type(1.0, x_type), {num_samples})

(this holds for a few other modules as well)
I am still not sure what the default way to pass weights as an option is. In several modules it is assumed that they are a list of num_samples elements, yet in Scholar.Options they are allowed to be an Nx.Tensor as well; see https://github.com/elixir-nx/scholar/blob/main/lib/scholar/options.ex#L78.
@msluszniak @josevalim Any comments on this one?
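
For illustration, a hedged sketch (not code from this PR; opts, x_type, and num_samples are assumed context) of accepting either a list or an Nx.Tensor for :sample_weights, with a tensor of ones as the default:

sample_weights =
  case opts[:sample_weights] do
    # no weights given: default to a tensor of ones, as suggested above
    nil -> Nx.broadcast(Nx.tensor(1.0, type: x_type), {num_samples})
    # a plain list of num_samples elements
    weights when is_list(weights) -> Nx.tensor(weights, type: x_type)
    # already an Nx.Tensor: only align the type
    weights -> Nx.as_type(weights, x_type)
  end

This normalization runs in regular Elixir, outside defn, so either input form reaches the numerical code as a tensor.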

I think it's a good idea to try to unify this across all modules. We might create a separate issue and point to your comments. Do you want me to create this issue?

I would move this to a separate issue. The same goes for input validation, e.g. checking that x is of rank 2, that y is of rank 1 or 2, that the first dimensions of x and y match, etc.

@msluszniak By "I would move this in a separate issue" I meant "you are welcome to open a separate issue for this". Sorry, I now realise I wasn't clear enough. 😅
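
For reference, a hedged sketch of the input validation discussed above (a hypothetical helper, not part of this PR):

defmodule ValidationSketch do
  def validate!(x, y) do
    if Nx.rank(x) != 2 do
      raise ArgumentError,
            "expected x to have shape {num_samples, num_features}, got: #{inspect(Nx.shape(x))}"
    end

    if Nx.rank(y) not in [1, 2] do
      raise ArgumentError, "expected y to have rank 1 or 2, got: #{Nx.rank(y)}"
    end

    if Nx.axis_size(x, 0) != Nx.axis_size(y, 0) do
      raise ArgumentError, "the first dimensions of x and y must match"
    end

    :ok
  end
end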

@JoaquinIglesiasTurina (Contributor Author)

From my end, I don't think there is anything else left to do. Please let me know if you think otherwise.

@krstopro (Member)

> From my end, I don't think there is anything else left to do. Please let me know if you think otherwise.

@JoaquinIglesiasTurina Sure, will have a look later today.

@krstopro (Member) left a comment

A few minor comments.

(Two review comments on lib/scholar/linear/bayesian_ridge_regression.ex were marked resolved and outdated.)
{scalar * x, scalar * y}

_ ->
  scale = sample_weights |> Nx.sqrt() |> Nx.make_diagonal()
@krstopro (Member)

This can be simplified by doing

Suggested change:
- scale = sample_weights |> Nx.sqrt() |> Nx.make_diagonal()
+ scale = sample_weights |> Nx.sqrt() |> Nx.new_axis(1)

and using * instead of Nx.dot.

@JoaquinIglesiasTurina (Contributor Author)

I do not believe this change is possible.
Please note that a is an {n_samples}-sized vector, b is an {n_samples, n_features} matrix, and sample_weights is an {n_samples}-sized vector.

sample_weights |> Nx.sqrt() |> Nx.new_axis(1) yields an {n_samples, 1} matrix, which cannot be multiplied with a.

As far as I can tell, sample_weights |> Nx.sqrt() |> Nx.make_diagonal() yields the only matrix that can be dotted with both a and b.

An alternative would be keeping two scale tensors, one for a and one for b. I personally do not like this option, as it would unbalance the function: you would have a different tensor for each piece of data, and the operations would look different from the other branch of the case statement.

Please let me know if you think of a better way.

@krstopro (Member) commented Apr 21, 2024

You are correct, but we are able to use * instead of Nx.dot. This should work:

defnp rescale(x, y, sample_weights) do
  factor = Nx.sqrt(sample_weights)

  x_scaled =
    case Nx.shape(factor) do
      # scalar weight: plain scalar multiplication
      {} -> factor * x
      # per-sample weights: broadcast an {n_samples, 1} column against x
      _ -> Nx.new_axis(factor, 1) * x
    end

  y_scaled = factor * y
  {x_scaled, y_scaled}
end

@JoaquinIglesiasTurina (Contributor Author)

This works, and I find it a pretty clean solution. Thank you for your comments.

@josevalim merged commit f64e65a into elixir-nx:main on Apr 21, 2024
2 checks passed
@josevalim (Contributor)

💚 💙 💜 💛 ❤️
