Feature request: Allow coef and p-value extraction by variable name #491

eirikbrandsaas · 2022-08-02T20:31:29Z

Hi,

It would be great (and safer?) if one could extract coeffecients by variable names:

df = DataFrame(y=rand(3),x=rand(3))
out = reg(df,@formula(y~x))
out.coef[findfirst(isequal("x"),out.coefnames)] # hard to do
out.coef["x"] # would be great
out.coef[:x] # would be great

Or really any thing like that.

See e.g., https://discourse.julialang.org/t/how-to-obtain-the-pvalues-of-the-coefficients-in-glm-jl/9531/4

The text was updated successfully, but these errors were encountered:

ararslan · 2022-08-02T21:45:11Z

I guess in addition to directly delegating coef to the model object in https://github.com/JuliaStats/StatsModels.jl/blob/61de82aa23fb562697fe0f750f6f83ca7be79506/src/statsmodel.jl#L128 we could define e.g.

coef(model, term) = coef(model)[findfirst(==(term), coefnames(model))]
model = fit(Whatever, @formula(y ~ 1 + x), data)
coef(model, :x)  # coefficient for `x`

That could only sensibly support table-based models though, since those are the only ones for which you know the coefficient names. (e.g. this wouldn't work for models fit with an explicit design matrix rather than a formula)

xgdgsc · 2023-06-14T12:42:35Z

This should be added or at least documented.

andreasnoack · 2024-11-21T10:46:16Z

I think we should consider if we can do something here before releasing 2.0. The raw vectors without any context aren't that helpful

ararslan · 2024-11-21T18:00:12Z

The raw vectors without any context aren't that helpful

How do you mean? Like coef(model) returning a Vector? If it didn't, I'd be concerned about possible performance regressions for downstream linear algebra computations that use the result of coef.

I guess it could be convenient for coef(::TableRegressionModel{<:GeneralizedLinearModel}) to return e.g. a 1-dimensional AxisArray and coef(::GeneralizedLinearModel) to return a Vector?

andreasnoack · 2024-11-21T18:32:21Z

If it didn't, I'd be concerned about possible performance regressions for downstream linear algebra computations that use the result of coef.

Costless abstractions and all that. Hopefully we can have both. I just think it's really error prone not to have some kind of label associating an estimate in a vector with a parameter name or an effect name. That being said, I'm not sure what the right implementation would look like.

andreasnoack added this to the Release 2.0 milestone Nov 21, 2024

andreasnoack added enhancement breaking labels Nov 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature request: Allow coef and p-value extraction by variable name #491

Feature request: Allow coef and p-value extraction by variable name #491

eirikbrandsaas commented Aug 2, 2022

ararslan commented Aug 2, 2022

xgdgsc commented Jun 14, 2023

andreasnoack commented Nov 21, 2024

ararslan commented Nov 21, 2024

andreasnoack commented Nov 21, 2024

Feature request: Allow coef and p-value extraction by variable name #491

Feature request: Allow coef and p-value extraction by variable name #491

Comments

eirikbrandsaas commented Aug 2, 2022

ararslan commented Aug 2, 2022

xgdgsc commented Jun 14, 2023

andreasnoack commented Nov 21, 2024

ararslan commented Nov 21, 2024

andreasnoack commented Nov 21, 2024