Generate message types #14

robinkrahl · 2022-08-31T10:42:25Z

This is a first draft implementation of #13 with some open points as indicated by the TODO comments. Still, using the parse example, I was able to parse some real-world test files as well as the files from the tests/fixtures directory.

stadelmanma · 2022-10-28T03:16:47Z

@robinkrahl any news on this? Apologies for going AWOL for a while.

robinkrahl · 2022-10-29T17:37:21Z

I think we should first talk about whether this is generally the right approach. Then we can discuss the open points mentioned in the TODO comments.

stadelmanma · 2022-10-29T18:39:54Z

I do wonder if we’d be ahead to move the “FitDataRecord” concept into a trait that each of these message structs would implement and return structs of the actual message types directly instead. From a parsing standpoint usage would probably be the same (returns Vec<T: FitDataRecord> or something) which provides the same access/serialization methods as before. It feels a little more intuitive to return the specific thing and be able to use it in a generic fashion rather than return the generic thing and convert it into the more specific thing.

stadelmanma · 2022-10-29T18:47:15Z

I’ve worked off and on at a different, more efficient decoding process: instead of creating those MessageInfo structs at runtime, each message type has a decode function that builds the decoded data record directly. It’s about twice as fast, since repeatedly building a MessageInfo struct is rather expensive. I haven’t yet figured out how to handle fields that get split up and transformed into other things, but this approach would work nicely for building a message-type-specific struct. See the ‘function-based-decoding’ branch; I just started on an updated version of it this morning.
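To make the idea concrete, here is a minimal sketch of per-message decode functions. It is not the code from the branch: the `Record` struct, the `Decoded` enum, the `RawFields` input shape, and the message and field numbers used in the match arms are all illustrative assumptions.

```rust
use std::convert::TryFrom;

// Hypothetical raw input: (field number, raw value) pairs for one message.
pub type RawFields = [(u8, u32)];

// Hypothetical message struct with two typed fields.
#[derive(Debug, Default, PartialEq)]
pub struct Record {
    pub heart_rate: Option<u8>,
    pub cadence: Option<u8>,
}

// Hypothetical result of decoding one message.
#[derive(Debug, PartialEq)]
pub enum Decoded {
    Record(Record),
    Unknown(u16),
}

// One decode function per message type: the field layout lives in the match
// arms at compile time instead of in a description struct built at runtime.
// The field numbers 3 and 4 are placeholders for this sketch.
fn decode_record(fields: &RawFields) -> Record {
    let mut record = Record::default();
    for &(number, raw) in fields {
        match number {
            3 => record.heart_rate = u8::try_from(raw).ok(),
            4 => record.cadence = u8::try_from(raw).ok(),
            _ => {} // fields unknown to this sketch are skipped
        }
    }
    record
}

// Dispatch on the message number (20 is a placeholder) to the matching
// decode function.
pub fn decode(message_number: u16, fields: &RawFields) -> Decoded {
    match message_number {
        20 => Decoded::Record(decode_record(fields)),
        other => Decoded::Unknown(other),
    }
}
```

Fields that are split into components or resolved through subfields, the hard part mentioned above, are deliberately left out of this sketch.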

robinkrahl · 2022-10-29T18:52:28Z

> return structs of the actual message types directly instead

Typically, a FIT file will contain many different messages so I don’t see how we could return specific types. The Message enum could be used to remove the parsing step though.

> I’ve worked off and on trying to implement a different decoding process that’s more efficient

Sounds good! I also experimented with moving much of that information into constants so that more work can be done at compile time. But I did not want to introduce too many changes at the same time. I’ll have a look at your branch next week.

robinkrahl · 2022-10-29T18:54:11Z

Can you please link the branch you’re working on? I don’t see it in the branch list.

stadelmanma · 2022-10-29T21:36:34Z

Sorry about that, I didn't realize that it wasn't pushed yet. https://github.com/stadelmanma/fitparse-rs/blob/function-based-decoding-old/src/profile/decode.rs

And by returning specific types I was meaning our message structs would look like this

pub struct Activity {
    pub total_timer_time: Option<f64>,
    pub num_sessions: Option<u16>,
    pub r#type: Option<fitparser::profile::field_type::Activity>,
    // ....
}

impl FitDataRecord for Activity {
    // ...
}

And then the parsing methods would return a trait type, at least I think that's possible. My Rust is a little ... rusty.

from_reader(...) -> Result<Vec<T: FitDataRecord>>

robinkrahl · 2022-10-30T07:08:08Z

> from_reader(...) -> Result<Vec<T: FitDataRecord>>

But this would mean that we can only parse messages of a single type. If we want to support all messages, we either have to use a Box which is not really ergonomic, or use a wrapper enum like Message.
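The two options can be compared in a small sketch. Everything here is a simplified assumption for illustration: the structs carry one field each, the `FitDataRecord` trait has a single made-up `kind` method, and `parse_boxed` and `parse_enum` stand in for a real parser.

```rust
// Hypothetical, simplified message structs.
#[derive(Debug, Clone, PartialEq)]
pub struct Activity {
    pub num_sessions: Option<u16>,
}

#[derive(Debug, Clone, PartialEq)]
pub struct Record {
    pub heart_rate: Option<u8>,
}

// Hypothetical stand-in for the trait idea discussed above.
pub trait FitDataRecord {
    fn kind(&self) -> &'static str;
}

impl FitDataRecord for Activity {
    fn kind(&self) -> &'static str {
        "activity"
    }
}

impl FitDataRecord for Record {
    fn kind(&self) -> &'static str {
        "record"
    }
}

// Option 1: trait objects. Mixed message types fit into one Vec, but callers
// only see the trait methods, not the concrete fields.
pub fn parse_boxed() -> Vec<Box<dyn FitDataRecord>> {
    vec![
        Box::new(Activity { num_sessions: Some(1) }),
        Box::new(Record { heart_rate: Some(140) }),
    ]
}

// Option 2: a wrapper enum. Callers match on the variant and get the
// concrete struct with its public fields.
#[derive(Debug, Clone, PartialEq)]
pub enum Message {
    Activity(Activity),
    Record(Record),
}

pub fn parse_enum() -> Vec<Message> {
    vec![
        Message::Activity(Activity { num_sessions: Some(1) }),
        Message::Record(Record { heart_rate: Some(140) }),
    ]
}

// Collect heart rates by matching on the enum; other messages are skipped.
pub fn heart_rates(messages: &[Message]) -> Vec<u8> {
    messages
        .iter()
        .filter_map(|m| match m {
            Message::Record(r) => r.heart_rate,
            _ => None,
        })
        .collect()
}
```

With the boxed variant, getting at `heart_rate` would require downcasting or extra trait methods, which is the ergonomic cost mentioned above.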

stadelmanma · 2022-10-30T15:05:10Z

Ah yep, you’re right. I’ve been spending too much time in Java-land where you can do that kind of thing with interfaces.

Looking back at the issue the example struct for each message looks good, but I think I would prefer to add getters instead of using public members, and maybe setters as well. I’m 50/50 on setters since this library doesn’t support writing data but there certainly could be some use cases and I don’t want to put unnecessary limitations on folks.

I’m not a huge fan of the TryFrom logic for the conversion to the rich type. Since we already know what message type we have at decode time it would be nice to start with the rich type instead of trying to find our way back there. The enum wrapper route you mentioned might fit nicely, especially since we already use a Message enum to kick off the decoding process anyways.

It’d be nice to not have significant breaking API changes to the high level “from_X” parsing functions but they could convert the rich type into the generic one. The conversion from rich to generic could be done with a “From” implementation I think?
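A minimal sketch of such a conversion, assuming a much-simplified generic record: `GenericRecord`, its `kind` and `fields` members, and the two-variant `Value` enum are hypothetical stand-ins, not the library's actual types.

```rust
use std::collections::BTreeMap;

// Hypothetical, reduced value type.
#[derive(Debug, Clone, PartialEq)]
pub enum Value {
    UInt16(u16),
    Float64(f64),
}

// Hypothetical generic record: a message kind plus named fields.
#[derive(Debug, Clone, PartialEq)]
pub struct GenericRecord {
    pub kind: &'static str,
    pub fields: BTreeMap<&'static str, Value>,
}

// Rich type with a subset of the fields from the Activity example above.
#[derive(Debug, Clone, PartialEq)]
pub struct Activity {
    pub total_timer_time: Option<f64>,
    pub num_sessions: Option<u16>,
}

// Rich to generic never fails, so From (rather than TryFrom) is enough.
impl From<Activity> for GenericRecord {
    fn from(activity: Activity) -> Self {
        let mut fields = BTreeMap::new();
        // Only fields that are present end up in the generic record.
        if let Some(v) = activity.total_timer_time {
            fields.insert("total_timer_time", Value::Float64(v));
        }
        if let Some(v) = activity.num_sessions {
            fields.insert("num_sessions", Value::UInt16(v));
        }
        GenericRecord { kind: "activity", fields }
    }
}
```

The existing high-level parsing functions could then call `.into()` on each rich message to keep returning the generic type.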

What kind of use cases do you envision being supported by explicit types? That might be the best way to inform our implementation route.

robinkrahl · 2022-10-30T16:33:45Z

> I think I would prefer to add getters instead of using public members, and maybe setters as well.

I prefer public members because they are easier to use and they lead to cleaner code. I don’t see any reason to use getters and setters because there are no invariants to be enforced.

> I’m not a huge fan of the TryFrom logic for the conversion to the rich type.

I only added that step because I did not want to break compatibility and I was under the impression that you preferred to use the raw type. I agree that returning the enum instead is much more ergonomic.

> What kind of use cases do you envision being supported by explicit types?

Generally speaking, my goal is to make it easier to use the library and to discover the data structures from the API docs. My specific use case is this conversion code that already uses this PR. Ideally, I could also drop the unit conversions if fitparser takes care of them (as discussed in #13), but let’s do one thing at a time.

stadelmanma · 2022-11-02T03:56:48Z

> I prefer public members because they are easier to use and they lead to cleaner code. I don’t see any reason to use getters and setters because there are no invariants to be enforced.

@robinkrahl that's fair, I can be persuaded to avoid the getters, again too much time spent in java-land nowadays. Components and subfields in a FitMessage do represent an invariant case between the parent and child fields but I don't think it's worth the hassle to bake that logic in.

I like the idea of adding uom as a feature if we can do it neatly. I think it would make some parts of my run-tracking code that uses this library cleaner as well in that department. So if you'd like to throw a separate issue up for that enhancement that'd be great.

I pushed some changes to the profile generation code to split it up from a giant monolithic main script into a few files, each with a specific purpose. I'm not entirely sure yet whether it makes the profile generation library easier to navigate, but at least I don't have to scroll as much. I think long term it makes more sense to call the file your code generates messages.rs; in my function-based decoding branch I was starting to call what is now messages.rs decode.rs instead, since that's its purpose.

beyoung · 2023-06-29T09:44:00Z

Thanks for your great work on this feature 😊. Is there any progress on it?

stadelmanma · 2023-07-07T13:26:09Z

@beyoung I don't have a use case for this feature myself so I haven't attempted to finish it up, and the underlying profile generation code has changed dramatically since this PR was initially drafted.

If you have a concrete need for it you are welcome to pick it up!

beyoung · 2023-08-30T03:05:52Z

> @beyoung I don't have a use case for this feature myself so I haven't attempted to finish it up, and the underlying profile generation code has changed dramatically since this PR was initially drafted.
> If you have a concrete need for it you are welcome to pick it up!

I wanted to call fitparse-rs code from Python through pyo3, and it would have been better to return a well-defined structure. I know there's an example that serializes FIT to JSON, but the performance is bad when decoding large JSON in Python. In the end I found that I can return a pyo3 PyObject and receive the object as a dict on the Python side. Now everything works great. Thanks for your awesome work on this project. 😊

stadelmanma · 2023-08-30T12:39:58Z

@beyoung sounds great. If there are ways to streamline the process of calling the code’s API from Python, I’m all ears. I’ve written C extensions for Python but never Rust bindings before.

beyoung · 2023-09-01T03:07:19Z

> @beyoung sounds great. If there are ways to streamline the process of calling the code’s api from Python I’m all ears. I’ve written c-extensions for Python but never rust bindings before.

Do you mean something like the example/streaming.rs in the repo? It's super easy to write Rust bindings for Python when you use packages like pyo3 and maturin.

stadelmanma · 2023-09-04T14:11:45Z

@beyoung I don’t have any concrete ideas in particular. But given how I’ve structured the library and how Python works, returning some kind of iterable structure probably makes sense, since callers can then process the data without needing to pass over it multiple times.

robinkrahl · 2024-02-18T14:40:35Z

I’m currently trying to clean up some of my unpublished projects, so I need to get rid of Git dependencies. Therefore I’d like to revisit this PR/feature request and try to get it merged. I’ll update this PR and rebase it onto the current master branch soon.

Initially, my plan was to have a complete implementation in this PR. But maybe it would make more sense to split this up into several smaller PRs. What do you think? Also, would you be fine with an initial solution that does not cover every edge case? I think having message structs with proper typing for 95 % of the fields would already be a big improvement.

stadelmanma · 2024-02-18T20:26:03Z

> I’m currently trying to clean up some of my unpublished projects, so I need to get rid of Git dependencies. Therefore I’d like to revisit this PR/feature request and try to get it merged. I’ll update this PR and rebase it onto the current master branch soon.

Sounds great!

> Initially, my plan was to have a complete implementation in this PR. But maybe it would make more sense to split this up into several smaller PRs. What do you think?

It might make the most sense to have a long running feature branch you do incremental PRs into. Then we merge the feature branch all at once when it’s ready. That would keep code review in manageable chunks but not add incomplete bits to the main library until we’re happy with it.

> Also, would you be fine with an initial solution that does not cover every edge case? I think having message structs with proper typing for 95 % of the fields would already be a big improvement.

I’d likely be fine with that, it’s usually very hard to cover all the bases in the first go around. Although I do think it depends on how we handle the edge cases, i.e. just less ergonomic to use vs some fields are simply unwritable vs unexpected panics. For example, it would likely be fine if the first pass is not able to handle writing out a data field that requires multiple rounds of subfield/component resolution (maybe with an appropriate error state to avoid generating invalid files). But I’d be less keen on it just bailing out with a hard panic out of the blue if that makes sense.

robinkrahl · 2024-02-18T20:36:00Z

I’ve updated the PR and rebased it onto the current master branch.

The main changes from the initial implementation are:

- Instead of trying to convert the data to the correct data type, all fields are Option<Value> in this implementation. I’ll try to add data type support in a separate PR.
- Instead of manually generating the code using string formatting, quote! is used. I think this makes the code easier to read as you have syntax highlighting for the generated code, you don’t need to add the writeln! boilerplate and you don’t need to escape braces. It probably increases compile time, but as this only affects the offline code generation, I think that’s acceptable. Let me know if you want me to use the writeln! approach instead.
- It adds MessageParseOptions that let the user decide whether unexpected fields should cause an error. I think it makes sense to error out per default so that unknown fields are not discarded, but sometimes users might want to just parse the supported fields.
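The behavior described in the last point could look roughly like the sketch below. Only the name `MessageParseOptions` comes from this PR; the `ignore_unexpected_fields` flag, the `ParseError` enum, the reduced `Activity` struct, and the `parse_activity` function with its map-based input are assumptions made up for illustration.

```rust
use std::collections::BTreeMap;

// Hypothetical stand-in for the options struct; the field name is assumed.
// Deriving Default gives `false`, so parsing is strict by default.
#[derive(Debug, Clone, Copy, Default)]
pub struct MessageParseOptions {
    pub ignore_unexpected_fields: bool,
}

// Hypothetical error type for this sketch.
#[derive(Debug, PartialEq)]
pub enum ParseError {
    UnexpectedField(String),
}

// Hypothetical message struct with a single known field.
#[derive(Debug, Default, PartialEq)]
pub struct Activity {
    pub num_sessions: Option<i64>,
}

// Build an Activity from named raw fields. Unknown fields either abort the
// parse or are silently dropped, depending on the options.
pub fn parse_activity(
    fields: &BTreeMap<String, i64>,
    options: MessageParseOptions,
) -> Result<Activity, ParseError> {
    let mut activity = Activity::default();
    for (name, value) in fields {
        match name.as_str() {
            "num_sessions" => activity.num_sessions = Some(*value),
            _ if options.ignore_unexpected_fields => {}
            _ => return Err(ParseError::UnexpectedField(name.clone())),
        }
    }
    Ok(activity)
}
```

Making strict parsing the default means unknown data is never dropped silently; callers have to opt in to the lenient mode.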

robinkrahl · 2024-02-18T22:14:13Z

> It might make the most sense to have a long running feature branch you do incremental PRs into.

Sounds good to me. Just let me know which branch to base the PR against.

> Although I do think it depends on how we handle the edge cases, i.e. just less ergonomic to use vs some fields are simply unwritable vs unexpected panics.

My plan would be to provide an implementation that is at least as stable and ergonomic as the current FitDataRecord path, and then gradually improve ergonomics.

The initial implementation in this PR has all fields as Option<Value>. I would then start to try replacing Value with more appropriate types depending on the field. E. g. fields with FieldDataType::UInt8 should use u8 (or some wrapper with improved error handling). Cases that are not handled yet would just mean that users have to look at their data or the profile definition and then manually convert the Value to the correct data type – the same thing they already have to do today. I don’t think it makes sense to spend weeks or months to handle every theoretical case before releasing these changes that greatly improve overall usability.
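As a rough illustration of the manual conversion users would do in the meantime, and of what a typed field could later do for them: the three-variant `Value` enum, the `WrongType` error, and the `typed_field` helper below are assumptions for this sketch, not the library's definitions.

```rust
use std::convert::TryFrom;

// Hypothetical, reduced value type.
#[derive(Debug, Clone, PartialEq)]
pub enum Value {
    UInt8(u8),
    UInt16(u16),
    String(String),
}

// Hypothetical error reported when a Value has an unexpected variant.
#[derive(Debug, PartialEq)]
pub struct WrongType {
    pub expected: &'static str,
}

// Convert a generic Value into the concrete type a UInt8 field should have.
impl TryFrom<Value> for u8 {
    type Error = WrongType;

    fn try_from(value: Value) -> Result<Self, Self::Error> {
        match value {
            Value::UInt8(v) => Ok(v),
            _ => Err(WrongType { expected: "UInt8" }),
        }
    }
}

// Turn an Option<Value> field into an Option<u8>. A missing field stays
// None; a present field of the wrong variant becomes an error, not a panic.
pub fn typed_field(raw: Option<Value>) -> Result<Option<u8>, WrongType> {
    raw.map(u8::try_from).transpose()
}
```

Returning an error for a mismatched variant, rather than panicking, matches the preference stated earlier in this thread.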

stadelmanma · 2024-02-21T02:00:44Z

We can use this branch, feature/13-add-data-types-for-messages. I agree once something is fairly stable and usable we can try rolling it out with some appropriate docs that it's in an "experimental" mode and the API is subject to change.

This patch adds one struct per message type to the profile::messages module as well as a Message enum for all supported message types. The messages can be parsed from a FitDataRecord.

robinkrahl · 2024-02-21T08:52:29Z

Okay, I’ve rebased the PR onto the feature branch.

fitparser/src/error.rs

generate-fit-profile/Cargo.toml

generate-fit-profile/src/main.rs

stadelmanma · 2024-02-22T12:29:42Z

I think this looks good. I left a few questions, mostly out of curiosity and to improve my own knowledge.

robinkrahl · 2024-02-22T15:14:21Z

Thank you for the review!

stadelmanma mentioned this pull request Oct 10, 2023

Support writing FIT files #33

Open

robinkrahl force-pushed the types branch from 289a0c8 to 0d70326 Compare February 18, 2024 20:24

robinkrahl changed the title ~~[wip] Generate message types~~ Generate message types Feb 18, 2024

robinkrahl marked this pull request as ready for review February 18, 2024 20:25

robinkrahl force-pushed the types branch from 0d70326 to 5d0ec07 Compare February 18, 2024 20:31

robinkrahl mentioned this pull request Feb 18, 2024

Investigate if the codegen crate would make the profile generation code cleaner #25

Closed

robinkrahl changed the base branch from master to feature/13-add-data-types-for-messages February 21, 2024 08:49

Add message structs to generated profile

ba67b03

This patch adds one struct per message type to the profile::messages module as well as a Message enum for all supported message types. The messages can be parsed from a FitDataRecord.

robinkrahl force-pushed the types branch from 5d0ec07 to ba67b03 Compare February 21, 2024 08:51

stadelmanma reviewed Feb 22, 2024

fitparser/src/error.rs (resolved)

stadelmanma reviewed Feb 22, 2024

generate-fit-profile/Cargo.toml (resolved)

stadelmanma reviewed Feb 22, 2024

generate-fit-profile/src/main.rs (resolved)

stadelmanma merged commit 5177185 into stadelmanma:feature/13-add-data-types-for-messages Feb 22, 2024
1 check passed

robinkrahl deleted the types branch February 22, 2024 15:14

robinkrahl mentioned this pull request Feb 22, 2024

Add data types for messages #13

Open
