
Nexus transforms improvements #126

Merged (11 commits into main, Nov 5, 2024)
Conversation

SimonHeybrock (Member) commented on Nov 4, 2024:

This collects a number of small but necessary improvements I ran into while trying to use GenericNeXusWorkflow on NMX files. I recommend looking at the individual commit messages.

Related: #96 (solving the simplest case).

@@ -192,10 +192,10 @@ class Filename(sciline.Scope[RunType, Path], Path): ...


 @dataclass
-class PulseSelection(Generic[RunType]):
+class TimeInterval(Generic[RunType]):
     """Range of neutron pulses to load from NXevent_data or NXdata groups."""
Member:

Only pulses or also logs?

SimonHeybrock (Member Author):

For the logs in NXtransformations it loads the full log; the time interval is later used to determine which values are relevant.

The workflow does not currently load any other logs.

Member:

But is your intention to use TimeInterval for other logs, too?

SimonHeybrock (Member Author):

Not really at the moment, since the label-based slicing in Scipp and ScippNeXus does not do what we need (include the previous value). We thus want to load "more" than the naive slice says. Unless we get very large logs, it seems easier to just load everything and then map events to log values.
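[Editor's note] To illustrate the slicing issue, a minimal sketch with made-up values (not code from this PR): Scipp's label-based slice selects entries with start <= time < stop, so the log entry still in effect at the start of the interval is dropped.

import numpy as np
import scipp as sc

# Hypothetical log: a value recorded at t = 0, 10, and 20 s.
log = sc.DataArray(
    sc.array(dims=['time'], values=[1.0, 2.0, 3.0], unit='m'),
    coords={'time': sc.array(dims=['time'], values=[0, 10, 20], unit='s')},
)

start = sc.scalar(15, unit='s')
stop = sc.scalar(25, unit='s')
# Keeps only the entry at 20 s; the value 2.0 (set at 10 s and still in
# effect at 15 s) is excluded, hence loading the full log instead.
naive = log['time', start:stop]

# "Previous value" semantics: last entry with time <= start.
i = int(np.searchsorted(log.coords['time'].values, start.value, side='right')) - 1
assert log['time', i].value == 2.0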

src/ess/reduce/nexus/types.py (review thread on outdated diff, resolved)
# "end" time in the files. We add a dummy end so we can use Scipp's label-
# based indexing for histogram data.
time = t.value.coords['time']
delta = sc.scalar(86_400_000, unit='s', dtype='int64').to(unit=time.unit)
Member:

Where does this number come from? Can't you just use np.iinfo('int64').max as the last value?

SimonHeybrock (Member Author):

I think that is tricky, since we don't know the input dtype (could be signed or unsigned, or a datetime). There probably is a way (can you think of a simple one?), but just adding 1000 days seemed "safe".

Member:

I think it is safe enough. I was more surprised by the concrete number and wondered whether it has some significance because it is not simply 10**10 or something like that.
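[Editor's note] The constant is exactly the "1000 days" mentioned above, expressed in seconds, which is why it is not a round power of ten:

# 1000 days * 24 h/day * 60 min/h * 60 s/min = 86_400_000 s
assert 1000 * 24 * 60 * 60 == 86_400_000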

src/ess/reduce/nexus/workflow.py (review thread on outdated diff, resolved)
If one or more transformations in the chain are time-dependent, the time interval
is used to select a specific time point. If the interval is not a single time point,
an error is raised. This may be extended in the future to a more sophisticated
mechanism, e.g., averaging over the interval to remove noise.
Member:

The last sentence is not really usage documentation. If you want to track work on this, I would say it should be an issue.

SimonHeybrock (Member Author):

I feel it kind of is usage documentation: someone will look for a way of processing the time series, and this tells them it is not implemented.
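[Editor's note] To make the documented behavior concrete, an illustrative sketch of the single-time-point rule combined with the previous-value lookup from the earlier thread. This is not the implementation in workflow.py; the function name and error messages are made up.

import numpy as np
import scipp as sc

def transform_at(log: sc.DataArray, start: sc.Variable, stop: sc.Variable) -> sc.Variable:
    # A time-dependent transformation only supports an interval that
    # collapses to a single time point.
    if not sc.identical(start, stop):
        raise ValueError('Time-dependent transformation requires a single time point')
    # The value in effect at `start` is the last log entry at or before it.
    times = log.coords['time'].to(unit=start.unit)
    i = int(np.searchsorted(times.values, start.value, side='right')) - 1
    if i < 0:
        raise ValueError('Requested time precedes the first log entry')
    return log['time', i].data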

# If the NXdetector in the file is not 1-D, we want to match the order of dims.
# zip_pixel_offsets otherwise yields a vector with dimensions in the order given
# by the x/y/z offsets.
offsets = snx.zip_pixel_offsets(da.coords).transpose(da.dims).copy()
Member:

Why copy?

SimonHeybrock (Member Author):

I felt there was little to lose, whereas we still run into some Scipp operations that do not handle non-contiguous data.

Member:

When I see copy() somewhere, my assumption is that it has some significance, e.g., that the result will be modified in-place. So I went looking but didn't find anything.
Essentially, it increases 'noise' for the reader. But leave it or remove it, whichever you prefer.
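[Editor's note] For background on the contiguity point, a numpy analogy (scipp's transpose likewise returns a view without moving data):

import numpy as np

a = np.arange(6).reshape(2, 3)
t = a.T                                 # transpose is a strided view
print(t.flags['C_CONTIGUOUS'])          # False: non-contiguous
print(t.copy().flags['C_CONTIGUOUS'])   # True: copy() gives a contiguous buffer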

SimonHeybrock merged commit f5748ef into main on Nov 5, 2024, with 4 checks passed, and deleted the nexus-transforms-improvements branch.