Rearranging and Filtering Binned Data#

Event filtering refers to the process of removing or extracting a subset of events based on some criterion such as the temperature of the measured sample at the time an event was detected. Instead of extracting based on a single parameter value or interval, we may also want to rearrange data based on the parameter value, providing quick and convenient access to the parameter-dependence of our data. Scipp’s binned data can be used for both of these purposes.

The Quick Reference below provides a brief overview of the options. A more detailed walkthrough based on actual data can be found in the Full example.

Quick Reference#

Extract events matching parameter value#

Use label-based indexing on the bins property. This works similar to regular label-based indexing but operates on the unordered bin contents. Example:

param_value = sc.scalar(1.2, unit='m')
filtered = da.bins['param', param_value]

The output data array has the same dimensions as the input da.
filtered contains a copy of the filtered events.

Extract events falling into a parameter interval#

Use label-based indexing on the bins property. This works similar to regular label-based indexing but operates on the unordered bin contents. Example:

start = sc.scalar(1.2, unit='m')
stop = sc.scalar(1.3, unit='m')
filtered = da.bins['param', start:stop]

The output data array has the same dimensions as the input da. filtered contains a copy of the filtered events.
Note that as usual the upper bound of the interval (here \(1.3~\text{m}\)) is not included.

Split into bins based on a discrete event parameter#

Use scipp.group. Example:

split = da.group('param')

The output data array has a new dimension 'param' in addition to the dimensions of the input.
split contains a copy of the reordered events.
Pass an explicit variable to group listing desired groups to limit what is included in the output.

Split into bins based on a continuous event parameter#

Use scipp.bin. Example:

split = da.bin(param=10)

The output data array has a new dimension 'param' in addition to the dimensions of the input.
split contains a copy of the reordered events.
Provide an explicit variable to bin to limit the parameter interval that is included in the output, or for fine-grained control over the sub-intervals.

Compute derived event parameters for subsequent extracting or splitting#

Use scipp.transform_coords. Example:

da2 = da.transform_coords(derived_param=lambda p1, p2: p1 + p2)

da2 can now be used with any of the methods for exctracting or splitting data described above. The intermediate variable can also be omitted, and we can directly extract or split the result:

filtered = da.transform_coords(derived_param=lambda p1, p2: p1 + p2) \
             .bin(new_param=10)

Compute derived event parameters from time-series or other metadata#

In practice, events are often tagged with a timestamp, which can be used to lookup parameter values from, e.g., a time-series log given by a data array with a single dimension and a coordinate matching the coordinate name of the timestamps. Use scipp.lookup with scipp.transform_coords. Example:

temperature = da.attrs['sample_temperature'].value  # temperature value time-series
interp_temperature = sc.lookup(temperature, mode='previous')
filtered = da.transform_coords(temperature=interp_temperature) \
             .bin(temperature=10)

Rearranging and Filtering Binned Data

Contents

Rearranging and Filtering Binned Data#

Quick Reference#

Extract events matching parameter value#

Extract events falling into a parameter interval#

Split into bins based on a discrete event parameter#

Split into bins based on a continuous event parameter#

Compute derived event parameters for subsequent extracting or splitting#

Compute derived event parameters from time-series or other metadata#

Full example#

Input data#

Extract time interval#

Filter bad pulses#

Rearrange data based on strain#