Tutorial data preparation #103

Erikpostt · 2025-01-10T11:06:38Z

This PR changes the following:

Add user guides to the docs
Add a user guide for the coordinate system used
Add a user guide for the config files (first version, but will expand once the config file structure has been modified)
Displays prints in data preparation tutorial

KarsVeldkamp · 2025-01-10T12:47:59Z

docs/tutorials/data_preparation.ipynb

@@ -117,7 +124,8 @@
   "metadata": {},
   "source": [
    "#### Set sensor values to the correct units\n",
-    "First, TSDF stores the data efficiently using scaling factors. We should therefore convert the sensor values back to the true values. "
+    "TSDF stores the data efficiently using scaling factors. We should therefore convert the sensor values back to the true values. This is only relevant if you use TSDF and scaled the data for storage purposes.\n",


Is this only relevant if you use TSDF and scaled the data for storage purposes or also when you don't use tsdf but have scaled the data for storage purposes?

I think it's also relevant when using other data formats, but generally data formats such as parquet have inherent data compression, meaning that the scaling happens when storing/loading. Therefore I don't expect users to require the scaling. Do you think we should remove this from the tutorial, and perhaps scale it prior to loading the data such that it doesn't distract the user?

Not per se I think but I was just wondering if other formats have something like scale factors that you need to apply just like we do with TSDF. But if they all do this like Parquet then we can leave it this way.

KarsVeldkamp · 2025-01-10T12:49:52Z

docs/tutorials/data_preparation.ipynb

@@ -171,7 +179,7 @@
   "metadata": {},
   "source": [
    "#### Account for watch side\n",
-    "For the Gait & Arm Swing pipeline, it is essential to ensure correct sensor axes orientation. For more information please read [X]. If the sensors are not correctly aligned, you can use `invert_watch_side` to ensure consistency between sensors worn on the left or right wrist."
+    "For the Gait & Arm Swing pipeline, it is essential to ensure correct sensor axes orientation. For more information please read [Coordinate System](../guides/coordinate_system.md). If the sensors are not correctly aligned, you can use `invert_watch_side` to ensure consistency between sensors worn on the left or right wrist."


invert_watch_side does not necessarily correctly align the sensors right? It only does when one side is already correctly aligned?

You're absolutely correct, I'll adjust it accordingly.

KarsVeldkamp · 2025-01-10T12:51:32Z

docs/tutorials/data_preparation.ipynb

@@ -198,7 +206,7 @@
   "metadata": {},
   "source": [
    "#### Change time column\n",
-    "ParaDigMa expects the data to be in seconds relative to the first row. The toolbox has the built-in function `transform_time_array` to help users transform their time column to the correct format."
+    "ParaDigMa expects the data to be in seconds relative to the first row, which should be equal to 0. The toolbox has the built-in function `transform_time_array` to help users transform their time column to the correct format."


first row or first data point? The first is technically correct but I think the latter is more intuitive?

KarsVeldkamp · 2025-01-10T12:54:09Z

@Erikpostt I have some minor remarks regarding the data_preparation tutorial. The rest is looking good! I'll leave it up to you if you wish to make changes or merge the branch already ;)

…tutorial-data-preparation

Erikpostt added 2 commits January 10, 2025 12:01

Add user guides for coordinate system and update index of docs

5f1809e

Displays outputs of data preparation tutorial

1962ef8

Erikpostt marked this pull request as ready for review January 10, 2025 11:10

Erikpostt requested a review from KarsVeldkamp January 10, 2025 11:10

KarsVeldkamp reviewed Jan 10, 2025

View reviewed changes

KarsVeldkamp approved these changes Jan 10, 2025

View reviewed changes

Erikpostt added 2 commits January 10, 2025 14:29

Merge branch 'main' of github.com:biomarkersParkinson/paradigma into …

d379f6d

…tutorial-data-preparation

Modify according to feedback of reviewer

d88d4e7

Erikpostt merged commit 9f0862b into main Jan 10, 2025
1 check passed

Erikpostt deleted the tutorial-data-preparation branch January 10, 2025 13:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tutorial data preparation #103

Tutorial data preparation #103

Erikpostt commented Jan 10, 2025

KarsVeldkamp Jan 10, 2025

Erikpostt Jan 10, 2025

KarsVeldkamp Jan 10, 2025

KarsVeldkamp Jan 10, 2025

Erikpostt Jan 10, 2025

KarsVeldkamp Jan 10, 2025

KarsVeldkamp commented Jan 10, 2025

Tutorial data preparation #103

Tutorial data preparation #103

Conversation

Erikpostt commented Jan 10, 2025

KarsVeldkamp Jan 10, 2025

Choose a reason for hiding this comment

Erikpostt Jan 10, 2025

Choose a reason for hiding this comment

KarsVeldkamp Jan 10, 2025

Choose a reason for hiding this comment

KarsVeldkamp Jan 10, 2025

Choose a reason for hiding this comment

Erikpostt Jan 10, 2025

Choose a reason for hiding this comment

KarsVeldkamp Jan 10, 2025

Choose a reason for hiding this comment

KarsVeldkamp commented Jan 10, 2025