[spi_flash] handle reentrance gracefully #4397

tyomitch · 2021-03-13T12:07:45Z

Report failure instead of deadlocking the device

tyomitch · 2021-03-13T12:29:36Z

Looks like an HTTPS misconfiguration is blocking the CI

gamblor21

The changes look fine to me, but did not test directly.

gamblor21 · 2021-03-13T16:14:37Z

supervisor/shared/external_flash/spi_flash.c

-    while (!common_hal_busio_spi_try_lock(&supervisor_flash_spi_bus)) {}
-    common_hal_digitalio_digitalinout_set_value(&cs_pin, false);
+static bool flash_enable(void) {
+    if (common_hal_busio_spi_try_lock(&supervisor_flash_spi_bus)) {


Is it worth retrying X amount of times even? I know adafruit_bus_device had a routine to catch background tasks and keep trying for the lock, I'm not familiar enough to say for sure this needs it just a thought.

Could it be that instead of the busy-wait in https://github.com/adafruit/Adafruit_CircuitPython_BusDevice/blob/master/adafruit_bus_device/spi_device.py#L73, it's better to throw an exception when the SPI is locked? Then the user would have control over how many times, if at all, to retry.

Looping over common_hal_busio_spi_try_lock() in C will not succeed if it doesn't work the first time: If foreground Python code has locked the bus, it has to be able to proceed in order to unlock the bus. I don't think we ever un/lock the bus from an interrupt context.

@jepler doesn't the same apply to adafruit_bus_device? Or may run_background_tasks() lock SPI in one tick, and unlock it in another?

tyomitch · 2021-03-19T11:57:20Z

Ping @tannewt who added the loop in question in 9d91111

tannewt · 2021-03-19T19:50:22Z

I added it three years ago... Have you reproduced an issue with this or just suspect a problem?

I think the assumption is that it'd be incorrect for this code to occur when the spi bus is locked. Having this be a loop allows you to detect the error with a debugger when developing.

tyomitch · 2021-03-19T20:35:32Z

I added it three years ago... Have you reproduced an issue with this or just suspect a problem?

My particular use case was attempting a file access in an interrupt handler, and observing that it locks up the board if the interrupt happens during screen update.

I think the assumption is that it'd be incorrect for this code to occur when the spi bus is locked. Having this be a loop allows you to detect the error with a debugger when developing.

There is nothing to prevent this code from being called when the SPI bus is locked: interrupts notwithstanding, the Python user can lock the bus via busio.SPI, then try a file operation, and he would have no indication whatsoever as to why the board locked up.

hierophect · 2021-03-19T22:10:15Z

@tannewt It seems the most likely place for this to be encountered in practice is with the Meowbit, which shares the SPI bus between the flash and the screen (I assume this is where you first encountered the problem, @tyomitch). Specifically, when a module tries to access the flash while the screen is updating, it causes an irrecoverable hang. Then you need to figure out how to manually wipe the NOR flash somehow, like introducing an intentional exception into your build. I just ran into it with the AudioPWMIO PR - it's not fun to deal with.

hierophect · 2021-03-19T22:19:42Z

If there are unwanted wider consequences for this change, and since the Meowbit is just one board (and a historically annoying one, for this same reason), this could be folded into an optional mpconfigboard macro so it doesn't apply to other boards.

tannewt

This looks good to me. Code that calls transfer should already be able to handle failures. Thanks! Looks like it just needs a final merge.

Report failure instead of deadlocking the device

hierophect

LGTM

jepler · 2021-03-23T19:12:33Z

Thank you @tyomitch !

gamblor21 reviewed Mar 13, 2021

View reviewed changes

tyomitch mentioned this pull request Mar 19, 2021

[stm] implementation of audiopwmio #4399

Merged

tannewt previously approved these changes Mar 22, 2021

View reviewed changes

[spi_flash] handle reentrance gracefully

3b613ff

Report failure instead of deadlocking the device

tyomitch dismissed tannewt’s stale review via 3b613ff March 23, 2021 06:15

tyomitch force-pushed the patch-1 branch from 24f48d8 to 3b613ff Compare March 23, 2021 06:15

hierophect approved these changes Mar 23, 2021

View reviewed changes

jepler merged commit f8cea3b into adafruit:main Mar 23, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[spi_flash] handle reentrance gracefully #4397

[spi_flash] handle reentrance gracefully #4397

tyomitch commented Mar 13, 2021

tyomitch commented Mar 13, 2021

gamblor21 left a comment

gamblor21 Mar 13, 2021

tyomitch Mar 13, 2021

jepler Mar 15, 2021

tyomitch Mar 15, 2021

tyomitch commented Mar 19, 2021

tannewt commented Mar 19, 2021

tyomitch commented Mar 19, 2021

hierophect commented Mar 19, 2021

hierophect commented Mar 19, 2021

tannewt left a comment

hierophect left a comment

jepler commented Mar 23, 2021

[spi_flash] handle reentrance gracefully #4397

[spi_flash] handle reentrance gracefully #4397

Conversation

tyomitch commented Mar 13, 2021

tyomitch commented Mar 13, 2021

gamblor21 left a comment

Choose a reason for hiding this comment

gamblor21 Mar 13, 2021

Choose a reason for hiding this comment

tyomitch Mar 13, 2021

Choose a reason for hiding this comment

jepler Mar 15, 2021

Choose a reason for hiding this comment

tyomitch Mar 15, 2021

Choose a reason for hiding this comment

tyomitch commented Mar 19, 2021

tannewt commented Mar 19, 2021

tyomitch commented Mar 19, 2021

hierophect commented Mar 19, 2021

hierophect commented Mar 19, 2021

tannewt left a comment

Choose a reason for hiding this comment

hierophect left a comment

Choose a reason for hiding this comment

jepler commented Mar 23, 2021