tests/kernel/interrupt failed on ARC #23476

chen-png · 2020-03-16T02:25:00Z

Describe the bug
Running tests/kernel/interrupt on iotdk board, it will stop at "test_prevent_interruption" and fail due to timeout.

To Reproduce
Steps to reproduce the behavior:

sanitycheck -p iotdk --device-testing --device-serial /dev/ttyUSB0 -T tests/kernel/interrupt/
See error in hander.log file.

Screenshots or console output

*** Booting Zephyr OS build zephyr-v2.2.0-368-g22b9167acb52  ***
Running test suite interrupt_feature

starting test - test_isr_dynamic
PASS - test_isr_dynamic

starting test - test_nested_isr
Triggering irq : 93
isr0 running !!
Triggering irq : 92
isr1 ran !!
isr0 execution completed !!
PASS - test_nested_isr

**starting test - test_prevent_interruption
locking interrupts**

Environment (please complete the following information):

OS: fedora 28
Toolchain: zephyr-sdk-0.11.1
Commit ID: 22b9167

The text was updated successfully, but these errors were encountered:

andrewboie · 2020-04-10T19:02:14Z

This also fails on nsim_sem, nsim_em, nsim_sem_mpu_stack_guard, nsim_em_mpu_stack_guard

vonhust · 2020-04-13T09:54:16Z

@andrewboie @ruuddw @abrodkin , here is my investigation
In the following codes of prevent_irq.c

         key = irq_lock();
	handler_result = 0;

	k_timer_init(&irqlock_timer, timer_handler, NULL);

	/* Start the timer and busy-wait for a bit with IRQs locked. The
	 * timer ought to have fired during this time if interrupts weren't
	 * locked -- but since they are, check_lock_new isn't updated.
	 */
	k_timer_start(&irqlock_timer, K_MSEC(DURATION), K_NO_WAIT);
	k_busy_wait(MS_TO_US(1000));
	zassert_not_equal(handler_result, HANDLER_TOKEN,
		"timer interrupt was serviced while interrupts are locked");

	printk("unlocking interrupts\n");
	irq_unlock(key);

systick timer irq cannot be handled as irq is locked. So it means the sys tick may be lost.

Meanwhile, k_busy_wait(MS_TO_US(1000)) relies on k_cycle_get_32, then to arch_k_cycle_get_32()->z_timer_cycle_get_32

However, as the sys tick is not updated because of irq lock , the value return by z_timer_cycle_get_32 will wrap back. ARC's timer is a free running timer and will wrap to 0 when the counter reaches the LIMIT. (other arch like ARM has similar issue)

This will cause processor loops forever in k_busy_wait.

The root issue is how to maintain the sys cycles when sys tick irq is not updated and there is no global wall clock.

For the test, a quick fix is don't k_busy_wait too long when irq is locked. e.g,
k_busy_wait(MS_TO_US(1000)); -> k_busy_wait(MS_TO_US(K_MSEC(2 * DURATION)));

a second fix is to implement CONFIG_ARCH_HAS_CUSTOM_BUSY_WAIT

a third fix is to maintain the correct wall clock in arch_k_cycle_get_32/z_timer_cycle_get32.

I think in high-level, it needs some discussion about this.

ruuddw · 2020-04-15T10:03:15Z

It could be argued that busy waiting for longer than 2 (dynamic) system ticks is a-typical use ("A busy wait is typically used instead of thread sleeping when the required delay is too short to warrant having the scheduler context switch from the current thread to another thread and then back again.").
However, I think this is more than just a test issue, unless k_busy_wait() gets a pre-condition that (timer) interrupts should be enabled when calling -> I don't like the first proposed fix. I think an ARC custom busy wait is the better solution (e.g. calculate and count the number of required timer wraps, and/or test if the busy wait time exceeds the currently set timer alarm).

chen-png added the bug The issue is a bug, or the PR is fixing a bug label Mar 16, 2020

carlescufi added area: ARC ARC Architecture area: Testing labels Mar 16, 2020

nashif assigned vonhust Mar 31, 2020

nashif added the priority: medium Medium impact/importance bug label Apr 7, 2020

andrewboie changed the title ~~tests/kernel/interrupt failed on iotdk board.~~ tests/kernel/interrupt failed on ARC Apr 10, 2020

vonhust mentioned this issue Apr 14, 2020

tests: can't busy_wait too long when interrupt is locked #24332

Merged

vonhust mentioned this issue Apr 17, 2020

drivers: improve the arcv2_timer driver to update cycles correctly #24445

Merged

carlescufi added area: Tests Issues related to a particular existing or missing test and removed area: Testing labels Apr 30, 2020

carlescufi closed this as completed in #24445 May 7, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tests/kernel/interrupt failed on ARC #23476

tests/kernel/interrupt failed on ARC #23476

chen-png commented Mar 16, 2020 •

edited by andrewboie

Loading

andrewboie commented Apr 10, 2020

vonhust commented Apr 13, 2020

ruuddw commented Apr 15, 2020

tests/kernel/interrupt failed on ARC #23476

tests/kernel/interrupt failed on ARC #23476

Comments

chen-png commented Mar 16, 2020 • edited by andrewboie Loading

andrewboie commented Apr 10, 2020

vonhust commented Apr 13, 2020

ruuddw commented Apr 15, 2020

chen-png commented Mar 16, 2020 •

edited by andrewboie

Loading