Open
Description
Kernel oops happened when removing modules in ADLP_RVP_NOCODEC. Found internal test 15892?model=ADLP_RVP_NOCODEC&testcase=check-kmod-load-unload-25
- Kernel Branch: topic/sof-dev
- Kernel Commit: 237b967
- SOF Branch: stable-v2.2
- SOF Commit: afe466da0238
To reproduce
TPLG=/lib/firmware/intel/sof-tplg/sof-adl-nocodec.tplg MODEL=ADLP_RVP_NOCODEC ~/sof-test/test-case/check-kmod-load-unload.sh -l 25
Reproduction Rate
1/50 : this is very rare error. With same hash build, it didn't happen even with stress-test
...
drm_panel_orientation_quirks 24576 1 drm
/home/ubuntu/sof-test/tools/kmod/sof_remove.sh FAILED
[ 5668.874898] kernel: snd_sof:ipc3_log_header: sof-audio-pci-intel-tgl 0000:00:1f.3: ipc tx: 0x40020000: GLB_PM_MSG: CTX_RESTORE
[ 5668.882680] kernel: BUG: unable to handle page fault for address: 000000000395de55
[ 5668.882737] kernel: #PF: supervisor instruction fetch in kernel mode
[ 5668.882767] kernel: #PF: error_code(0x0010) - not-present page
[ 5668.882795] kernel: PGD 0 P4D 0
[ 5668.882815] kernel: Oops: 0010 [#1] PREEMPT SMP NOPTI
[ 5668.882839] kernel: CPU: 18 PID: 79665 Comm: rmmod Tainted: G U 6.0.0-rc5-daily-nocodec-20220930-0-g237b967b3bf4 #dev
[ 5668.882897] kernel: Hardware name: Intel Corporation Alder Lake Client Platform/AlderLake-P DDR5 RVP, BIOS ADLPFWI1.R00.2411.A02.2110081023 10/08/2021
[ 5668.883009] kernel: RIP: 0010:0x395de55
[ 5668.883034] kernel: Code: Unable to access opcode bytes at RIP 0x395de2b.
[ 5668.883067] kernel: RSP: 0018:ffffb265c158bb30 EFLAGS: 00010046
[ 5668.883098] kernel: RAX: 0000000000000000 RBX: dead000000000122 RCX: 0000000000000000
[ 5668.883136] kernel: RDX: 0000000000000000 RSI: ffffffffffffffff RDI: ffffb265c158bb40
[ 5668.883173] kernel: RBP: 0000000000000246 R08: 0000000000000003 R09: ffffffffffffffff
[ 5668.883210] kernel: R10: 0000000000000001 R11: 0000000000000001 R12: ffff8a409040f000
[ 5668.883246] kernel: R13: dead000000000100 R14: 0000000000000001 R15: ffff8a409040f200
[ 5668.883285] kernel: FS: 00007fb672ac3c40(0000) GS:ffff8a441f800000(0000) knlGS:0000000000000000
[ 5668.883358] kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 5668.883390] kernel: CR2: 000000000395de55 CR3: 0000000114db6005 CR4: 0000000000770ee0
[ 5668.883427] kernel: PKRU: 55555554
[ 5668.883457] kernel: Call Trace:
[ 5668.883477] kernel:
[ 5668.883494] kernel: ? xas_find+0x178/0x1c0
[ 5668.883524] kernel: ? ida_destroy+0x65/0x1a0
[ 5668.883553] kernel: ? sof_widget_unload+0x160/0x2e0 [snd_sof]
[ 5668.883630] kernel: ? remove_widget+0x34/0xa0 [snd_soc_core]
[ 5668.883699] kernel: ? snd_soc_tplg_component_remove+0x252/0x2a0 [snd_soc_core]
[ 5668.883765] kernel: ? soc_remove_component+0x73/0x80 [snd_soc_core]
[ 5668.883815] kernel: ? soc_cleanup_card_resources+0xb3/0x260 [snd_soc_core]
[ 5668.883851] kernel: ? snd_soc_unbind_card+0x6a/0xf0 [snd_soc_core]
[ 5668.883886] kernel: ? snd_soc_unregister_card+0x24/0x60 [snd_soc_core]
[ 5668.883920] kernel: ? release_nodes+0x42/0x70
[ 5668.883944] kernel: ? devres_release_all+0x8b/0xc0
[ 5668.883960] kernel: ? device_unbind_cleanup+0xe/0x70
[ 5668.883977] kernel: ? device_release_driver_internal+0xee/0x170
[ 5668.884004] kernel: ? bus_remove_device+0xef/0x160
[ 5668.884020] kernel: ? device_del+0x1aa/0x420
[ 5668.884034] kernel: ? platform_device_del.part.0+0x13/0x70
[ 5668.884051] kernel: ? platform_device_unregister+0x1c/0x40
[ 5668.884067] kernel: ? snd_sof_device_remove+0x4b/0xe0 [snd_sof]
[ 5668.884105] kernel: ? sof_pci_remove+0x1a/0x50 [snd_sof_pci]
[ 5668.884123] kernel: ? pci_device_remove+0x33/0xa0
[ 5668.884147] kernel: ? device_release_driver_internal+0xc1/0x170
[ 5668.884173] kernel: ? driver_detach+0x47/0x90
[ 5668.884193] kernel: ? bus_remove_driver+0x58/0xe0
[ 5668.884216] kernel: ? pci_unregister_driver+0x3b/0x90
[ 5668.884240] kernel: ? __x64_sys_delete_module+0x195/0x280
[ 5668.884268] kernel: ? exit_to_user_mode_prepare+0x3b/0x120
[ 5668.884286] kernel: ? do_syscall_64+0x38/0x90
[ 5668.884309] kernel: ? entry_SYSCALL_64_after_hwframe+0x63/0xcd
[ 5668.884331] kernel:
[ 5668.884344] kernel: Modules linked in: snd_hwdep snd_sof_pci_intel_mtl snd_sof_probes snd_sof_ipc_msg_injector snd_sof_nocodec snd_sof_ipc_flood_test snd_sof_pci_intel_tgl(-) snd_sof_intel_hda_common snd_sof_pci snd_sof_acpi_intel_bdw snd_sof_acpi_intel_byt snd_sof_intel_atom snd_sof_xtensa_dsp snd_soc_acpi_intel_match snd_soc_acpi snd_sof_acpi snd_sof snd_sof_utils snd_soc_core snd_compress snd_intel_dspcfg snd_pcm snd_usbmidi_lib i915 wmi_bmof snd_seq_midi snd_seq_midi_event snd_rawmidi i2c_algo_bit x86_pkg_temp_thermal drm_buddy snd_seq drm_display_helper intel_powerclamp snd_seq_device snd_timer drm_kms_helper cfbfillrect syscopyarea snd cfbimgblt mei_me sysfillrect soundcore processor_thermal_device_pci mei sysimgblt processor_thermal_device fb_sys_fops cfbcopyarea processor_thermal_rfim ttm processor_thermal_mbox int340x_thermal_zone wmi ax88796b int3400_thermal acpi_thermal_rel squashfs drm drm_panel_orientation_quirks fuse efivarfs asix usbnet intel_ishtp_hid intel_lpss_pci xhci_pci
[ 5668.884407] kernel: intel_lpss intel_ish_ipc idma64 intel_ishtp mfd_core xhci_hcd vmd [last unloaded: snd_sof_pci_intel_icl]
[ 5668.884635] kernel: CR2: 000000000395de55
[ 5668.884648] kernel: ---[ end trace 0000000000000000 ]---
[ 5668.884671] kernel: RIP: 0010:0x395de55
[ 5668.884683] kernel: Code: Unable to access opcode bytes at RIP 0x395de2b.
[ 5668.884700] kernel: RSP: 0018:ffffb265c158bb30 EFLAGS: 00010046
[ 5668.884717] kernel: RAX: 0000000000000000 RBX: dead000000000122 RCX: 0000000000000000
[ 5668.884736] kernel: RDX: 0000000000000000 RSI: ffffffffffffffff RDI: ffffb265c158bb40
[ 5668.884756] kernel: RBP: 0000000000000246 R08: 0000000000000003 R09: ffffffffffffffff
[ 5668.884776] kernel: R10: 0000000000000001 R11: 0000000000000001 R12: ffff8a409040f000
[ 5668.884795] kernel: R13: dead000000000100 R14: 0000000000000001 R15: ffff8a409040f200
[ 5668.884815] kernel: FS: 00007fb672ac3c40(0000) GS:ffff8a441f800000(0000) knlGS:0000000000000000
[ 5668.884853] kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 5668.884870] kernel: CR2: 000000000395de55 CR3: 0000000114db6005 CR4: 0000000000770ee0
[ 5668.884890] kernel: PKRU: 55555554
[ 5668.884905] kernel: note: rmmod[79665] exited with preempt_count 1