Skip to content

Fix a couple of bugs

Matthew Brost requested to merge (removed):xe into xe
[  339.137642] xe 0000:03:00.0: [drm] Timedout job: seqno=1, guc_id=0, flags=0x2                                                                                                                                                                                                                                                                                                                                                         
[  339.137945] xe 0000:03:00.0: [drm] Try GT reset                                                                                                                                                                                                                                                                                                                                                                                       
[  339.137995] xe 0000:03:00.0: [drm] Doing GT reset                                                                                                                                                                                                                                                                                                                                                                                     
[  339.138187] xe 0000:03:00.0: [drm] GT reset started                                                                                                                                                                                                                                                                                                                                                                                   
[  339.138491] xe 0000:03:00.0: [drm] *ERROR* GuC PC reset: -ENODEV                                                                                                                                                                                                                                                                                                                                                                      
[  339.144768] ------------[ cut here ]------------                                                                                                                                                                                                                                                                                                                                                                                      
[  339.144772] WARNING: CPU: 3 PID: 90 at drivers/gpu/drm/xe/xe_guc.c:721 xe_guc_stop+0x36/0x40 [xe]                                                                                                                                                                                                                                                                                                                                     
[  339.144813] Modules linked in: xe(+) drm_ttm_helper drm_suballoc_helper gpu_sched ttm br_netfilter bridge stp llc overlay x86_pkg_temp_thermal coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel mei_me mei fuse drm ip_tables x_tables i2c_i801 i2c_smbus e1000e                                                                                                                                    
[  339.144887] CPU: 3 PID: 90 Comm: kworker/u12:4 Tainted: G        W         5.18.0-xe+ #3                                                                                                                                                                                                                                                                                                                                              
[  339.144892] Hardware name: Intel Corporation CoffeeLake Client Platform/CoffeeLake S UDIMM RVP, BIOS CNLSFWR1.R00.X220.B00.2103302221 03/30/2021                                                                                                                                                                                                                                                                                      
[  339.144895] Workqueue: gt-ordered-wq gt_reset_worker [xe]                                                                                                                                                                                                                                                                                                                                                                             
[  339.144935] RIP: 0010:xe_guc_stop+0x36/0x40 [xe]                                                                                                                                                                                                                                                                                                                                                                                      
[  339.144972] Code: c7 58 e8 6d 53 00 00 48 89 df e8 45 ca 00 00 89 c5 85 c0 74 05 89 e8 5b 5d c3 48 8d bb 68 02 00 00 e8 ee f0 00 00 85 c0 74 eb <0f> 0b eb e7 66 0f 1f 44 00 00 0f 1f 44 00 00 e9 16 d0 00 00 66 0f                                                                                                                                                                                                                   
[  339.144976] RSP: 0018:ffff888115f8fcf0 EFLAGS: 00010286                                                                                                                                                                                                                                                                                                                                                                               
[  339.144982] RAX: 00000000ffffffed RBX: ffff88813fc6a2d0 RCX: 0000000000000000                                                                                                                                                                                                                                                                                                                                                         
[  339.144986] RDX: 0000000000000000 RSI: ffffffff811433a4 RDI: ffffed1022bf1f7d                                                                                                                                                                                                                                                                                                                                                         
[  339.144989] RBP: 0000000000000000 R08: 0000000000000001 R09: ffff888225bc3817                                                                                                                                                                                                                                                                                                                                                         
[  339.144993] R10: ffffed1044b78702 R11: 0000000000000001 R12: ffff88813fc69fc0                                                                                                                                                                                                                                                                                                                                                         
[  339.144996] R13: ffff88813fc6a2d0 R14: ffff88813fc68000 R15: 0000000000000000                                                                                                                                                                                                                                                                                                                                                         
[  339.144999] FS:  0000000000000000(0000) GS:ffff888225b80000(0000) knlGS:0000000000000000                                                                                                                                                                                                                                                                                                                                              
[  339.145003] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  339.145007] CR2: 000055c7228c50b8 CR3: 0000000007e16005 CR4: 00000000003706e0
[  339.145010] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  339.145013] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[  339.145017] Call Trace:                                                                                
[  339.145020]  <TASK>                                                                                    
[  339.145025]  gt_reset_worker.cold+0x103/0x27e [xe]                                                     
[  339.145073]  process_one_work+0x543/0x9a0                                                              
[  339.145090]  ? pwq_dec_nr_in_flight+0x100/0x100                                                        
[  339.145101]  ? rwlock_bug.part.0+0x60/0x60                                                             
[  339.145108]  ? _raw_spin_lock_irq+0x46/0x50                                                            
[  339.145121]  worker_thread+0x8f/0x630                                                                  
[  339.145138]  ? process_one_work+0x9a0/0x9a0                                                            
[  339.145143]  kthread+0x17b/0x1b0                                                                       
[  339.145147]  ? kthread_complete_and_exit+0x20/0x20                                                     
[  339.145156]  ret_from_fork+0x1f/0x30                                                                   
[  339.145177]  </TASK>                                                                                   
[  339.145180] irq event stamp: 49095                                                                     
[  339.145182] hardirqs last  enabled at (49101): [<ffffffff811b276a>] __up_console_sem+0x5a/0x70
[  339.145188] hardirqs last disabled at (49106): [<ffffffff811b274f>] __up_console_sem+0x3f/0x70
[  339.145193] softirqs last  enabled at (48718): [<ffffffff810f9b88>] __irq_exit_rcu+0xc8/0x130
[  339.145198] softirqs last disabled at (48677): [<ffffffff810f9b88>] __irq_exit_rcu+0xc8/0x130
[  339.145202] ---[ end trace 0000000000000000 ]---                                                       
[  339.145577] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] Applying save-restore MMIOs
[  339.145630] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] 00009434 = 00020000
[  339.145693] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] 00009524 = 10000000
[  339.145744] xe 0000:03:00.0: [drm:xe_reg_sr_apply_mmio [xe]] save-restore MMIOs done
[  339.145784] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] WOPCM: 4096K
[  339.145830] xe 0000:03:00.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [1440K, 544K)
[  339.145880] ==================================================================
[  339.153112] BUG: KASAN: null-ptr-deref in xe_hw_engine_enable_ring+0xd9/0x1e0 [xe]
[  339.160716] Read of size 8 at addr 00000000000002e0 by task kworker/u12:4/90
 
[  339.169269] CPU: 3 PID: 90 Comm: kworker/u12:4 Tainted: G        W         5.18.0-xe+ #3
[  339.177362] Hardware name: Intel Corporation CoffeeLake Client Platform/CoffeeLake S UDIMM RVP, BIOS CNLSFWR1.R00.X220.B00.2103302221 03/30/2021
[  339.190310] Workqueue: gt-ordered-wq gt_reset_worker [xe]
[  339.195751] Call Trace:                                                                                
[  339.198208]  <TASK>                                                                                    
[  339.200317]  dump_stack_lvl+0x6a/0x9b                                                                  
[  339.203993]  kasan_report+0xbf/0xf0                                                                    
[  339.207527]  ? xe_hw_engine_enable_ring+0xd9/0x1e0 [xe]
[  339.212795]  xe_hw_engine_enable_ring+0xd9/0x1e0 [xe]
[  339.217892]  gt_reset_worker.cold+0x22e/0x27e [xe]                                                     
[  339.222731]  process_one_work+0x543/0x9a0                                                              
[  339.226753]  ? pwq_dec_nr_in_flight+0x100/0x100                                                        
[  339.231296]  ? rwlock_bug.part.0+0x60/0x60                                                             
[  339.235407]  ? _raw_spin_lock_irq+0x46/0x50                                                            
[  339.239603]  worker_thread+0x8f/0x630                                                                  
[  339.243279]  ? process_one_work+0x9a0/0x9a0                                                            
[  339.247486]  kthread+0x17b/0x1b0                                                                       
[  339.250749]  ? kthread_complete_and_exit+0x20/0x20                                                     
[  339.255550]  ret_from_fork+0x1f/0x30                                                                   
[  339.259141]  </TASK>                                                                                   
[  339.261344] ==================================================================
[  339.268675] Disabling lock debugging due to kernel taint
[  339.268679] BUG: kernel NULL pointer dereference, address: 00000000000002e0
[  339.275641] #PF: supervisor read access in kernel mode
[  339.280790] #PF: error_code(0x0000) - not-present page

Merge request reports