fixup! microsoft/clc: Add new intrinsics to lowering to force 32bit loads
This was causing us to read out-of-bounds of the work properties constant buffer if the Z dimension offset was queried, which failed DXIL valiation, and would've returned wrong values if the Y or Z offsets were actually used.