asahi: Use device_load shift for VBO loads
When possible. Only occassionally possible because the loads are pretty limited in the addressing arithmetic. This probably doesn't matter for performance but it saves some noise in dEQP tests which makes for nicer debugging, plenty of optimizations end up worth it for that alone.
Signed-off-by: Alyssa Rosenzweig alyssa@rosenzweig.io