HSA's requirements are far broader than what Metal provides. The A7's shared memory implementation appears to be far too basic to support an HSA-type API, which calls for things like on-demand paging, user-mode queueing and the ability for the GPU to make callbacks to the CPU. Many systems have shared memory but are not HSA compliant.
Samsung and TI don't design their own GPUs, so it'd be surprising to see them outpacing AMD in this area. Qualcomm's efforts also appear to be behind AMD's so far, and they don't appear to give it as much priority as AMD does.