Processing 2 elements at once with half2 and ternary sign reduces address count
Ternary overhead exceeds savings from fewer addresses. Metal compiles ternaries to branches.