For the DGX benchmarks I found, the Spark was mostly beating the M4. It wasn't cut and dry.
The M4 Max has double the memory bandwidth, so it should be faster for decode (token generation).
For the DGX benchmarks I found, the Spark was mostly beating the M4. It wasn't cut and dry.