Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
echion
3 days ago
|
parent
|
context
|
favorite
| on:
Dell's version of the DGX Spark fixes pain points
> you can combine Spark with M3U, the former streaming the compute, lowering TTFT, the latter doing the token generation part
Are you doing this with vLLM, or some other model-running library/setup?
coder543
3 days ago
[–]
They're probably referencing this article:
https://blog.exolabs.net/nvidia-dgx-spark/
reply
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
Are you doing this with vLLM, or some other model-running library/setup?