❌

Reading view

UCSD Lab Advances Low-Latency LLM Serving with DGX B200

The post UCSD Lab Advances Low-Latency LLM Serving with DGX B200 appeared first on StartupHub.ai.

UC San Diego's Hao AI Lab is pushing the frontier of low-latency LLM serving by leveraging NVIDIA's DGX B200 system and pioneering disaggregated inference.

The post UCSD Lab Advances Low-Latency LLM Serving with DGX B200 appeared first on StartupHub.ai.

❌