Distributed layersSplit any transformer model across heterogeneous devices using the DPPAN sharding protocol.
OpenAI-compatibleA drop-in /v1/chat/completions endpoint — point your existing tools and SDKs at your local cluster.