Xorbits Inference: Model Serving Made Easy - GitHub Xorbits Inference (Xinference) is a powerful and versatile library designed to serve language, speech recognition, and multimodal models With Xorbits Inference, you can effortlessly deploy and serve your or state-of-the-art built-in models using just a single command