- This is a Python framework which wraps Databricks Model Serving Endpoints API functionality.
- With a few lines of code, you can:
- Deploy realtime models
- Distribute traffic across two or more models running under the same endpoint (e.g. for A/B testing)
- Inspect model build and server logs
To get started, simply install the package from this repo:
pip install git+https://github.com/sebrahimi1988/databricks-model-serving
Once the package is installed, you can leverage different functions in the EndpointClient
class to list
, create
and update
endpoints, amongst others. For instance, to list all model serving endpoints from a particular workspace:
from databricks.model_serving.client import EndpointClient
client = EndpointClient(databricks_url, databricks_token)
client.list_inference_endpoints()
In the notebooks
folder you can find an example use case, where we train a model, register it in Model Registry and deploy it using the framework.