Comments (3)
Can you elaborate? I understand you would like to test pipelines locally, but depending on the resources required to train that model, you need a production-ready Kubernetes cluster. Code snippets, configuration files, or similar details would help us understand the problem.
from pipelines.
The example above is object detection using YOLOX. In production I will use Vertex AI Pipelines for training, but first I want to make sure that the components work as intended in a local environment. Because local execution is only a behaviour check, I run it with small datasets and a small number of epochs. Full-size datasets and epoch counts will run on Vertex AI Pipelines in production.
A code snippet for the YOLOX case above is difficult to prepare, so here is a simpler example that also requires CUDA to run. It needs an option that makes the container GPU-aware at runtime.
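from kfp import dsl, local

# Run each task in a local Docker container.
local.init(runner=local.DockerRunner())

@dsl.container_component
def gpu_processing():
    # The image runs a CUDA vector-add sample, so it needs GPU access.
    return dsl.ContainerSpec(
        image="gcr.io/google_containers/cuda-vector-add:v0.1",
    )

# With local execution initialized, calling the component runs it immediately.
task = gpu_processing()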
When I run it, it does not detect CUDA and gives me an error.
02:21:39.615 - INFO - Executing task 'gpu-processing'
02:21:39.615 - INFO - Streamed logs:
Pulling image 'gcr.io/google_containers/cuda-vector-add:v0.1'
Image pull complete
Failed to allocate device vector A (error code CUDA driver version is insufficient for CUDA runtime version)!
[Vector addition of 50000 elements]
I expect the following results.
10:43:49.816 - INFO - Executing task 'gpu-processing'
10:43:49.816 - INFO - Streamed logs:
Found image 'gcr.io/google_containers/cuda-vector-add:v0.1'
[Vector addition of 50000 elements]
Copy input data from the host memory to the CUDA device
CUDA kernel launch with 196 blocks of 256 threads
Copy output data from the CUDA device to the host memory
Test PASSED
Done
10:43:51.690 - INFO - Task 'gpu-processing' finished with status SUCCESS
10:43:51.691 - INFO - Task 'gpu-processing' has no outputs
If the user could configure the Docker runtime options (for example, to grant the container GPU access), the results would be as expected.
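To illustrate what I mean by Docker runtime options, here is a minimal sketch outside of KFP that builds the two `docker run` commands I am comparing. The helper names `build_docker_run_args` and `run_image` are hypothetical and are not part of the kfp SDK. `--gpus` is a Docker CLI flag, and I am assuming the NVIDIA Container Toolkit is installed on the host.

```python
import shlex
import subprocess
from typing import List, Optional

IMAGE = "gcr.io/google_containers/cuda-vector-add:v0.1"


def build_docker_run_args(image: str, gpus: Optional[str] = None) -> List[str]:
    """Build a `docker run` argument list, optionally requesting GPUs.

    Hypothetical helper for illustration; not part of kfp.
    """
    args = ["docker", "run", "--rm"]
    if gpus is not None:
        # `--gpus all` exposes every host GPU to the container
        # (assumes the NVIDIA Container Toolkit is set up on the host).
        args += ["--gpus", gpus]
    args.append(image)
    return args


def run_image(image: str, gpus: Optional[str] = None) -> int:
    """Run the image with Docker and return the container's exit code."""
    return subprocess.run(build_docker_run_args(image, gpus), check=False).returncode


# Only print the commands here; call run_image() on a machine with Docker.
print(shlex.join(build_docker_run_args(IMAGE)))
print(shlex.join(build_docker_run_args(IMAGE, gpus="all")))
```

The first command has no GPU option, which is the situation in which I see the CUDA driver error above. The second is the kind of option I would like to be able to pass through the local Docker runner.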
Are you following this method to execute the pipeline? https://www.kubeflow.org/docs/components/pipelines/v2/local-execution/