Coder Social home page Coder Social logo

lmiroslaw / batch-shipyard Goto Github PK

View Code? Open in Web Editor NEW

This project forked from azure/batch-shipyard

0.0 2.0 0.0 1.51 MB

Execute batch and HPC Dockerized workloads on Azure Batch with shared file system provisioning and linking support

License: MIT License

Python 86.05% Shell 13.76% Batchfile 0.20%

batch-shipyard's Introduction

Build Status Docker Pulls Image Layers

Batch Shipyard

Batch Shipyard is a tool to help provision and execute batch processing and HPC Docker workloads on Azure Batch compute pools. No experience with the Azure Batch SDK is needed; run your Dockerized tasks with easy-to-understand configuration files!

Additionally, Batch Shipyard provides the ability to provision and manage entire standalone remote file systems (storage clusters) in Azure, independent of any integrated Azure Batch functionality.

Major Features

  • Automated Docker Host Engine installation tuned for Azure Batch compute nodes
  • Automated deployment of required Docker images to compute nodes
  • Accelerated Docker image deployment at scale to compute pools consisting of a large number of VMs via private peer-to-peer distribution of Docker images among the compute nodes
  • Comprehensive data movement support: move data easily between locally accessible storage systems, remote filesystems, Azure Blob or File Storage, and compute nodes
  • Docker Private Registry support
  • Standalone Remote Filesystem Provisioning with integration to auto-link these filesystems to compute nodes with support for
  • Automatic shared data volume support
  • Seamless integration with Azure Batch job, task and file concepts along with full pass-through of the Azure Batch API to containers executed on compute nodes
  • Support for Azure Batch task dependencies allowing complex processing pipelines and DAGs with Docker containers
  • Transparent support for GPU-accelerated Docker applications on Azure N-Series VM instances
  • Support for multi-instance tasks to accommodate Dockerized MPI and multi-node cluster applications on compute pools with automatic job completion and Docker task termination
  • Transparent assist for running Docker containers utilizing Infiniband/RDMA for MPI on HPC low-latency Azure VM instances:
    • A-Series: STANDARD_A8, STANDARD_A9
    • H-Series: STANDARD_H16R, STANDARD_H16MR
    • N-Series: STANDARD_NC24R (not yet ready with Linux hosts)
  • Automatic setup of SSH users to all nodes in the compute pool and optional tunneling to Docker Hosts on compute nodes
  • Support for credential management through Azure KeyVault
  • Support for execution on an Azure Function App environment

Installation

Installation is typically an easy two-step process. The CLI is also available as a Docker image: alfpark/batch-shipyard:cli-latest. Please see the installation guide for more information regarding installation and requirements.

Documentation

Please refer to the Batch Shipyard Guide for a complete primer on concepts, usage and a quickstart guide.

Please visit the Batch Shipyard Recipes for various sample Docker workloads using Azure Batch and Batch Shipyard after you have completed the introductory sections of the Batch Shipyard Guide.

Batch Shipyard Compute Node OS Support

Batch Shipyard is currently only compatible with Azure Batch supported Marketplace Linux VMs.

Change Log

See the CHANGELOG.md file.


This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

batch-shipyard's People

Contributors

alfpark avatar lediur avatar pareshverma91 avatar jasper-schneider avatar andreadotti avatar smith1511 avatar gonzaloruiz avatar

Watchers

Lukasz Miroslaw avatar James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.