Coder Social home page Coder Social logo

ahod-hpc's Introduction

AHOD-HPC

Azure Resource Manager Templates for Ad-Hoc On-Demand HPC clusters

Deploy cluster with a new VNet

Deploy cluster to an existing VNet

Visualize architecture

Deploy using JSON copy

Deploy using Custom Image



Quickstart
1) Deploy ARM Template
	a. Click on the link above
	b. Select HPC available region
	c. Select vm size (H16m/H16mr or A8/A9) and quantity (make sure to have quota for it)
	d. Name, less than 10 characters
	e. License server IP, use default if in MSFT
	f. Benchmark model
2) Wait for deployment (may be long if a larger model)
3) Logon to machine IP listed in portal
4) Navigate to /mnt/resource

Architecture

hpc_vmss_architecture

This template is designed to assist in the assessment of the ANSYS Fluent CFD package in the Microsoft Azure environment. It automatically downloads and configures Fluent. In addition it authenticates all of the nodes on the cluster and creates a common share directory to be used for each of the nodes. A Virtual Machine Jumpbox is created and a Virtual Machine Scale Set (VMSS) of the same type of machine is created. The VMSS enables easy scaling and quick deployment of the cluster. The Jumpbox serves as the head node. A network card is attached to the Jumpbox and placed in a Virtual Network. The Jumpbox and VMSS reside in the same virtual network. A public IP is assigned to the network with port 22 open. The Jumpbox can be accessed with the following command:

ssh {username}@{vm-private-ip-address}

Four Storage Accounts are created for the VMSS and one for the Jumpbox. An NFS file share is created from the head node's OS disk and shared with all of the VMSS nodes. This NFS share is located at /mnt/nfsshare/ No other file sharing or server is used. The Jumpbox and each of the nodes in the VMSS also have a data disk mounted at /mnt/resource/ that provides local storage space.

Monitoring

By default Ganglia is installed on the Jumbpox and all of the compute nodes. You can access this monitor by opening a web browser and navigating to the ip address of the Jumpbox /ganglia. For example http://11.22.33.44/ganglia

Here you will find the health and status of all of the nodes in the cluster.

Software Configuration

A number of packages are installed during deployment in order to support the NFS share and the tools that are used to create the authentication. During the authentication phase of the deployment, files named nodenames.txt and nodeips.txt are placed in ~/bin. These are files that contain the names and ip addresses of all of the nodes in the VMSS a copy of the nodenames.txt is placed in /mnt/resource/hosts. Each of these nodes should be accesible with the following command:

ssh {username}@{vm-private-ip-address}

Licensing

Currently the default settings point to an internal Microsoft ANSYS license that can only be used for benchmarking, performance testing, and other non-sales related activities. If you are using this as part of a sales process you will need to simply place the IP address of the ANSYS license server in as a parameter at deploy time.

We have updated this to use a VPN, please contact [email protected] for information regarding how to use the ANSYS license.

Known Issues

The Jumpbox takes the name given in the vmssName parameter and appends a 'jb' for its hostname. It is a known bug that Fluent will not properly communicate with the license server if the hostname is longer than 12 characters. The vmssName parameter is limited to 10 characters for that reason. H-Series VMs are only available in the South Central region, A8 and A9 VMs are only available in East US, North Central US, South Central US, West US, North Europe, West Europe, and Japan East.

ahod-hpc's People

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.