NVIDIA Workshops on Deep Learning, Volta, CUDA & OpenACC

The Stony Brook Institute for Advanced Computational Science and NVIDIA invite you to attend one or two days of in-depth workshop lectures covering a variety of topics including the latest NVIDIA Volta GPU architecture news and Pascal updates, hands on exposure to contemporary Deep Learning frameworks as well as NVIDIA CUDA and OpenACC.

Registration required. Workshop is free, but seating is limited. 
Click here to register

Day 1 - Thursday, June 29th :


  • Bradley Palmer, NVIDIA Solution Architect

  • Mathew Colgrove, PGI Compiler Engineering team

  • Barton Fiske, NVIDIA Senior Account Manager


08:30 am - 09:00 am - Introductions & NVIDIA Updates
09:00 am - 10:30 am - OpenACC Tutorial
10:00 am - 10:30 am - Break / refreshments provided
10:30 am - 12:30 pm - OpenACC Tutorial

12:30 pm - 01:30 pm - Lunch to be provided (Panera menu, vegetarian options included]

01:30 pm - 03:00 pm - Image Classification with DIGITS Lab
03:00 pm - 03:15 pm - Break
03:15 pm - 04:30 pm - Deep Learning for Image Segmentation with TensorFlow Lab
04:30 pm - 05:00 pm - General Q&A / Meet & Greet - All


Day 2 - Friday, June 30th :


  • Ward Eldred, NVIDIA Solution Architect


08:30 am - 09:00 am - GTC 2017 Highlights
09:00 am - 10:00 am - GPU Acceleration for HPC and Deep Learning
10:00 am - 10:30 am - Break / refreshments provided
10:30 am - 12:00 pm - NVIDIA GPU Architecture and CUDA Deep Dive

12:30 pm - 01:30 pm - Lunch to be provided (Panera menu, vegetarian options included]

01:00 pm - 02:30 pm - Object Detection with DIGITS Lab
02:30 pm - 03:00 pm - Wrap up summary, General Q&A 

Detailed Session Descriptions:

Day 1

Introductions and NVIDIA Updates - speaker, Barton Fiske

A brisk overview of the agenda for the two days as well as latest/greatest breaking news on NVIDIA and topics relevant to the workshop material. 

OpenACC Tutorial (led remotely by Mathew Colgrove, PGI Compiler team

NVIDIA GPUs are the world’s fastest and most efficient accelerators delivering world record scientific application performance. NVIDIA CUDA is the most pervasive parallel computing model, used by over 370 scientific applications and over 150,000 developers worldwide.

This workshop will focus on introducing scientific computing and programming concepts utilizing NVIDIA GPUs to accelerate applications. The workshop will introduce programming techniques using OpenACC paradigms as well as optimization, profiling, and methods for GPU programming.

The workshop will cover:

  • High Level Overview of GPU Architecture

  • Introduction to GPU programming with OpenACC

  • Hands on examples demonstrating the OpenACC optimization process

Image Classification with DIGITS by Brad Palmer, NVIDIA Solution Architect

Deep learning is giving machines near human levels of visual recognition capabilities and disrupting many applications by replacing hand-coded software with predictive models learned directly from data. This lab introduces the machine learning workflow and provides hands-on experience with using deep neural networks (DNN) to solve a real-world image classification problem. You will walk through the process of data preparation, model definition, model training and troubleshooting, validation testing and strategies for improving model performance. You will also see the benefits of GPU acceleration in the model training process. On completion of this lab you will have the knowledge to use NVIDIA DIGITS to train a DNN on your own image classification dataset.


Deep Learning for Image Segmentation with TensorFlow by Brad Palmer, NVIDIA Solution Architect

There are a variety of important applications that need to go beyond detecting individual objects within an image and instead segment the image into spatial regions of interest. For example, in medical imagery analysis it is often important to separate the pixels corresponding to different types of tissue, blood or abnormal cells so that we can isolate a particular organ. In this lab we will use the TensorFlow deep learning framework to train and evaluate an image segmentation network using a medical imagery dataset.


Day 2 - presented - by Ward Eldred, NVIDIA Solution Architect

NVIDIA GPU Technology Conference (GTC17) Highlights

A selection of highlights from the NVIDIA GPU Technology Conference will be presented including the newest Volta GPU, DGX1-V, NVIDIA GPU Cloud and more. 


GPU Acceleration for HPC and Deep Learning

This session will explain why GPU acceleration is important and review the different ways you can take advantage of GPU acceleration. GPU programming concepts will also be introduced. This session will include information on deep learning with GPUs and NVIDIA’s Deep Learning SDK


NVIDIA GPU Architecture and CUDA Deep Dive (including CUDA9)

This session will take a deeper look at NVIDIA’s current “Pascal” GPU architecture and features added to the CUDA9 toolkit and libraries along with a peek at the upcoming “Volta” GPU architecture and CUDA9 toolkit and libraries.


Object Detection with DIGITS

This lab explores three approaches to identify a specific feature within an image. Each approach is measured in relation to three metrics: model training time, model accuracy and speed of detection during deployment. On completion of this lab, you will understand the merits of each approach and learn how to detect objects using neural networks trained on NVIDIA DIGITS on real-world datasets. 

General guidelines/recommendations:

  • Participants should bring a laptop with an SSH client already installed. (PuTTY is recommended for PC Users, Mac & Linux users can use the built in "Terminal" application). Power and wireless connectivity will be provided.

  • A GPU in the laptop is not required. Hands on sessions make use of cloud based GPU services available through QwikLabs Participants are encouraged to join QwikLabs and create a user account prior to the event.

  • Basic Linux desktop and command line familiarity including use of a standard file editor such as VIM or Emacs.

  • Familiarity with software development tools and concepts: compiling, linking and using GNUMake.

  • Rudimentary programming experience in C/C++ (memory management using malloc/free, using pointers, etc)


Thursday, June 29, 2017 to Friday, June 30, 2017


8:30 am - 5:00 pm


IACS Seminar Room