OpenCL in Houston, TX

Offered in partnership with AMD, this professional four-day course is designed for programmers who want to develop comprehensive skills in writing and optimizing applications that fully leverage the data-parallel processing capabilities of GPUs.

Your fee includes

  • Use of a laptop equipped with an AMD Fusion APU
  • Choice of Linux or Windows operating system
  • Printed manual of all lectures
  • Electronic copy of lab exercises
  • OpenCL Quick Reference Guide
  • Certificate of Completion
  • Beverages and snacks
  • 90 days of post-training support (conditions apply)

Space is limited. Please register early to guarantee your spot.

Your Instructor

Gilbert Lee – Team Lead Software Developer

Gilbert heads up Acceleware’s professional services team, developing and optimizing applications for our clients. He has worked on a wide range of projects, from option pricing to seismic imaging and security scanners. His specialty is algorithmic design and analysis for ultimate performance. Gilbert’s previous experience includes formalized algorithm verification and automated scheduling solvers. In his spare time he enjoys solving puzzles and coding programs to crack ultra-hard problems! Gilbert has an M.Sc. in Computing Science from the University of Alberta and is a two-time ACM World Finalist and a Google CodeJam Finalist.


Tue to Fri: 9:00 AM – 5:00 PM (includes a one-hour break for lunch)


  • Day 1: Introduction to GPU Programming and GPU Architectures
    • Overview of GPU Computing
    • OpenCL Software
    • Data-Parallel Architectures and the OpenCL Programming Model
    • The OpenCL Memory Model & Work-item Cooperation
    • Hands-on Exercises: Buffer Allocation and Buffer Transfers; Simple Kernels; Local and Constant Memory
  • Day 2: Advanced GPU Programming and Debugging
    • Task Concurrency and Synchronization
    • Images and Graphics Interoperability
    • Debugging GPU Programs and Numerical Accuracy
    • Hands-on Exercises: Asynchronous Operations; Images and Graphics Interoperability; Debugging
  • Day 3: Introduction to Optimizations
    • Latency and the Execution Model
    • Arithmetic Optimizations
    • Memory Optimizations
    • Hands-on Exercises: Arithmetic Optimizations; Correcting memory access patterns and local memory bank conflicts
  • Day 4: Advanced Optimizations and Profiling
    • Profiling Applications
    • Case Study – 3D Convolution or Other Case Study
    • Multiple GPUs and Clusters of GPUs
    • Hands-on Exercises: Using the Stream Profiler to optimize a kernel; 3D Convolution – Putting it all together

Online registration is now closed.

Course details

Date: Oct 22 to Oct 25, 2013
Registration closes: Oct 8, 2013 at 8:00 PM (MST)
Location: Houston, TX (venue TBC)
Cost: $3,250 USD


Click here to see our terms and conditions.