Designing with Versal AI Engine 3: Kernel Programming and Optimization

Course Description

This course covers the advanced features of the Versal™ ACAP AI Engine, including debugging an application in the Vitis™ unified software platform, using filter intrinsics, implementing a system design in hardware, and optimizing an AI Engine kernel program.The emphasis of this course is on:

  • Reviewing the advanced features of the Versal ACAP AI Engine architecture.
  • Optimizing AI Engine kernels using compiler directives, programming style, and efficient movement of data.
  • Describing C++ kernel template functionality
  • Identifying the different types of kernel instance states
  • Using AI Engine filter intrinsics and programming a FIR filter using filter intrinsics
  • Debugging applications using the Vitis unified software platform
  • Describing the Xilinx Add-on for MATLAB and Simulink tool for AI Engine kernel development

Release Date

December 2020


Connectivity 3

Training Duration

3 days

Who Should Attend?

Software and hardware developers, system architects, and anyone who needs to accelerate their software applications using Xilinx devices


  • Comfort with the C/C++ programming language
  • Software development flow
  • Vitis software for application acceleration development flow
  • Designing with Versal AI Engine 1: Architecture and Design Flow
  • Designing with Versal AI Engine 2: Graph Programming with AI Engine Kernels

Skills Gained

After completing this comprehensive training, you will have the necessary skills to:

  • Utilize various AI Engine kernel optimization techniques, such as compiler directives, software pipelining, coding for performance, and core utilization
  • Apply C coding guidelines for performance improvement, including function inlining, pointer restricting, and code shuffling
  • Identify and debug the various problems that arise in application development
  • Implement an AI Engine kernel using intrinsics for a symmetric FIR with mul4_sym and mac4_sym
  • Implement an AI Engine kernel using a non-symmetric FIR with mul4_nc and mac4_nc
  • Debug an application using the simulation debugging methodology and event traces
  • Use the Xilinx Add-on for MATLAB and Simulink tool for AI Engine kernel development and modeling of a heterogeneous device

Course Outline

  • AI Engine Architecture
  • Versal AI Engine Data Movement and Interfaces
  • Overview of AI Engine Kernel Optimization
  • AI Engine Kernel Optimization –Compiler Directives
  • AI Engine Kernel Optimization –Coding Style
  • Advanced C++ Kernel Programming
  • Vector Data Types (Review)
  • AI Engine Symmetric Filter Implementation
  • AI Engine Non-Symmetric Filter Implementation
  • Floating-Point Operations
  • Debugging AI Engine Applications 1
  • Debugging AI Engine Applications 2 (Use Cases)
  • Xilinx Add-on for MATLAB and Simulink for AI Engine Development

Event Schedule

No events found. Event request.


Updated at: 2021-07-18 17:11:54 +0200to the top