Abstract
We present a new algorithm to automatically schedule Halide programs for high-performance image processing and deep learning. We significantly improve upon the performance of previous methods, which considered a limited subset of schedules. We define a parameterization of possible schedules much larger than prior methods and use a variant of beam search to search over it. The search optimizes runtime predicted by a cost model based on a combination of new derived features and machine learning. We train the cost model by generating and featurizing hundreds of thousands of random programs and schedules. We show that this approach operates effectively with or without autotuning. It produces schedules which are on average almost twice as fast as the existing Halide autoscheduler without autotuning, or more than twice as fast with, and is the first automatic scheduling algorithm to significantly outperform human experts on average.
Original language | English (US) |
---|---|
Article number | 123 |
Journal | ACM Transactions on Graphics |
Volume | 38 |
Issue number | 4 |
DOIs | |
State | Published - Jul 2019 |
Keywords
- Halide
- Optimizing compilers
ASJC Scopus subject areas
- Computer Graphics and Computer-Aided Design