Meta AITemplate Transforms Deep Neural Networks into C++ Code
Summary
InfoQ: Meta Has Developed an AITemplate Which Transforms Deep Neural Networks into C++ Code
Meta created a Python framework called AITemplate that improves inference performance for deep neural networks by:
- transforming and optimizing the graph
- converting code to C++
Details
- created by Meta
- for deep neural networks
- accelerates inference:
- optimizes models with graph transformations
- does not improve training performance
- Works with AMD and Nvidia GPUs
- Future compatibility for M-series Apple GPUs as well.
- results compared to “eager mode” in PyTorch:
- improved performance 12x with Nvidia GPUs
- improved performance
- probably only works with PyTorch, not tensorflow?