Meta AITemplate Transforms Deep Neural Networks into C++ Code

Summary

InfoQ: Meta Has Developed an AITemplate Which Transforms Deep Neural Networks into C++ Code

Meta created a Python framework called AITemplate that improves inference performance for deep neural networks by:

  • transforming and optimizing the graph
  • converting code to C++

Details

  • created by Meta
  • for deep neural networks
  • accelerates inference:
    • optimizes models with graph transformations
  • does not improve training performance
  • Works with AMD and Nvidia GPUs
    • Future compatibility for M-series Apple GPUs as well.
  • results compared to “eager mode” in PyTorch:
    • improved performance 12x with Nvidia GPUs
    • improved performance
  • probably only works with PyTorch, not tensorflow?