TorchAO: PyTorch-Native Training-to-Serving Model Optimization Paper • 2507.16099 • Published Jul 21 • 6
Don't Transform the Code, Code the Transforms: Towards Precise Code Rewriting using LLMs Paper • 2410.08806 • Published Oct 11, 2024 • 1
Compiler generated feedback for Large Language Models Paper • 2403.14714 • Published Mar 18, 2024 • 7
Priority Sampling of Large Language Models for Compilers Paper • 2402.18734 • Published Feb 28, 2024 • 19
EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine Paper • 2206.10558 • Published Jun 21, 2022 • 1
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs Paper • 2507.05687 • Published Jul 8 • 27
view article Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels Aug 18 • 88