在 Amazon Inferentia 上为 PyTorch 自然语言处理应用程序实现 12 倍的吞吐量和低延迟_AI/ML_亚马逊云科技 (Amazon Web Services）