Skip to content

Lab 7: Web Deployment

By Charles Frye. Published September 7, 2022.

In this lab, we'll take the leap from ML model to ML-powered application by packaging our text recognizer into a portable TorchSript binary, wrapping that binary up into a serverless cloud function, and building a simple UI in Python with gradio.

Outline

  • 00:00 Overview
  • 01:06 Compiling the model to TorchScript
  • 06:00 Why not deploy on GPUs?
  • 08:58 Building a GUI with gradio
  • 15:34 Spinning up a model service
  • 21:11 Creating a public URL with ngrok
  • 24:52 Writing a Dockerfile for our server
  • 30:06 Recap