How to Use 'tf.GradientTape'

TensorFlow’s tf.GradientTape is a powerful tool for automatic differentiation, enabling the computation of gradients for training machine learning models. This blog post will guide you through the basics of using tf.GradientTape, followed by a simple image classification example using the Common Objects in Context (COCO) dataset and TensorFlow’s Keras API.

What Is tf.GradienTape?

tf.GradientTape is a context manager that records operations for automatic differentiation. This is particularly useful for custom training loops and gradient-based optimization.

Simple Example of tf.GradienTape

Before diving into the image classification example, let’s look at a simple example of using tf.GradientTape to compute gradients.

import tensorflow as tf

# Define a simple computation
x = tf.Variable(3.0)
with tf.GradientTape() as tape:
    y = x ** 2

# Compute the gradient of y with respect to x
grad = tape.gradient(y, x)
print(f'The gradient of y = x^2 with respect to x is {grad.numpy()}')  # Output: 6.0

This example demonstrates the basic usage of tf.GradientTape to compute the gradient of a function.

Image Classification with the COCO Dataset

Now, let’s move on to a more complex example: image classification using the COCO dataset and TensorFlow’s Keras API.

Prepare the Dataset

We’ll use TensorFlow Datasets to load the COCO dataset. For simplicity, we’ll use the CIFAR-10 dataset to demonstrate.

import tensorflow as tf
import tensorflow_datasets as tfds

# Load the CIFAR-10 dataset
(ds_train, ds_test), ds_info = tfds.load(
    'cifar10',
    split=['train', 'test'],
    shuffle_files=True,
    as_supervised=True,
    with_info=True,
)

# Normalize the images
def normalize_img(image, label):
    return tf.cast(image, tf.float32) / 255.0, label

ds_train = ds_train.map(normalize_img, num_parallel_calls=tf.data.experimental.AUTOTUNE)
ds_train = ds_train.cache().shuffle(ds_info.splits['train'].num_examples).batch(32).prefetch(tf.data.experimental.AUTOTUNE)
ds_test = ds_test.map(normalize_img, num_parallel_calls=tf.data.experimental.AUTOTUNE)
ds_test = ds_test.batch(32).cache().prefetch(tf.data.experimental.AUTOTUNE)

Define the Model

We’ll use a simple Convolutional Neural Network (CNN) for the classification task.

from tensorflow.keras import layers, models

model = models.Sequential([
    layers.Conv2D(32, (3, 3), activation='relu', input_shape=(32, 32, 3)),
    layers.MaxPooling2D((2, 2)),
    layers.Conv2D(64, (3, 3), activation='relu'),
    layers.MaxPooling2D((2, 2)),
    layers.Conv2D(64, (3, 3), activation='relu'),
    layers.Flatten(),
    layers.Dense(64, activation='relu'),
    layers.Dense(10)  # 10 classes for CIFAR-10
])

model.compile(optimizer='adam', loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True), metrics=['accuracy'])

Train the Model Using model.fit

We’ll train the model using the Keras model.fit method, which abstracts away much of the complexity of training.

model.fit(ds_train, epochs=5, validation_data=ds_test)

This approach simplifies the training process significantly. However, let’s incorporate tf.GradientTape for a custom training loop to demonstrate its flexibility and use.

Custom Training Loop with tf.GradientTape

For those interested in more control over the training process, here’s how you can use tf.GradientTape.

loss_object = tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True)
optimizer = tf.keras.optimizers.Adam()

@tf.function
def train_step(images, labels):
    with tf.GradientTape() as tape:
        predictions = model(images, training=True)
        loss = loss_object(labels, predictions)
    gradients = tape.gradient(loss, model.trainable_variables)
    optimizer.apply_gradients(zip(gradients, model.trainable_variables))
    return loss

# Custom training loop
EPOCHS = 5
for epoch in range(EPOCHS):
    for images, labels in ds_train:
        loss = train_step(images, labels)
    print(f'Epoch {epoch + 1}, Loss: {loss.numpy()}')

What's next? We recommend PyImageSearch University.

Course information:
86+ total classes • 115+ hours hours of on-demand code walkthrough videos • Last updated: December 2025
★★★★★ 4.84 (128 Ratings) • 16,000+ Students Enrolled

I strongly believe that if you had the right teacher you could master computer vision and deep learning.

Do you think learning computer vision and deep learning has to be time-consuming, overwhelming, and complicated? Or has to involve complex mathematics and equations? Or requires a degree in computer science?

That’s not the case.

All you need to master computer vision and deep learning is for someone to explain things to you in simple, intuitive terms. And that’s exactly what I do. My mission is to change education and how complex Artificial Intelligence topics are taught.

If you're serious about learning computer vision, your next stop should be PyImageSearch University, the most comprehensive computer vision, deep learning, and OpenCV course online today. Here you’ll learn how to successfully and confidently apply computer vision to your work, research, and projects. Join me in computer vision mastery.

Inside PyImageSearch University you'll find:

✓ 86+ courses on essential computer vision, deep learning, and OpenCV topics
✓ 86 Certificates of Completion
✓ 115+ hours hours of on-demand video
✓ Brand new courses released regularly, ensuring you can keep up with state-of-the-art techniques
✓ Pre-configured Jupyter Notebooks in Google Colab
✓ Run all code examples in your web browser — works on Windows, macOS, and Linux (no dev environment configuration required!)
✓ Access to centralized code repos for all 540+ tutorials on PyImageSearch
✓ Easy one-click downloads for code, datasets, pre-trained models, etc.
✓ Access on mobile, laptop, desktop, etc.

Click here to join PyImageSearch University

Summary

In this blog post, we’ve seen how to use tf.GradientTape for custom training loops in TensorFlow, with a practical example using a simple CNN for image classification. We also demonstrated a straightforward approach using the model.fit method for training. This approach provides flexibility and control over the training process, which is essential for many advanced machine learning tasks.

Feel free to expand upon this example, experiment with different models, and apply it to the full COCO dataset for more complex image classification tasks. Happy coding!

Citation Information

Martinez, H. “How to Use ‘tf.GradientTape’,” PyImageSearch, P. Chugh, A. R. Gosthipaty, S. Huot, K. Kidriavsteva, and R. Raha, eds., 2024, https://pyimg.co/h1f9s

@incollection{Martinez_2024_How-to-Use-tf.GradientTape,
  author = {Hector Martinez},
  title = {How to Use 'tf.GradientTape'},
  booktitle = {PyImageSearch},
  editor = {Puneet Chugh and Aritra Roy Gosthipaty and Susan Huot and Kseniia Kidriavsteva and Ritwik Raha},
  year = {2024},
  url = {https://pyimg.co/h1f9s},
}

Join the PyImageSearch Newsletter and Grab My FREE 17-page Resource Guide PDF

Enter your email address below to join the PyImageSearch Newsletter and download my FREE 17-page Resource Guide PDF on Computer Vision, OpenCV, and Deep Learning.