Pandas Melt (pd.melt) - PyImageSearch

Home » Blog » Pandas Melt (pd.melt)

Introduction to pandas melt() function

In this tutorial, you will learn about the Pandas melt() function, a powerful tool in Python’s Pandas library for reshaping your dataframes. Whether you’re new to data manipulation or looking to enhance your data preparation skills, understanding how to use Pandas melt() can significantly simplify your data transformation tasks.

Pandas is a staple in the data science community for its robust capabilities in data manipulation and analysis. The Pandas melt() function specifically is invaluable for turning wide data into long format, making it easier to analyze, visualize, and model. If you’ve ever struggled with cumbersome datasets or needed to restructure your data for better insight, this function will become a crucial part of your data wrangling toolkit.

Throughout this guide, we’ll dive deep into practical examples that not only demonstrate the syntax of pd.melt() but also illustrate its applications in real-world scenarios. By the end of this post, you’ll be equipped to reshape any dataframe to fit your analytical needs, ensuring your data tells the story you want to hear.

Configuring Your Development Environment

To follow this guide, you need to have the Pandas library installed on your system.

Luckily, Pandas is pip-installable:

$ pip install pandas

Need Help Configuring Your Development Environment?

**Figure 3:** Having trouble configuring your development environment? Want access to pre-configured Jupyter Notebooks running on Google Colab? Be sure to join PyImageSearch University — you will be up and running with this tutorial in a matter of minutes.

All that said, are you:

Short on time?
Learning on your employer’s administratively locked system?
Wanting to skip the hassle of fighting with the command line, package managers, and virtual environments?
Ready to run the code immediately on your Windows, macOS, or Linux system?

Then join PyImageSearch University today!

Gain access to Jupyter Notebooks for this tutorial and other PyImageSearch guides pre-configured to run on Google Colab’s ecosystem right in your web browser! No installation required.

And best of all, these Jupyter Notebooks will run on Windows, macOS, and Linux!

Project Structure

We first need to review our project directory structure.

Start by accessing this tutorial’s “Downloads” section to retrieve the source code and example images.

From there, take a look at the directory structure:

$ tree . --dirsfirst
.
└── pandas_melt_examples.py
0 directories, 1 files

Implementing Pandas melt()

Let’s start with a simple example to demonstrate the pandas melt() function. We’ll create a small dataset representing sales data for different products across multiple months, and then use pd.melt() to reshape this data.

Creating a Sample DataFrame for Pandas Melt

Before diving into the pd.melt() function, it’s crucial to understand how to prepare your data for transformation. A DataFrame is a two-dimensional tabular structure in pandas that organizes data into rows and columns. Below, we’ll create a simple DataFrame to demonstrate the steps involved in this process.

The DataFrame is structured in a wide format, where columns like January, February, and March represent sales data across different months. It is the starting point for using the pd.melt() function to reshape data into a long format, which is better suited for certain types of analysis and visualization.

Simple pd.melt Example:

Suppose we have a dataframe that lists the monthly sales figures for three different products: A, B, and C. The data is structured with columns for each month and rows for each product.

# Import Pandas library
import pandas as pd
import numpy as np


# Creating the example dataframe
data = {
	'Product': ['A', 'B', 'C'],
	'January': [100, 150, 200],
	'February': [120, 160, 210],
	'March': [130, 170, 220]
}
df = pd.DataFrame(data)
print("Original Data:")
print(df)


#Apply Pandas melt 
melted_df = pd.melt(df, id_vars=['Product'], var_name='Month', value_name='Sales')
print("\nMelted Data:")
print(melted_df)

We start on Line 1-2, we Import Python Library. The script begins by importing the Pandas and other python packages, which is essential for data manipulation, including datetime conversion.

Line 4-10 – Create Sample Data:

Product: An array containing the names of the products (‘A’, ‘B’, ‘C’). This represents the different items being sold.
January, February, March: These keys correspond to arrays containing integers ([100, 150, 200], [120, 160, 210], [130, 170, 220]), representing the sales figures for each product for each month.

Line 11 – Creating a Pandas DataFrame: Constructs a DataFrame from the data dictionary. The DataFrame, df, organizes the data in a tabular form conducive to manipulation and visualization.

Line 12-13 – Print the sample data.

Line 16 – Implement pd.melt():

id_vars: Specifies the columns that will remain vertical (as identifier variables), which in this case are [‘Product’].
var_name: Names the new column that will contain the former column headers of melted columns, which are designated as Month.
value_name: Names the new column that will contain the values from the melted columns, here called Sales.

Line 17-18 – Printing Results: The converted datetime objects are printed to demonstrate the conversion effect.

When we run this code, the output will show the original wide-format dataframe and then the melted long-format dataframe. The pandas melt() function takes the ‘Product’ column as the identifier variable (id_vars), and it treats the other columns (‘January’, ‘February’, ‘March’) as value variables, which are unpivoted to the row axis, forming two new columns: ‘Month’ and ‘Sales’. The output should looks similar to below:

Original Extended Data:
  Product  January  February  March
       A      100       120    130
       B      150       160    170
       C      200       210    220




Melted Extended Data:
Product    Month  Sales
   	A  January	100
   	B  January	150
   	C  January	200
   	A  February	120
   	B  February	160
   	C  February	210
   	A  March	130
   	B  March	170
   	C  March	220

This transformation is particularly useful for statistical modeling or plotting functions that expect data in a long format, making pd.melt() an indispensable tool for data scientists.

Understanding the ignore_index Parameter

The ignore_index parameter in the pd.melt() function controls whether the index of the original DataFrame is retained in the reshaped DataFrame. By default, pd.melt() assigns a new sequential index to the resulting DataFrame, which can sometimes be beneficial for creating cleaner outputs or simplifying further operations. However, retaining the original index may be important in cases where the index conveys meaningful information.

Here’s a breakdown of how the ignore_index parameter works:

Default Behavior (ignore_index=True): The reshaped DataFrame will have a new integer index, starting from 0.
Retaining Original Index (ignore_index=False): The reshaped DataFrame will preserve the original index, allowing you to trace rows back to the original DataFrame.

Example: Using ignore_index with pd.melt()

Below is an example demonstrating the effect of the ignore_index parameter:

import pandas as pd

# Sample DataFrame
data = {
    'Product': ['A', 'B', 'C'],
    'January': [100, 150, 200],
    'February': [120, 160, 210],
    'March': [130, 170, 220]
}

df = pd.DataFrame(data)

# Applying pd.melt with ignore_index=True (default)
melted_default = pd.melt(df, id_vars=['Product'], var_name='Month', value_name='Sales')
print("Melted Data (ignore_index=True):")
print(melted_default)

# Applying pd.melt with ignore_index=False
melted_with_index = pd.melt(df, id_vars=['Product'], var_name='Month', value_name='Sales', ignore_index=False)
print("\nMelted Data (ignore_index=False):")
print(melted_with_index)

Output:

Melted Data (ignore_index=True):
  Product     Month  Sales
0       A   January    100
1       B   January    150
2       C   January    200
3       A  February    120
4       B  February    160
5       C  February    210
6       A     March    130
7       B     March    170
8       C     March    220

Melted Data (ignore_index=False):
  Product     Month  Sales
0       A   January    100
1       B   January    150
2       C   January    200
0       A  February    120
1       B  February    160
2       C  February    210
0       A     March    130
1       B     March    170
2       C     March    220

In the second case, where ignore_index=False, you can see that the original indices from the wide-format DataFrame are preserved. This is particularly useful for tracing data back to the source, especially in complex datasets.

When to Use ignore_index=False

When the original index carries important information, such as time stamps, identifiers, or categories.
When you need to reference rows in the reshaped DataFrame back to the original structure.

Advanced Example Using Pandas melt()

Building on our previous example, let’s explore more complex scenarios where pd.melt() can be particularly useful. This time, we’ll introduce additional variables and demonstrate how to handle multiple identifier variables and filter which columns to melt. This gives us more control over the reshaping process.

Imagine we have a more detailed dataset that includes not only the sales data for products A, B, and C across several months but also their corresponding categories and target sales figures.

# Extended example dataframe
data = {
    'Product': ['A', 'B', 'C'],
    'Category': ['Electronics', 'Furniture', 'Electronics'],
    'Target_Sales': [300, 450, 500],
    'January': [100, 150, 200],
    'February': [120, 160, 210],
    'March': [130, 170, 220]
}
df = pd.DataFrame(data)
print("Original Extended Data:")
print(df)


# Melting the dataframe with multiple identifier variables
melted_df = pd.melt(df, id_vars=['Product', 'Category', 'Target_Sales'], var_name='Month', value_name='Sales')
print("\nMelted Extended Data:")
print(melted_df)

Line 17-24 – Sample Data with more Complex Arrays:

Product: An array containing the names of the products (‘A’, ‘B’, ‘C’). This represents the different items being sold.
Category: An array indicating the category of each product (‘Electronics’, ‘Furniture’, ‘Electronics’). This provides a classification for each product.
Target_Sales: An array of integers representing the target sales figures for each product ([300, 450, 500]). This is the sales goal for each item.
January, February, March: These keys correspond to arrays containing integers ([100, 150, 200], [120, 160, 210], [130, 170, 220]), representing the sales figures for each product for each month.

Line 25 – Creating a Pandas DataFrame Creation (df):

Constructs a DataFrame from the data dictionary. The DataFrame, df, organizes the data in a tabular form conducive to manipulation and visualization.

Line 26-27 – Printing Results: We print the original complex array date.

Line 30 – Implement pd.melt():

id_vars: Specifies the columns that will remain vertical (as identifier variables), which in this case are [‘Product’, ‘Category’, ‘Target_Sales’].
var_name: Names the new column that will contain the former column headers of melted columns, which are designated as Month.
value_name: Names the new column that will contain the values from the melted columns, here called Sales.

Line 31-32 – Printing Results: We print the converted datetime objects to show how the function handles different formats and errors.

Running this code will produce an output that includes both the original extended dataframe and the melted dataframe, now including category and target sales data aligned with each month’s sales. Here’s the output reflecting the melted dataframe using the pd.melt function:

Original Extended Data:
Product 	Category  Target_Sales  January  February  March
	A  Electronics       	300  	100   	120	130
 	B  Furniture           	450  	150   	160	170
 	C  Electronics       	500  	200   	210	220


Melted Extended Data:
Product 	Category  Target_Sales 	Month  Sales
   	A  Electronics       	300   January	100
   	B  Furniture           	450   January	150
   	C  Electronics       	500   January	200
   	A  Electronics       	300  February	120
   	B  Furniture           	450  February	160
   	C  Electronics       	500  February	210
   	A  Electronics       	300 	March	130
   	B  Furniture           	450 	March	170
   	C  Electronics       	500 	March	220

This enhanced example illustrates the flexibility of pd.melt(), allowing us to keep essential categorical information alongside the reshaped sales data, which can be pivotal for more detailed analysis or reporting.

This should clarify how each part of the pandas melt() function works and how the variables are structured to facilitate understanding and enable effective application in data manipulation tasks.

Consideration while using Pandas melt

When using the pandas melt function, there are several considerations and common issues that users should be aware of to effectively manage and avoid potential pitfalls:

Common Issues and Considerations with pd.melt():

Loss of Data Integrity:

When melting data, ensure that the identifier variables (id_vars) adequately summarize the necessary key columns. If not, the melting process can lead to a loss of context about the data values, making the dataset harder to understand or analyze accurately.

Performance Issues:

Melting can significantly increase the size of the DataFrame because it transforms it into a longer format. This can lead to performance issues, especially with very large datasets. It’s important to consider whether the long format is necessary for your specific analysis or if there are ways to aggregate the data before melting.

Repetitive Variable Names:

If the column names that are being melted are not unique or are only numerically different (e.g., ‘January_1’, ‘January_2’, etc.), it can be challenging to distinguish between them in the melted DataFrame. It’s essential to rename these columns meaningfully before melting to maintain clarity.

Alternative Function: pd.pivot_table()

While pd.melt() is useful for transforming data from wide to long format, sometimes you might need to perform the inverse operation or need a more controlled reshaping of your DataFrame. In such cases, pd.pivot_table() can be a more suitable alternative.

Example of `pd.pivot_table()`:

Suppose we want to pivot the melted data back into a wide format where we summarize the sales by the average per month and category.

# Assuming melted_df is already defined from previous steps
pivot_df = pd.pivot_table(melted_df, values='Sales', index=['Category'], columns=['Month'], aggfunc=np.mean)
print("Pivoted Data:")
print(pivot_df)

This code will summarize the sales by calculating the average for each product category per month, effectively pivoting the data back to a format that might be more useful for certain types of analysis.

This example demonstrates the flexibility of pd.pivot_table() for aggregating and reshaping data, providing an alternative method that might better suit certain analytical needs compared to pd.melt().

Next, we’ll provide a summary of what has been covered in this tutorial and highlight important considerations for using pandas melt.

What's next? We recommend PyImageSearch University.

Course information:
86+ total classes • 115+ hours hours of on-demand code walkthrough videos • Last updated: July 2026
★★★★★ 4.84 (128 Ratings) • 16,000+ Students Enrolled

I strongly believe that if you had the right teacher you could master computer vision and deep learning.

Do you think learning computer vision and deep learning has to be time-consuming, overwhelming, and complicated? Or has to involve complex mathematics and equations? Or requires a degree in computer science?

That’s not the case.

All you need to master computer vision and deep learning is for someone to explain things to you in simple, intuitive terms. And that’s exactly what I do. My mission is to change education and how complex Artificial Intelligence topics are taught.

If you're serious about learning computer vision, your next stop should be PyImageSearch University, the most comprehensive computer vision, deep learning, and OpenCV course online today. Here you’ll learn how to successfully and confidently apply computer vision to your work, research, and projects. Join me in computer vision mastery.

Inside PyImageSearch University you'll find:

✓ 86+ courses on essential computer vision, deep learning, and OpenCV topics
✓ 86 Certificates of Completion
✓ 115+ hours hours of on-demand video
✓ Brand new courses released regularly, ensuring you can keep up with state-of-the-art techniques
✓ Pre-configured Jupyter Notebooks in Google Colab
✓ Run all code examples in your web browser — works on Windows, macOS, and Linux (no dev environment configuration required!)
✓ Access to centralized code repos for all 540+ tutorials on PyImageSearch
✓ Easy one-click downloads for code, datasets, pre-trained models, etc.
✓ Access on mobile, laptop, desktop, etc.

Click here to join PyImageSearch University

Summary

This tutorial provides an in-depth exploration of the Pandas melt() function, a key tool in the Python Pandas library for reshaping dataframes from a wide to a long format. The function is particularly useful for making datasets easier to analyze, visualize, and model. The guide includes practical examples to demonstrate how to use pd.melt() effectively in real-world scenarios, illustrating the transformation of a dataset of sales data for different products across multiple months.

The tutorial also discusses common issues and considerations, such as potential loss of data integrity and performance issues due to the increase in DataFrame size. It offers insights into alternative methods like pd.pivot_table(), useful for the inverse operation of reshaping data back into a wide format, where it summarizes sales by the average per month and category. By the end of this tutorial, you should have a solid understanding of how to reshape any dataframe to fit your analytical needs with pd.melt(). To learn more about all pandas melt capabilities check out the developer doc.

Unleash the potential of computer vision with Roboflow - Free!

Step into the realm of the future by signing up or logging into your Roboflow account. Unlock a wealth of innovative dataset libraries and revolutionize your computer vision operations.
Jumpstart your journey by choosing from our broad array of datasets, or benefit from PyimageSearch’s comprehensive library, crafted to cater to a wide range of requirements.
Transfer your data to Roboflow in any of the 40+ compatible formats. Leverage cutting-edge model architectures for training, and deploy seamlessly across diverse platforms, including API, NVIDIA, browser, iOS, and beyond. Integrate our platform effortlessly with your applications or your favorite third-party tools.
Equip yourself with the ability to train a potent computer vision model in a mere afternoon. With a few images, you can import data from any source via API, annotate images using our superior cloud-hosted tool, kickstart model training with a single click, and deploy the model via a hosted API endpoint. Tailor your process by opting for a code-centric approach, leveraging our intuitive, cloud-based UI, or combining both to fit your unique needs.
Embark on your journey today with absolutely no credit card required. Step into the future with Roboflow.

Join Roboflow Now

Join the PyImageSearch Newsletter and Grab My FREE 17-page Resource Guide PDF

Enter your email address below to join the PyImageSearch Newsletter and download my FREE 17-page Resource Guide PDF on Computer Vision, OpenCV, and Deep Learning.

Introduction to pandas melt() function

Configuring Your Development Environment

Project Structure

Implementing Pandas melt()

Creating a Sample DataFrame for Pandas Melt

Simple pd.melt Example:

Understanding the ignore_index Parameter

Advanced Example Using Pandas melt()

Consideration while using Pandas melt

Common Issues and Considerations with pd.melt():

Alternative Function: pd.pivot_table()

Example of `pd.pivot_table()`:

What's next? We recommend PyImageSearch University.

Summary

Unleash the potential of computer vision with Roboflow - Free!

Join the PyImageSearch Newsletter and Grab My FREE 17-page Resource Guide PDF

About the Author

Comment section

PyImageSearch University

OpenCV Haar Cascades

LeNet – Convolutional Neural Network in Python

Count the total number of frames in a video with OpenCV and Python

Topics

Books & Courses

PyImageSearch

Introduction to pandas melt() function

Configuring Your Development Environment

Project Structure

Implementing Pandas melt()

Creating a Sample DataFrame for Pandas Melt

Simple pd.melt Example:

Understanding the ignore_index Parameter

Advanced Example Using Pandas melt()

Consideration while using Pandas melt

Common Issues and Considerations with pd.melt():

Alternative Function: pd.pivot_table()

Example of pd.pivot_table():

What's next? We recommend PyImageSearch University.

Summary

Unleash the potential of computer vision with Roboflow - Free!

Join the PyImageSearch Newsletter and Grab My FREE 17-page Resource Guide PDF

About the Author

Integrating Document Embedding in Gemini Pro: An Approach to Retrieval-Augmented Generation

Unlocking Image Clarity: A Comprehensive Guide to Super-Resolution Techniques

Comment section

Similar articles

You can learn Computer Vision, Deep Learning, and OpenCV.

Footer

Topics

Books & Courses

PyImageSearch

Access the code to this tutorial and all other 500+ tutorials on PyImageSearch

What's included in PyImageSearch University?

Example of `pd.pivot_table()`: