how-to-guide

How to Reorder Columns in pandas: 4 Methods to Reorganize Your DataFrame

February 12, 2026 pandas-dev/pandas ↗

Use df.reindex(columns=new_order) when you need to handle missing columns safely, or df[new_order] for the fastest performance when all columns exist.

Reorganizing column layout is a common data preparation task when working with tabular data. Whether you need to prioritize certain features for visualization or match a specific schema for export, understanding how to perform a pandas reorder columns operation efficiently ensures your data pipelines remain performant. This guide examines the actual implementation in the pandas-dev/pandas repository to show you exactly how column reordering works under the hood and which method to choose for your specific use case.

How pandas Stores Column Order Internally

In pandas, a DataFrame stores its column labels in an Index object defined in the underlying NDFrame base class. The column order is not merely cosmetic; it determines how the underlying 2-dimensional data aligns with the label index. When you request a pandas reorder columns operation, the library constructs a new Index with your desired sequence and aligns the data accordingly, returning a new DataFrame while leaving the original immutable.

This architecture explains why all reordering methods ultimately rely on Index manipulation. The high-level APIs in pandas/core/frame.py and pandas/core/generic.py delegate to the core Index operations defined in pandas/core/indexes/base.py and exposed through pandas/core/indexes/api.py.

Method 1: Reorder Columns with DataFrame.reindex

The DataFrame.reindex method is the canonical approach for reordering columns, especially when your target order might include labels not present in the original DataFrame. According to the implementation in pandas/core/frame.py, this method builds a new DataFrame by looking up each label in your supplied columns list.

Internally, reindex calls the generic NDFrame.reindex implementation, which ultimately uses the column Index's reindex method to compute new positions. If a column label in your new order does not exist, pandas inserts a column filled with NaN values (or a user-provided fill_value), ensuring your schema remains intact.

import pandas as pd

df = pd.DataFrame({
    "A": [1, 2, 3],
    "B": [4, 5, 6],
    "C": [7, 8, 9]
})

# Reorder with potential missing columns

new_order = ["B", "D", "A"]  # "D" does not exist

df_reordered = df.reindex(columns=new_order, fill_value=0)

Method 2: Reorder Columns Using loc Indexing

When you need to pandas reorder columns and you are certain all target columns exist in the DataFrame, loc indexing provides the most explicit label-based selection. As implemented in pandas/core/generic.py, df.loc[:, new_order] selects columns by their labels in the exact sequence you specify.

Because loc works directly on the label axis without additional alignment checks for missing data, it avoids the overhead associated with reindex. This makes it particularly fast for pure reordering tasks where the column set is static and known.


# Reorder using loc - all columns must exist

new_order = ["C", "A", "B"]
df_reordered = df.loc[:, new_order]

Method 3: Direct Column Selection with Bracket Notation

For the most concise syntax when performing a pandas reorder columns operation, direct bracket notation df[new_order] offers syntactic sugar over the loc mechanism. When you pass a list of column labels to the DataFrame's __getitem__ method (defined in pandas/core/frame.py), pandas treats it as a column-axis slice.

This approach is functionally equivalent to loc for existing columns but requires less typing. It is the idiomatic choice for quick data exploration and simple reordering tasks in interactive environments.


# Most concise syntax

new_order = ["C", "A", "B"]
df_reordered = df[new_order]

Method 4: Advanced Reordering with the Index API

For advanced use cases requiring low-level control over the pandas reorder columns process, you can manipulate the underlying Index object directly. The Index class in pandas/core/indexes/base.py provides methods like reindex and take that power the high-level DataFrame operations.

When you call df.columns.reindex(new_order), you receive a tuple containing the new Index and an indexer array. You can then use df.take(indexer, axis=1) to reorder the data accordingly. This approach exposes the internal mechanics used by DataFrame.reindex and is useful for library authors or performance-critical applications requiring custom alignment logic.


# Low-level Index API approach

new_order = ["C", "A", "B"]
new_idx, indexer = df.columns.reindex(new_order)
df_reordered = df.take(indexer, axis=1)

Choosing the Right Method for Your Use Case

Selecting the optimal approach for reordering columns depends on your specific requirements regarding missing data handling, performance, and code clarity.

Situation	Recommended API	Reason
All columns exist and you need maximum performance	`df[new_order]` or `df.loc[:, new_order]`	Minimal overhead; no missing value checks according to `pandas/core/generic.py`
Some columns may be missing (need NaN placeholders)	`df.reindex(columns=new_order, fill_value=…)`	Automatically inserts missing columns as implemented in `pandas/core/frame.py`
Simultaneous row and column reorder	`df.reindex(index=row_order, columns=col_order)`	Single call for both axes using `NDFrame.reindex`
Custom alignment logic required	`df.columns.reindex()` + `df.take()`	Direct access to Index mechanics in `pandas/core/indexes/base.py`

Because pandas stores column order in the Index object, any operation that produces a new Index automatically updates the DataFrame's column layout. The high-level APIs listed above hide these details while still leveraging the same underlying C-extension code paths for performance.

Summary

Reordering columns in pandas leverages the library's Index-based architecture to align data with new label sequences efficiently. The key approaches include:

DataFrame.reindex for flexible reordering that handles missing columns by inserting NaN values or custom fill values.
loc indexing for explicit label-based selection when all target columns exist and you want to avoid alignment overhead.
Direct bracket notation for the most concise syntax in simple reordering tasks.
Index API methods for low-level control over the reordering mechanics using reindex and take.

All methods respect pandas' immutable-by-default design, returning new DataFrame objects while preserving the original data and metadata.

Frequently Asked Questions

How do I reorder columns in pandas without losing data?

All standard reordering methods in pandas return new DataFrame objects and leave the original data intact. Use df[new_order] for existing columns or df.reindex(columns=new_order) if you need to handle potential missing columns. Both approaches preserve your data and metadata according to the implementation in pandas/core/frame.py.

What is the fastest way to reorder DataFrame columns?

For maximum performance when all columns exist, use direct bracket notation df[new_order] or df.loc[:, new_order]. These methods avoid the alignment overhead present in reindex because they work directly on the label axis without checking for missing values, as implemented in pandas/core/generic.py.

Can I reorder columns and rows at the same time in pandas?

Yes, use df.reindex(index=row_order, columns=col_order) to reorder both axes in a single operation. This method leverages the generic NDFrame.reindex implementation to align both the row and column indices simultaneously, handling any missing labels according to your specified fill_value.

How does pandas handle missing columns when reordering?

When using df.reindex(columns=new_order), pandas automatically inserts columns for any labels in new_order that do not exist in the original DataFrame. These new columns are filled with NaN values by default, or with a custom value if you specify fill_value. This behavior is defined in the Index.reindex method in pandas/core/indexes/base.py.

Have a question about this repo?

These articles cover the highlights, but your codebase questions are specific. Give your agent direct access to the source. Share this with your agent to get started:

Share the following with your agent to get started:

curl -s https://instagit.com/install.md

Works with

Claude Codex Cursor VS Code OpenClaw Any MCP Client