how-to-guide

How to Convert a Python Dictionary to a pandas DataFrame

February 12, 2026 pandas-dev/pandas ↗

To convert a Python dictionary to a pandas DataFrame, pass the dict directly to pd.DataFrame() where keys become column names and values become column data, or use pd.DataFrame.from_dict() with orient='index' when keys represent row labels.

The pandas library provides optimized C-backed routines to convert Python mappings into tabular data structures. In the pandas-dev/pandas repository, the DataFrame constructor handles dictionary inputs through dedicated internal helpers that normalize data into efficient block storage. Understanding how to convert a Python dictionary to a pandas DataFrame properly ensures you leverage automatic type inference, alignment, and missing value handling without manual preprocessing.

How pandas Converts Dictionaries Internally

When you call pd.DataFrame(data) with a dictionary, pandas executes a specific code path in pandas/core/frame.py. Beginning around line 269, the DataFrame.__init__ method detects mappings via isinstance(data, Mapping) and delegates to the _from_dict helper function.

The _from_dict logic performs four critical steps:

Orientation inference: By default, dictionary keys map to column labels. Passing orient='index' transposes this so keys become row indices.
Data alignment: pandas standardizes column lengths by filling missing values with NaN to create a rectangular structure.
Type normalization: Values are coerced from lists, NumPy arrays, or Series into unified columnar arrays.
BlockManager creation: The normalized data feeds into BlockManager (defined in pandas/core/internals/managers.py), which organizes homogeneous data types into contiguous memory blocks for performance.

This process handles nested dictionaries, mixed-type values, and irregular lengths automatically.

Methods to Create a DataFrame from a Dict

pandas exposes two primary interfaces for dictionary conversion: the standard constructor and the specialized from_dict() class method.

Standard Constructor (Default Orientation)

The default pd.DataFrame() constructor treats dictionary keys as column names. Each value must be list-like (list, array, or Series) representing the column's data.

import pandas as pd

data = {
    "name": ["Alice", "Bob", "Charlie"],
    "age": [25, 30, 35],
    "city": ["NY", "LA", "Chicago"]
}
df = pd.DataFrame(data)
print(df)

      name  age     city
0    Alice   25       NY
1      Bob   30       LA
2  Charlie   35  Chicago

Using from_dict() with orient='index'

When your dictionary represents row-oriented data (outer keys are row labels, inner keys are column names), use pd.DataFrame.from_dict() with orient='index'.

nested = {
    "row1": {"A": 1, "B": 2},
    "row2": {"A": 3, "B": 4, "C": 5}
}
df = pd.DataFrame.from_dict(nested, orient="index")
print(df)

       A    B    C
row1   1    2  NaN
row2   3    4  5.0

Missing keys across inner dictionaries are automatically filled with NaN during the alignment phase.

Handling Nested and Mixed-Type Dictionaries

The constructor accepts heterogeneous value types—lists, NumPy arrays, and pandas Series—within the same dictionary. As implemented in pandas-dev/pandas, the internal _from_dict routine normalizes these into aligned columns.

import pandas as pd
import numpy as np

s = pd.Series([10, 20, 30], name="scores")
mixed = {
    "ids": [101, 102, 103],
    "values": np.array([0.1, 0.2, 0.3]),
    "scores": s
}
df = pd.DataFrame(mixed)
print(df)

   ids  values  scores
0  101     0.1      10
1  102     0.2      20
2  103     0.3      30

Even when combining standard Python lists with NumPy arrays or Series objects, pandas creates the underlying BlockManager structure to store each column in its optimal format.

Summary

Pass a dictionary directly to pd.DataFrame() to map keys to columns and values to column data.
Use pd.DataFrame.from_dict(data, orient='index') when dictionary keys represent row labels rather than column names.
The conversion logic resides in pandas/core/frame.py, specifically within DataFrame.__init__ and the _from_dict helper.
Missing values are automatically padded with NaN to ensure rectangular structure.
Mixed data types (lists, arrays, Series) are normalized during the BlockManager creation phase in pandas/core/internals/managers.py.

Frequently Asked Questions

Can I convert a dictionary with different length lists to a DataFrame?

Yes. When you convert a Python dictionary to a pandas DataFrame with unequal list lengths, pandas automatically extends shorter columns with NaN values to match the longest list. This alignment happens internally during the _from_dict processing in pandas/core/frame.py.

What is the difference between pd.DataFrame() and pd.DataFrame.from_dict()?

pd.DataFrame() is the general constructor that infers orientation automatically, while pd.DataFrame.from_dict() provides explicit control via the orient parameter. The from_dict() method is specifically optimized for dictionary inputs and allows you to specify 'columns' (default) or 'index' orientation explicitly, which modifies how the DataFrame class interprets the mapping keys during initialization.

How do I convert a dictionary to a DataFrame with keys as rows instead of columns?

Pass orient='index' to pd.DataFrame.from_dict(). This instructs pandas to treat the dictionary's outer keys as row labels and inner keys as column names. According to the source code in pandas/core/frame.py, this parameter flips the default orientation logic in the _from_dict helper to transpose the resulting structure.

Does pandas handle nested dictionaries automatically?

Yes. When you supply a nested dictionary (values are themselves dictionaries), pandas treats the outer keys as row indices and inner keys as columns, provided you specify orient='index'. Without this parameter, pandas attempts to treat the inner dictionaries as scalar values, which typically requires explicit orientation specification to interpret correctly.

Have a question about this repo?

These articles cover the highlights, but your codebase questions are specific. Give your agent direct access to the source. Share this with your agent to get started:

Share the following with your agent to get started:

curl -s https://instagit.com/install.md

Works with

Claude Codex Cursor VS Code OpenClaw Any MCP Client