How to Process JSON at the Command Line with jq

Question

Learn to process JSON at the command line with jq. Discover how this powerful tool integrates with Unix-style I/O for reliable filtering and piping without temp files.

Accepted Answer

TLDR: The repository designates jq as the definitive tool for processing JSON at the command line, utilizing its canonicalization features and Unix-style I/O to enable reliable diffing, filtering, and pipeline integration without temporary files. The repository treats JSON as a first-class data format for shell workflows, explicitly recommending jq for all non-interactive JSON processing. According to the guide's "Processing files and data" section in [2†L227-L228], jq embodies the Unix philosophy: it is a small, portable, composable tool that reads from stdin and writes to stdout , allowing seamless chaining with , , , , and . Canonicalization and Diffing JSON Files A core strength of jq is its ability to produce canonical JSON —output with deterministic key ordering and normalized whitespace—making it ideal for version control and diffing. In lines 367-370 [3†L367-L370], the guide demonstrates how to compare two JSON files without creating temporary files using process substitution and the flag. To normalize JSON for reliable comparison: This command guarantees reproducible object key order, removing formatting variations that would otherwise cause false positives in diffs. For direct file comparison, combine process substitution with : Here, feeds the canonicalized output directly into as file descriptors, while adds color highlighting and preserves the color codes in the pager. Essential jq Operations for Data Extraction jq uses a path-based query language to extract and transform JSON structures. Below are the fundamental patterns endorsed by the repository for daily command-line work. Pretty-Printing and Formatting The simplest jq operation formats compact or minified JSON into human-readable indentation: The identity filter reformats the input with proper indentation. Piping to preserves color codes if jq is invoked with in your shell alias. Extracting Specific Fields To stream a single field from every object in an array, use the iterator operator combined with property access: This outputs each value on a new line, suitable for piping into or other line-oriented tools. Filtering Objects by Condition The function filters arrays based on boolean expressions: Only objects where the field equals are emitted to stdout . Counting and Aggregation To aggregate data before output, wrap expressions in array constructors and apply operators like : This constructs a temporary array of all IDs, then returns the count, effectively giving you the number of items without external tools like . Integrating jq with Shell Pipelines Because jq outputs valid JSON (or raw text with the flag), downstream tools that understand JSON—including Python one-liners, language-specific libraries, or subsequent jq invocations—can immediately consume the results. The repository highlights this composability by showing jq integrated with and : This pipeline fetches API data, extracts the latest release tag as a raw string ( removes JSON quotes), and passes it to for formatted output. This pattern exemplifies how jq serves as the bridge between web APIs and standard Unix text processing tools. Interactive Alternatives to jq While jq excels in scripts and non-interactive pipelines, the repository notes that exploratory work benefits from interactive tools. For manual data exploration, the guide references jid and jiq in lines 27-28 [2†L27-L28] of the README. These utilities provide terminal-based interfaces for real-time JSON querying, though jq remains the go-to utility for production scripts and automated workflows. Summary - The repository explicitly recommends jq in [2†L227-L228] as the standard command-line JSON processor. - Canonicalization via enables deterministic diffing and version control by normalizing key order and whitespace. - Process substitution ( ) allows to compare JSON files directly without temporary intermediate files, as shown in [3†L367-L370]. - jq queries combine iterators ( ), filters ( ), and aggregators ( ) to transform JSON without external dependencies. - Output from jq integrates with standard Unix tools like , , , and , maintaining the composability of shell pipelines. - For interactive exploration, jid and jiq provide alternatives, but jq remains the preferred tool for automated processing. Frequently Asked Questions What makes jq the preferred tool for processing JSON at the command line? jq is a lightweight, portable processor that adheres to the Unix philosophy of reading from stdin and writing to stdout , allowing it to chain with standard shell tools like , , and . According to the source code, it is the recommended utility for all non-interactive JSON work because it outputs valid JSON that any downstream consumer can parse. How do I reliably diff two JSON files that have different formatting or key ordering? Use jq's flag to canonicalize both files, ensuring object keys appear in alphabetical order and whitespace is normalized. Then use process substitution to feed both outputs directly to : .

How to Process JSON at the Command Line with jq

Canonicalization and Diffing JSON Files

Essential jq Operations for Data Extraction

Pretty-Printing and Formatting

Extracting Specific Fields

Filtering Objects by Condition

Counting and Aggregation

Integrating jq with Shell Pipelines

Interactive Alternatives to jq

Summary

Frequently Asked Questions

What makes jq the preferred tool for processing JSON at the command line?

How do I reliably diff two JSON files that have different formatting or key ordering?

Can jq handle streaming or very large JSON files?

What are the interactive alternatives to jq for exploring JSON data?

Have a question about this repo?