A suite of high-performance command line tools for filtering, summarizing, joining, and manipulating large tabular data files.
TSV Utilities is a set of command line tools for manipulating large tabular data files, such as those found in machine learning and data mining environments. It provides fast filtering, statistics, sampling, and join operations, often outperforming similar tools. The toolkit targets data too large to load comfortably into memory with tools like R or Pandas, yet not so large as to require distributed systems like Hadoop.
Data scientists, machine learning engineers, and developers working with large tabular datasets who need efficient command line tools for data preparation and analysis. It is especially useful for those transitioning data between tools like R, Pandas, and Unix utilities.
Developers choose TSV Utilities for its high performance and ease of integration into Unix pipelines. It offers specialized tools that are faster than alternatives like awk, cut, and sort for many tabular data tasks, with features like named field support, Unicode readiness, and header-aware processing.
eBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.
Documented performance studies show tools like tsv-filter and tsv-join significantly outperform traditional Unix utilities like awk and sort for large tabular data, with optimizations like streaming algorithms and in-memory lookups.
Designed to follow Unix conventions, tools read from files or stdin and write to stdout, enabling seamless chaining with existing command-line workflows and complementing tools like cut and grep.
Offers specialized utilities for complex tasks such as weighted sampling in tsv-sample, statistical summaries with grouping in tsv-summarize, and header-aware processing across multiple files, reducing script complexity.
Supports named fields, wildcard matching, and bash completion for faster command construction, and includes utilities like keep-header to simplify common tasks like sorting with headers intact.
Tools like tsv-join and tsv-uniq rely on in-memory hash tables; the README notes that performance degrades beyond roughly 10 million unique entries, limiting their effectiveness for very large or high-cardinality datasets.
Building from source requires installing a D compiler (DMD or LDC) and optionally enabling LTO/PGO for optimal performance, which is more involved than using pre-built binaries or standard package managers like apt or brew.
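A minimal build sketch (a setup fragment, assuming git, make, and an LDC install are already present; the DCOMPILER variable is from the project's Makefile):

```shell
# Clone the repository and build with LDC, which generally produces
# the fastest binaries; plain `make` falls back to the default D compiler.
git clone https://github.com/eBay/tsv-utils.git
cd tsv-utils
make DCOMPILER=ldc2
```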
Primarily focused on TSV and CSV formats; lacks native support for other common data serialization formats like JSON, Parquet, or Avro, necessitating additional conversion steps in modern data workflows.