Skip to main content
When working with raw data from APIs, databases, and file uploads, you’ll often need to reshape, clean, or enrich it before it’s ready for analysis. This section provides step-by-step recipes for the most common transformations you’ll encounter. Each recipe includes the same transformation implemented in Athena SQL, BigQuery, and Python notebooks (using the Nekt SDK with PySpark), so you can pick the approach that fits your stack.

Schema and column operations

Structured data handling

Data quality

Combining and reshaping