Parquet is a columnar storage file format designed for efficient data storage and retrieval in big data processing systems. It provides high compression ratios and fast query performance, making it ideal for analytical workloads and data warehousing applications.
In the Sources tab, click on the Add source button located on the top right of your screen. Then, select the Parquet option from the list of connectors.
Click Next and you’ll be prompted to upload your file.
As soon as you upload it, optionally, you have the chance to edit the file’s name. You can’t have more than one file with the same name, but you can have the same file uploaded more than once with different names if you ever need it.
Your Parquet source was added! Now, for you to be able to see it on your datalake, you have to Trigger the source pipeline and wait for the complete run.
As soon as it has been successfully run for the first time, you’ll be able to play with your new table in the Catalog.
Let us know through our chat if you face any blocker and we’ll be happy to help!