Parsing a CSV file
CSV files are a popular data source for ETL applications since numerous database systems import and export data in this format.
To parse a CSV file, you can follow these steps:
From the home page, click on Soma. This will lead you to the Soma Connection page.
On the Soma Connection page, select the CSV file you want to upload. Your CSV file will appear on the Data page within the Soma workspace.
Next, parse the raw CSV data so that it appears in a spreadsheet format, separated into rows and columns. To do this, click on the dropdown menu next to the 'body' column.
Choose 'Parse > CSV' from the menu.
Select the delimiter used in the file, and then choose 'Set first row as header'. The dataset will now appear in a spreadsheet format. Soma will add the 'parse-as-csv' directive to the recipe.
If you don't need the 'body' column for data cleansing and analysis, you can delete it. From the dropdown menu, choose 'Delete column'. Soma will add the 'drop' directive to the recipe.
Your dataset is now parsed and ready for data cleansing and analysis!
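To make the effect of the steps above more concrete, here is a minimal Python sketch of what parsing the 'body' column and then dropping it amounts to. It is not Soma's implementation: the function name `parse_body_as_csv`, its parameters, and the fallback column names are hypothetical, and it assumes each row holds exactly one CSV record in its 'body' field (no quoted fields spanning several lines).

```python
import csv


def parse_body_as_csv(rows, delimiter=",", first_row_as_header=True, drop_body=True):
    """Split the raw 'body' text of each row into named columns.

    rows: list of dicts, each with a 'body' key holding one raw CSV line.
    Hypothetical helper for illustration only; not part of Soma.
    """
    # csv.reader accepts any iterable of strings and handles quoted fields.
    records = list(csv.reader((row["body"] for row in rows), delimiter=delimiter))
    if not records:
        return []

    if first_row_as_header:
        # The first record supplies the column names and is not kept as data.
        header = records[0]
        source_rows, records = rows[1:], records[1:]
    else:
        # Assumed placeholder names; a real tool may name columns differently.
        header = [f"col_{i + 1}" for i in range(len(records[0]))]
        source_rows = rows

    parsed = []
    for source, fields in zip(source_rows, records):
        new_row = dict(source)              # keep any other existing columns
        new_row.update(zip(header, fields))  # add one column per CSV field
        if drop_body:
            new_row.pop("body", None)        # mirrors the 'Delete column' step
        parsed.append(new_row)
    return parsed


raw = [
    {"body": "id,name,city"},
    {"body": '1,Ada,"London, UK"'},
    {"body": "2,Linus,Helsinki"},
]

for row in parse_body_as_csv(raw):
    print(row)
```

Passing `drop_body=False` keeps the original 'body' text next to the new columns, which corresponds to skipping the optional delete step.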