Applying a Hashing algorithm to a column

In Soma, you can apply a hashing algorithm to a column of values, which converts each input into a unique, fixed-length string, or hash value. Hashing can be used for a variety of purposes, including data integrity checks and password storage.

Here's how to apply a hashing algorithm to a column in Soma:

  1. Click the dropdown button next to the column name that you want to apply the hashing algorithm to.

  2. Click Hash.

  3. Select the hashing algorithm you want to use from the list of options. Soma provides a wide variety of algorithms, including multiple variants of BLAKE2B, GOST3411, KECCAK, MD (Message Digest), RIPEMD (RIPE Message Digest), SHA (Secure Hash Algorithm), Skein, SM3, Tiger, and WHIRLPOOL.

  4. Optionally, choose to encode the message digest. If you choose to encode the hash values, Soma will encode them as hexadecimal strings with left-padded zeroes.

  5. Click Apply.

After you apply the hash directive to a String column, Soma replaces the original content of the column with the hash values. It does not create a new column for the hash values. Instead, the transformation is added as a new step in your recipe, and when you run your data axon, the hash operation will be applied to the values in your chosen column.

Hashing is a one-way operation – it's computationally infeasible to retrieve the original input from the hash value. Because of this, it's a common method for storing sensitive data, such as passwords, in a secure way.


Last updated