Subscribe to Email Updates

    Workflow Security | 6 min read

    Data Integrity

    Nodeum provides data integrity validation in each workflow, the three main different mechanisms are available : checksums (e.g. CRC32), non-cryptographic hashes (e.g. xxHash) and cryptographic hashes (e.g. MD5). 

    Definition

    • MD5 : This hash algorithm is a commonly used function for validating data integrity.  An MD5 checksum is a 32 digit hexadecimal number that represents the hash of the contents of a file. The calculation of an MD5 is an industry standard so the integrity can be checked on any system.
    • xxHash : It is an Extremely fast Hash algorithm, the hash is faster to be generated. It is highly portable, and hashes are identical across all platforms (little / big endian). This is more and more popular in the video device, Nodeum uses the xxHash64be algorithm which is compatible with other product and software in the industries.

    Benefits

    This type of verification which compare checksum/hash is a great way to ensure that a file has not been changed during its life-cycle. It is always possible to do a calculation at the source and at the destination and compare both. Nodeum automates the calculation process in the workflow which improves the end-user experience and the time to operate.

    Furthermore the Integrity Checksum Analysis allows the verification of the integrity of the data processed by Nodeum against a reference list.

    How to proceed ?

    In each workflow definition, when defining a new task, browser to others options panel and you can activate the option you prefer : MD5-Hash or xxHash64.

    Compare the result with original Checksum

    Once your task ran, you can compare the final list Checksums with the original calculated Checksums.

    To do so, in the information of the task, under Processed filed, you can click on the  Button. This will import the checksum list into the Integrity Checksum Analsys feature.

    How to analyze the Integrity Checksum ?

    When you launched the Check Integrity utility, the following screen will be displayed:

    You can now upload your reference Checksum report(s) into the utility.

    To do so, click the button and select the report files.

    ❗❗ Information about Reference Checksum Report Format.

    The reference MUST

    • be in CSV format
    • comprise only one column containing solely the file path
    • comprise only one column containing solely the checksum
    • have the same checksum format as configured in this task
    • be consistent

    The reference MAY

    • be split among multiple files
    • contains header or footer lines
    • be in random order
    • be based on other root path

    Once your Reference Checksum report is uploaded, Nodeum allows to discard lines that don't contain relevant information.

    Once you have selected the line to discard, click on  to go to the next step.

    Associate the Reference Root Directory with the Handled root directory:

    Once done, you can click on  to have view the result of the analysis.

    The analysis report can be downloaded as well via the  button.

    Related Categories

    Workflow Security

    You may also like:

    Workflow Tape Library management

    Tape Rehydratation

    The tape rehydration process is design to optimize the space on a tape. When files are deleted from a tape, they are not...

    Workflow Best Practice

    Best Practice - Workflow Multi copies on tape

    HOW TO PROCEED TO IMPLEMENT SPECIFIC WORKFLOWS WHEN DEALING WITH MULTI TAPE COPIES ? In this following blog, we are look...

    Workflow Settings Best Practice

    Primary Storage

    Definition Primary storage is the type of storage and technologies used to store and retain digital information that is ...

    Let Us Know What You Thought about this Post.

    Put your Comment Below.

    Learn and grow with award-winning support and a thriving community behind you.

    Get the free version