Base64-encoded data in JSON

JSON data can contain base64-encoded data within certain attributes. For example:

Json
Copy

Deep CDR can be configured to perform an in-depth inspection of such data by:

  • Decoding base64-encoded content.
  • Recursively sanitizing the extracted data.
  • Re-encoding the sanitized data in base64.
  • Reinserting the processed data back into the JSON structure.

Configuration

This feature can be enabled at Workflow Management> Workflows > [Workflow name] > Deep CDR > Advanced configuration > File type handling > Others > JSON > Process Base64 encoded data.

  • Data following "data" URL scheme: Base64 encoded data that start with "data" URL scheme, e.g., .
  • Data at certain JSON path: Regular expressions of JSON path to fields to be processed, e.g., user..*Picture$.
  • Action on data when sanitization not applied: Choose what action should be applied to data when sanitization is not supported or fails

Regular expression examples

Below are some examples on regular expressions and corresponding JSON nodes from an example JSON that match with the expressions.

JSON
Copy
ExpressionJSON node at path that matches
data\.image1data.image1
data\.images\.0data.images[0]
data\.images\..*data.images[0], data.images[1]
data\.image.*data.image1, data.image2, data.images[0], data.images[1]

Node: item index in a JSON array starts from 0.

Type to search, ESC to discard
Type to search, ESC to discard
Type to search, ESC to discard