If Data quality studio is installed on the same environment as Data entry workflow, you can apply a fuzzy duplicate check in Data entry workflow.
For each data entry workflow template, you can define a fuzzy duplicate check step. Use a fuzzy duplicate check step to automatically check if the data, as entered in the previous workflow tasks, is a possible duplicate record.

You can use these dependency types for fuzzy duplicate check steps:

  • Step: Adds a step next to a selected step in the workflow.
  • Parallel step: Adds a parallel step to a selected step in the workflow. You can add a parallel step to any step, except for the first step.
  • Group subsequent step: Adds a step that brings together parallel steps in one next step. You can only add a group step if at least one parallel step is defined for the workflow template. For a group step, you must select the steps that are brought together in the group step.


Standard procedure

1. Go to Data entry workflow > Design > Data entry workflow templates.
2. In the list, click the link in the desired data entry workflow template.
3. Click Edit.
 

Note: Make sure the Workflow diagram section is in edit mode.

4. In the Workflow diagram section, click Add.
 

Note: If you want to add a step next to or parallel to another step, first select the desired step in the workflow diagram.

5. Click the desired dependency type.
6. In the Step name field, type a value.
7. In the Description field, type a value.
8. In the Type field, select 'Fuzzy duplicate check'.
9. During workflow execution, all entered data is stored in a staging table. When the workflow is finished, the entered data is copied from the staging table to the target company tables.
However, for each step, you can indicate if the entered data must already be copied from the staging table to the target company tables when the step is finished.
  Select Yes in the Transfer to target field.
10. Select the fuzzy duplicate check to be done for the step.
  In the Fuzzy duplicate check field, enter or select a value.
 

Note: You select the fuzzy duplicate check from Data quality studio.

11. You can set a time limit within which the workflow step must be done when possible duplicates are found.
Based on the time limit and the calendar, which is defined in the Data entry workflow parameters, the elapsed time is calculated for the workflow tasks.
  In the Time unit field, select an option.
 

Note:

  • Usually, you don't set a time limit for the initial step.
  • You can use custom alerts to notify when a time limit is due. You can do so based on the elapsed time of a workflow task.

12. In the Time length field, enter the number of time units for the time limit.
13. You can enter the desired instructions for the workflow step.
The header instructions are shown at the top of the workflow task page.
  In the Header instructions field, type a value.
14. You can enter the desired additional instructions for the workflow step. For example, notes or checks to be done before finishing the workflow task.
The footer instructions are shown at the bottom of the workflow task page.
  In the Footer instructions field, type a value.
15. Sub-task: Select previous steps for 'Group subsequent step'.
  15.1 If you create a 'Group subsequent step', you must select at least two steps that must have the group step as next step.
  Expand the Previous steps section.
  15.2 In the list, find and select the desired previous steps.
16. Click OK.

Notes

Do not set up a fuzzy duplicate step as the first step of a data entry workflow. Make sure to have at least one data entry step before a fuzzy duplicate check step.

See also

Provide feedback