Skip to main content Skip to complementary content

Deduplicating the email addresses

Use the procedure below to add and configure a tUniqRow component that will be used to identify duplicate email addresses.

Procedure

  1. Next the tFileInputDelimited component, add a new addComponent {} function and its sub-function setComponentDefinition {} to add the tUniqRow component.
    addComponent {
    	setComponentDefinition {
    		TYPE: "tUniqRow",
    		NAME: "tUniqRow_1",
    		POSITION: 416, 192
    	}
    
    }
  2. Next to the setComponentDefinition {} function, enter the setSettings {} function to define the deduplication process and label the component.
    	setSettings {
    		UNIQUE_KEY {
    			SCHEMA_COLUMN : "email",
    			KEY_ATTRIBUTE : "true"
    		},
    		LABEL : "deduplicate"
    	}
  3. Next to the setSettings {} function, enter two addSchema {} functions to define the schemas for the output flows, one for the unique emails addresses and the other for the duplicates.
    	addSchema {
    		NAME: "UNIQUE",
    		CONNECTOR: "UNIQUE"
    		addColumn {
    			NAME: "email",
    			TYPE: "id_String"
    		}
    	}
    	addSchema {
    		NAME: "DUPLICATE",
    		CONNECTOR: "DUPLICATE"
    		addColumn {
    			NAME: "email",
    			TYPE: "id_String"
    		}
    	}

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – let us know how we can improve!