Explore Data Integration

Data integration is the process of extracting and combining data from various sources, changing its format, and storing the data in destinations.

In HULFT Square, you can handle various input and output data from different resources via connectors even without the deep knowledge of the specification of each resource. The following diagram depicts how data integration is realized in HULFT Square.

Figure 1.1 Data integration overview

Designer

Designer enables you to develop, run, and debug ETL scripts graphically. No coding is required to create the scripts. You can set up data connections and data integration processes by simply dragging icons and setting configurations.

To launch Designer, select the Open project icon for the target project on the Projects page.

Figure 1.2 Designer

Operations

Operations are units that perform processing such as reading, converting, and writing data. Each operation on Designer is displayed as an icon (component icon). For example, an operation to read data from a CSV file is called the Read CSV File operation and is represented by the component icon csv_read.

Components

A component represents an operation.

Components are categorized into the following types:

  • Connector

  • Converter

  • Script component

In the Tool palette in Designer, components are displayed as icons.

Figure 1.3 Components

Connectors

A connector is a component to read and write data mainly from external systems such as databases. The following categories of connectors are available:

  • Basic

  • Database

  • File

  • Application

  • Network

  • Cloud

  • Cryptography

  • HULFT

Connectors are displayed in the lower part of the Tool palette in Designer.

Figure 1.4 Connectors

Some of the connectors may require additional licenses in order to use them.

= Remarks =

Connectors are known as adapters in DataSpider Servista.

Converters

A converter is a component for converting variables and result data read by a connector.

The following types of converters are available:

  • Mapper

  • Character

  • XSLT

In the Tool palette in Designer, converters are displayed in the Basic and Conversion categories.

Figure 1.5 Converters

Mappers

A Mapper is a component that converts or modifies data read by specific components and writes the data to other components or assigns the data to variables.

The following types of Mappers are available:

In the Tool palette in Designer, the three Mappers are displayed in the Basic and Conversion categories.

Figure 1.6 Mappers

The Mapper editor is used to set up Mappers.

Script components

Script components represent the functions required to create scripts, such as controlling flows, calling other scripts, and creating memos.

In the Tool palette of Designer, they belong to the Basic category and are classified into the Processing, Flow, or Others subcategories.

Figure 1.7 Script components

Flow

For scripts, there are process flows, which show the flow of the processing, and data flows, which show the flow of data. Process flows and data flows are collectively called "flows."

In HULFT Square, flows are divided into process flows and data flows. This allows you to perform processing once, but use the processing results in the subsequent component repeatedly.

To create a flow, first select a component icon of the component to be the starting point, then drag it to the component icon of the component to which you want to connect it.

Process flows

A process flow shows the flow of processing. A script processes in the order the components were connected by the process flow.

A process flow begins with a start component, and ends with an end or break component.

Data flows

A data flow shows the flow of data.

You can draw a data flow from a read-type connector or converter to a write-type connector or converter.

Result data

The result of the processing performed by components is called the "result data."

Result data is primarily generated by read-type connectors and converters. Write-type connectors and converters handle the result data as input data.

Input/Output

"Input/Output" refers to the script arguments (script input variables) and return values (script output variables). Input/Output is achieved by adding attributes for input or output to the script variables defined in the script.

HULFT Square scripts can use multiple arguments, and return multiple return values.

Execution ID

Scripts are assigned a unique ID within HULFT Square (one per script). This is called the "Execution ID."

The execution ID is necessary to distinguish each script in the execution log view of Designer and the event log of Monitoring.