Execute Hive Job

Operation name

Execute Hive Job

Function overview

Executes a Hive job.

= Remarks =

This operation uses WebHCat. For more details on API specifications, refer to the following page:

  • "WebHCat - Apache Hive - Apache Software Foundation" (https://cwiki.apache.org/confluence/display/Hive/WebHCat)

Data model

The data model of this component is table model type.

Properties

= Remarks =

For details on use of variables, refer to Variables.

Basic settings

Item name

Required/Optional

Use of variables

Description

Remarks

Name

Required

Not available

Enter a name that is used on the script canvas.

 

Required settings

Item name

Required/Optional

Use of variables

Description

Remarks

Cluster name

Required

Available

Enter a cluster name.

 

Cluster user name

Required

Available

Enter the cluster user name.

 

Cluster password

Required

Available

Enter the cluster password.

 

Query

Required

Available

Enter a query.

For more details about Hive queries, refer to the following page:

  • "LanguageManual - Apache Hive - Apache Software Foundation" (https://cwiki.apache.org/confluence/display/Hive/LanguageManual)

Output job status

Optional

Not available

Select whether or not to output the job status.

Selected

The job status is output to the container.

Not selected

(Default)

The job status isn't output.

  • The job status is output to the default container for the cluster.

  • When outputting the job status, you can get the virtual directory of the output destination with the component variable statusdir_virtual_directory in subsequent operations.

Wait for execution completion

Optional

Not available

Select whether to wait for completion of job execution to execute an operation.

Selected

(Default)

Wait for completion of job execution to execute an operation.

Not selected

Proceed to the subsequent operation without waiting after a job execution request is sent.

 

Comment

Item name

Required/Optional

Use of variables

Description

Remarks

Comment

Optional

Not available

You can write a short description of this connector.

 

Schemas

Input schema

None.

Output schema

None.

Loading schema in Mapper

The schema is loaded automatically.

= Remarks =

For details, refer to Edit Schema.

Mass data processing

Mass data processing isn't supported.

Parallel Stream Processing

PSP isn't supported.

Available component variables

Component variable name

Description

Remarks

job_id

The ID of the executed job is stored.

  • The default value is null.

statusdir_virtual_directory

The virtual directory to which the job status is output is stored.

  • The default value is null.

  • It can be obtained if you select Output job status.

message_category

When an error occurs, the category of the message code corresponding to the error is stored.

  • The default value is null.

message_code

When an error occurs, the code of the message code corresponding to the error is stored.

  • The default value is null.

message_level

When an error occurs, the severity of the message code corresponding to the error is stored.

  • The default value is null.

error_type

When an error occurs, the error type is stored.

  • The default value is null.

  • The format of the error type is as follows.

    Example: java.io.FileNotFoundException

error_message

When an error occurs, the error message is stored.

  • The default value is null.

error_trace

When an error occurs, the trace information for the error is stored.

  • The default value is null.

Message codes, exception messages, and limitations

Connector

Message code

Exception message

Limitations

Messages and limitations of the Azure HDInsight connector

check

check

check