Execute Hive Job

Operation Name

Execute Hive Job

Function Overview

Execute a Hive job.

This operation uses WebHCat. For more details on API specifications, refer to the following page.

Properties

For details on use of variables, refer to "Variables".
Basic settings
Item name Required/Optional Use of variables Description Remarks
Name Required Not available Enter the name to show on the script canvas.  
Required settings
Item name Required/Optional Use of variables Description Remarks
Cluster name Required Available Enter a cluster name.  
Cluster user name Required Available Enter a cluster user name.  
Cluster password Required Available Enter the cluster password.  
Query Required Available Enter a query.
  • For details on query in Hive, refer to the following page.
    • “LanguageManual - Apache Hive - Apache Software Foundation”(https://cwiki.apache.org/confluence/display/Hive/LanguageManual)
Output job status Optional Not available Select whether or not to output job status.
  • [Checked]:
    Job status will be output to container.
  • [Not Checked]: (default)
    Job status will not be output.
  • Job status is output to the default container of cluster.
  • When job status is output, the target virtual directory of the output destination can be got from the component variable statusdir_virtual_directory in the subsequent operations.
Wait for execution completion Optional Not available Select whether to wait for completion of job execution or not.
  • [Checked]: (default)
    Wait for completion of job execution.
  • [Not checked]:
    Proceed to the subsequent operation without waiting after sending a request of executing job.
 
Comment
Item name Required/Optional Use of variables Description Remarks
Comment Optional Not available You can write a short description of this adapter.
The description will be reflected in the specifications.
 

Schema

Input schema

None.

Output schema

None.

Mass Data Processing

Mass data processing is not supported.

PSP Usage

PSP is not supported.

Available Component Variables

Component variable name Description Remarks
job_id The executed job ID is stored.
  • Default value is null.
statusdir_virtual_directory The virtual directory to output the job status is stored.
  • Default value is null.
  • Available when [Output job status] is checked.
message_category When an error occurs, category of the corresponding message code is stored.
  • Default value is null.
message_code When an error occurs, code of the corresponding message code is stored.
  • Default value is null.
message_level When an error occurs, severity of the corresponding message code is stored.
  • Default value is null.
error_type When an error occurs, the error type is stored.
  • Default value is null.
  • The error type would be in the following format.
    Example: “java.io.FileNotFoundException”
  • The contents that will be stored may vary depending on DataSpider Servista versions.
error_message When an error occurs, the error message is stored.
  • Default value is null.
  • The contents that will be stored may vary depending on DataSpider Servista versions.
error_trace When an error occurs, trace information of the error is stored.
  • Default value is null.
  • The contents that will be stored may vary depending on DataSpider Servista versions and client applications.

Specification Limits

Major Exceptions

Exception name Cause Solution
InvalidPropertyConfigurationException
<property name> is not specified.
[<property name>] is not specified. Specify [<property name>].
java.net.UnknownHostException The cluster does not exist. Check [Cluster name].
AzureHdinsightException
jobId=<job ID>, status=<job status>
The job status is invalid. Check the job contents.
AzureHdinsightException Failed to execute a job. Check the contents of the error message.

Notes