Execute Hive Job

Operation Name

Execute Hive Job

Execute a Hive job.

This operation uses WebHCat. For more details on API specifications, refer to the following page.

“WebHCat - Apache Hive - Apache Software Foundation”(https://cwiki.apache.org/confluence/display/Hive/WebHCat)

For details on use of variables, refer to "Variables".

Item name	Required/Optional	Use of variables	Description	Remarks
Name	Required	Not available	Enter the name to show on the script canvas.

Item name	Required/Optional	Use of variables	Description	Remarks
Cluster name	Required	Available	Enter a cluster name.
Cluster user name	Required	Available	Enter a cluster user name.
Cluster password	Required	Available	Enter the cluster password.
Query	Required	Available	Enter a query.	For details on query in Hive, refer to the following page. “LanguageManual - Apache Hive - Apache Software Foundation”(https://cwiki.apache.org/confluence/display/Hive/LanguageManual)
Output job status	Optional	Not available	Select whether or not to output job status. [Checked]: Job status will be output to container. [Not Checked]: (default) Job status will not be output.	Job status is output to the default container of cluster. When job status is output, the target virtual directory of the output destination can be got from the component variable statusdir_virtual_directory in the subsequent operations.
Wait for execution completion	Optional	Not available	Select whether to wait for completion of job execution or not. [Checked]: (default) Wait for completion of job execution. [Not checked]: Proceed to the subsequent operation without waiting after sending a request of executing job.

Item name	Required/Optional	Use of variables	Description	Remarks
Comment	Optional	Not available	You can write a short description of this adapter. The description will be reflected in the specifications.

None.

None.

Mass data processing is not supported.

PSP is not supported.

Component variable name	Description	Remarks
job_id	The executed job ID is stored.	Default value is null.
statusdir_virtual_directory	The virtual directory to output the job status is stored.	Default value is null. Available when [Output job status] is checked.
message_category	When an error occurs, category of the corresponding message code is stored.	Default value is null.
message_code	When an error occurs, code of the corresponding message code is stored.	Default value is null.
message_level	When an error occurs, severity of the corresponding message code is stored.	Default value is null.
error_type	When an error occurs, the error type is stored.	Default value is null. The error type would be in the following format. Example: “java.io.FileNotFoundException” The contents that will be stored may vary depending on DataSpider Servista versions.
error_message	When an error occurs, the error message is stored.	Default value is null. The contents that will be stored may vary depending on DataSpider Servista versions.
error_trace	When an error occurs, trace information of the error is stored.	Default value is null. The contents that will be stored may vary depending on DataSpider Servista versions and client applications.

Exception name	Cause	Solution
InvalidPropertyConfigurationException <property name> is not specified.	[<property name>] is not specified.	Specify [<property name>].
java.net.UnknownHostException	The cluster does not exist.	Check [Cluster name].
AzureHdinsightException jobId=<job ID>, status=<job status>	The job status is invalid.	Check the job contents.
AzureHdinsightException	Failed to execute a job.	Check the contents of the error message.

This specifications is subject to change without any prior notification depending on the changes in Azure HDInsight specifications.