All Downloads are FREE. Search and download functionalities are using the official Maven repository.

com.amazonaws.services.datapipeline.DataPipeline Maven / Gradle / Ivy

Go to download

The AWS Java SDK for AWS Data Pipeline module holds the client classes that are used for communicating with AWS Data Pipeline Service

There is a newer version: 1.11.7
Show newest version
/*
 * Copyright 2011-2016 Amazon.com, Inc. or its affiliates. All Rights Reserved.
 * 
 * Licensed under the Apache License, Version 2.0 (the "License"). You may not
 * use this file except in compliance with the License. A copy of the License is
 * located at
 * 
 * http://aws.amazon.com/apache2.0
 * 
 * or in the "license" file accompanying this file. This file is distributed on
 * an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either
 * express or implied. See the License for the specific language governing
 * permissions and limitations under the License.
 */
package com.amazonaws.services.datapipeline;

import com.amazonaws.*;
import com.amazonaws.regions.*;

import com.amazonaws.services.datapipeline.model.*;

/**
 * Interface for accessing AWS Data Pipeline.
 * 

*

* AWS Data Pipeline configures and manages a data-driven workflow called a * pipeline. AWS Data Pipeline handles the details of scheduling and ensuring * that data dependencies are met so that your application can focus on * processing the data. *

*

* AWS Data Pipeline provides a JAR implementation of a task runner called AWS * Data Pipeline Task Runner. AWS Data Pipeline Task Runner provides logic for * common data management scenarios, such as performing database queries and * running data analysis using Amazon Elastic MapReduce (Amazon EMR). You can * use AWS Data Pipeline Task Runner as your task runner, or you can write your * own task runner to provide custom data management. *

*

* AWS Data Pipeline implements two main sets of functionality. Use the first * set to create a pipeline and define data sources, schedules, dependencies, * and the transforms to be performed on the data. Use the second set in your * task runner application to receive the next task ready for processing. The * logic for performing the task, such as querying the data, running data * analysis, or converting the data from one format to another, is contained * within the task runner. The task runner performs the task assigned to it by * the web service, reporting progress to the web service as it does so. When * the task is done, the task runner reports the final success or failure of the * task to the web service. *

*/ public interface DataPipeline { /** * The region metadata service name for computing region endpoints. You can * use this value to retrieve metadata (such as supported regions) of the * service. * * @see RegionUtils#getRegionsForService(String) */ String ENDPOINT_PREFIX = "datapipeline"; /** * Overrides the default endpoint for this client * ("datapipeline.us-east-1.amazonaws.com"). Callers can use this method to * control which AWS region they want to work with. *

* Callers can pass in just the endpoint (ex: * "datapipeline.us-east-1.amazonaws.com") or a full URL, including the * protocol (ex: "datapipeline.us-east-1.amazonaws.com"). If the protocol is * not specified here, the default protocol from this client's * {@link ClientConfiguration} will be used, which by default is HTTPS. *

* For more information on using AWS regions with the AWS SDK for Java, and * a complete list of all available endpoints for all AWS services, see: http://developer.amazonwebservices.com/connect/entry.jspa?externalID= * 3912 *

* This method is not threadsafe. An endpoint should be configured when * the client is created and before any service requests are made. Changing * it afterwards creates inevitable race conditions for any service requests * in transit or retrying. * * @param endpoint * The endpoint (ex: "datapipeline.us-east-1.amazonaws.com") or a * full URL, including the protocol (ex: * "datapipeline.us-east-1.amazonaws.com") of the region specific AWS * endpoint this client will communicate with. */ void setEndpoint(String endpoint); /** * An alternative to {@link DataPipeline#setEndpoint(String)}, sets the * regional endpoint for this client's service calls. Callers can use this * method to control which AWS region they want to work with. *

* By default, all service endpoints in all regions use the https protocol. * To use http instead, specify it in the {@link ClientConfiguration} * supplied at construction. *

* This method is not threadsafe. A region should be configured when the * client is created and before any service requests are made. Changing it * afterwards creates inevitable race conditions for any service requests in * transit or retrying. * * @param region * The region this client will communicate with. See * {@link Region#getRegion(com.amazonaws.regions.Regions)} for * accessing a given region. Must not be null and must be a region * where the service is available. * * @see Region#getRegion(com.amazonaws.regions.Regions) * @see Region#createClient(Class, * com.amazonaws.auth.AWSCredentialsProvider, ClientConfiguration) * @see Region#isServiceSupported(String) */ void setRegion(Region region); /** *

* Validates the specified pipeline and starts processing pipeline tasks. If * the pipeline does not pass validation, activation fails. *

*

* If you need to pause the pipeline to investigate an issue with a * component, such as a data source or script, call * DeactivatePipeline. *

*

* To activate a finished pipeline, modify the end date for the pipeline and * then activate it. *

* * @param activatePipelineRequest * Contains the parameters for ActivatePipeline. * @return Result of the ActivatePipeline operation returned by the service. * @throws PipelineNotFoundException * The specified pipeline was not found. Verify that you used the * correct user and account identifiers. * @throws PipelineDeletedException * The specified pipeline has been deleted. * @throws InternalServiceErrorException * An internal service error occurred. * @throws InvalidRequestException * The request was not valid. Verify that your request was properly * formatted, that the signature was generated with the correct * credentials, and that you haven't exceeded any of the service * limits for your account. * @sample DataPipeline.ActivatePipeline */ ActivatePipelineResult activatePipeline( ActivatePipelineRequest activatePipelineRequest); /** *

* Adds or modifies tags for the specified pipeline. *

* * @param addTagsRequest * Contains the parameters for AddTags. * @return Result of the AddTags operation returned by the service. * @throws InternalServiceErrorException * An internal service error occurred. * @throws InvalidRequestException * The request was not valid. Verify that your request was properly * formatted, that the signature was generated with the correct * credentials, and that you haven't exceeded any of the service * limits for your account. * @throws PipelineNotFoundException * The specified pipeline was not found. Verify that you used the * correct user and account identifiers. * @throws PipelineDeletedException * The specified pipeline has been deleted. * @sample DataPipeline.AddTags */ AddTagsResult addTags(AddTagsRequest addTagsRequest); /** *

* Creates a new, empty pipeline. Use PutPipelineDefinition to * populate the pipeline. *

* * @param createPipelineRequest * Contains the parameters for CreatePipeline. * @return Result of the CreatePipeline operation returned by the service. * @throws InternalServiceErrorException * An internal service error occurred. * @throws InvalidRequestException * The request was not valid. Verify that your request was properly * formatted, that the signature was generated with the correct * credentials, and that you haven't exceeded any of the service * limits for your account. * @sample DataPipeline.CreatePipeline */ CreatePipelineResult createPipeline( CreatePipelineRequest createPipelineRequest); /** *

* Deactivates the specified running pipeline. The pipeline is set to the * DEACTIVATING state until the deactivation process completes. *

*

* To resume a deactivated pipeline, use ActivatePipeline. By * default, the pipeline resumes from the last completed execution. * Optionally, you can specify the date and time to resume the pipeline. *

* * @param deactivatePipelineRequest * Contains the parameters for DeactivatePipeline. * @return Result of the DeactivatePipeline operation returned by the * service. * @throws PipelineNotFoundException * The specified pipeline was not found. Verify that you used the * correct user and account identifiers. * @throws PipelineDeletedException * The specified pipeline has been deleted. * @throws InternalServiceErrorException * An internal service error occurred. * @throws InvalidRequestException * The request was not valid. Verify that your request was properly * formatted, that the signature was generated with the correct * credentials, and that you haven't exceeded any of the service * limits for your account. * @sample DataPipeline.DeactivatePipeline */ DeactivatePipelineResult deactivatePipeline( DeactivatePipelineRequest deactivatePipelineRequest); /** *

* Deletes a pipeline, its pipeline definition, and its run history. AWS * Data Pipeline attempts to cancel instances associated with the pipeline * that are currently being processed by task runners. *

*

* Deleting a pipeline cannot be undone. You cannot query or restore a * deleted pipeline. To temporarily pause a pipeline instead of deleting it, * call SetStatus with the status set to PAUSE on * individual components. Components that are paused by SetStatus can * be resumed. *

* * @param deletePipelineRequest * Contains the parameters for DeletePipeline. * @return Result of the DeletePipeline operation returned by the service. * @throws PipelineNotFoundException * The specified pipeline was not found. Verify that you used the * correct user and account identifiers. * @throws InternalServiceErrorException * An internal service error occurred. * @throws InvalidRequestException * The request was not valid. Verify that your request was properly * formatted, that the signature was generated with the correct * credentials, and that you haven't exceeded any of the service * limits for your account. * @sample DataPipeline.DeletePipeline */ DeletePipelineResult deletePipeline( DeletePipelineRequest deletePipelineRequest); /** *

* Gets the object definitions for a set of objects associated with the * pipeline. Object definitions are composed of a set of fields that define * the properties of the object. *

* * @param describeObjectsRequest * Contains the parameters for DescribeObjects. * @return Result of the DescribeObjects operation returned by the service. * @throws InternalServiceErrorException * An internal service error occurred. * @throws InvalidRequestException * The request was not valid. Verify that your request was properly * formatted, that the signature was generated with the correct * credentials, and that you haven't exceeded any of the service * limits for your account. * @throws PipelineNotFoundException * The specified pipeline was not found. Verify that you used the * correct user and account identifiers. * @throws PipelineDeletedException * The specified pipeline has been deleted. * @sample DataPipeline.DescribeObjects */ DescribeObjectsResult describeObjects( DescribeObjectsRequest describeObjectsRequest); /** *

* Retrieves metadata about one or more pipelines. The information retrieved * includes the name of the pipeline, the pipeline identifier, its current * state, and the user account that owns the pipeline. Using account * credentials, you can retrieve metadata about pipelines that you or your * IAM users have created. If you are using an IAM user account, you can * retrieve metadata about only those pipelines for which you have read * permissions. *

*

* To retrieve the full pipeline definition instead of metadata about the * pipeline, call GetPipelineDefinition. *

* * @param describePipelinesRequest * Contains the parameters for DescribePipelines. * @return Result of the DescribePipelines operation returned by the * service. * @throws PipelineNotFoundException * The specified pipeline was not found. Verify that you used the * correct user and account identifiers. * @throws PipelineDeletedException * The specified pipeline has been deleted. * @throws InternalServiceErrorException * An internal service error occurred. * @throws InvalidRequestException * The request was not valid. Verify that your request was properly * formatted, that the signature was generated with the correct * credentials, and that you haven't exceeded any of the service * limits for your account. * @sample DataPipeline.DescribePipelines */ DescribePipelinesResult describePipelines( DescribePipelinesRequest describePipelinesRequest); /** *

* Task runners call EvaluateExpression to evaluate a string in * the context of the specified object. For example, a task runner can * evaluate SQL queries stored in Amazon S3. *

* * @param evaluateExpressionRequest * Contains the parameters for EvaluateExpression. * @return Result of the EvaluateExpression operation returned by the * service. * @throws InternalServiceErrorException * An internal service error occurred. * @throws TaskNotFoundException * The specified task was not found. * @throws InvalidRequestException * The request was not valid. Verify that your request was properly * formatted, that the signature was generated with the correct * credentials, and that you haven't exceeded any of the service * limits for your account. * @throws PipelineNotFoundException * The specified pipeline was not found. Verify that you used the * correct user and account identifiers. * @throws PipelineDeletedException * The specified pipeline has been deleted. * @sample DataPipeline.EvaluateExpression */ EvaluateExpressionResult evaluateExpression( EvaluateExpressionRequest evaluateExpressionRequest); /** *

* Gets the definition of the specified pipeline. You can call * GetPipelineDefinition to retrieve the pipeline definition * that you provided using PutPipelineDefinition. *

* * @param getPipelineDefinitionRequest * Contains the parameters for GetPipelineDefinition. * @return Result of the GetPipelineDefinition operation returned by the * service. * @throws InternalServiceErrorException * An internal service error occurred. * @throws InvalidRequestException * The request was not valid. Verify that your request was properly * formatted, that the signature was generated with the correct * credentials, and that you haven't exceeded any of the service * limits for your account. * @throws PipelineNotFoundException * The specified pipeline was not found. Verify that you used the * correct user and account identifiers. * @throws PipelineDeletedException * The specified pipeline has been deleted. * @sample DataPipeline.GetPipelineDefinition */ GetPipelineDefinitionResult getPipelineDefinition( GetPipelineDefinitionRequest getPipelineDefinitionRequest); /** *

* Lists the pipeline identifiers for all active pipelines that you have * permission to access. *

* * @param listPipelinesRequest * Contains the parameters for ListPipelines. * @return Result of the ListPipelines operation returned by the service. * @throws InternalServiceErrorException * An internal service error occurred. * @throws InvalidRequestException * The request was not valid. Verify that your request was properly * formatted, that the signature was generated with the correct * credentials, and that you haven't exceeded any of the service * limits for your account. * @sample DataPipeline.ListPipelines */ ListPipelinesResult listPipelines(ListPipelinesRequest listPipelinesRequest); /** * Simplified method form for invoking the ListPipelines operation. * * @see #listPipelines(ListPipelinesRequest) */ ListPipelinesResult listPipelines(); /** *

* Task runners call PollForTask to receive a task to perform * from AWS Data Pipeline. The task runner specifies which tasks it can * perform by setting a value for the workerGroup parameter. * The task returned can come from any of the pipelines that match the * workerGroup value passed in by the task runner and that was * launched using the IAM user credentials specified by the task runner. *

*

* If tasks are ready in the work queue, PollForTask returns a * response immediately. If no tasks are available in the queue, * PollForTask uses long-polling and holds on to a poll * connection for up to a 90 seconds, during which time the first newly * scheduled task is handed to the task runner. To accomodate this, set the * socket timeout in your task runner to 90 seconds. The task runner should * not call PollForTask again on the same * workerGroup until it receives a response, and this can take * up to 90 seconds. *

* * @param pollForTaskRequest * Contains the parameters for PollForTask. * @return Result of the PollForTask operation returned by the service. * @throws InternalServiceErrorException * An internal service error occurred. * @throws InvalidRequestException * The request was not valid. Verify that your request was properly * formatted, that the signature was generated with the correct * credentials, and that you haven't exceeded any of the service * limits for your account. * @throws TaskNotFoundException * The specified task was not found. * @sample DataPipeline.PollForTask */ PollForTaskResult pollForTask(PollForTaskRequest pollForTaskRequest); /** *

* Adds tasks, schedules, and preconditions to the specified pipeline. You * can use PutPipelineDefinition to populate a new pipeline. *

*

* PutPipelineDefinition also validates the configuration as it * adds it to the pipeline. Changes to the pipeline are saved unless one of * the following three validation errors exists in the pipeline. *

*
    *
  1. An object is missing a name or identifier field.
  2. *
  3. A string or reference field is empty.
  4. *
  5. The number of objects in the pipeline exceeds the maximum allowed * objects.
  6. *
  7. The pipeline is in a FINISHED state.
  8. *
*

* Pipeline object definitions are passed to the * PutPipelineDefinition action and returned by the * GetPipelineDefinition action. *

* * @param putPipelineDefinitionRequest * Contains the parameters for PutPipelineDefinition. * @return Result of the PutPipelineDefinition operation returned by the * service. * @throws InternalServiceErrorException * An internal service error occurred. * @throws InvalidRequestException * The request was not valid. Verify that your request was properly * formatted, that the signature was generated with the correct * credentials, and that you haven't exceeded any of the service * limits for your account. * @throws PipelineNotFoundException * The specified pipeline was not found. Verify that you used the * correct user and account identifiers. * @throws PipelineDeletedException * The specified pipeline has been deleted. * @sample DataPipeline.PutPipelineDefinition */ PutPipelineDefinitionResult putPipelineDefinition( PutPipelineDefinitionRequest putPipelineDefinitionRequest); /** *

* Queries the specified pipeline for the names of objects that match the * specified set of conditions. *

* * @param queryObjectsRequest * Contains the parameters for QueryObjects. * @return Result of the QueryObjects operation returned by the service. * @throws PipelineNotFoundException * The specified pipeline was not found. Verify that you used the * correct user and account identifiers. * @throws PipelineDeletedException * The specified pipeline has been deleted. * @throws InternalServiceErrorException * An internal service error occurred. * @throws InvalidRequestException * The request was not valid. Verify that your request was properly * formatted, that the signature was generated with the correct * credentials, and that you haven't exceeded any of the service * limits for your account. * @sample DataPipeline.QueryObjects */ QueryObjectsResult queryObjects(QueryObjectsRequest queryObjectsRequest); /** *

* Removes existing tags from the specified pipeline. *

* * @param removeTagsRequest * Contains the parameters for RemoveTags. * @return Result of the RemoveTags operation returned by the service. * @throws InternalServiceErrorException * An internal service error occurred. * @throws InvalidRequestException * The request was not valid. Verify that your request was properly * formatted, that the signature was generated with the correct * credentials, and that you haven't exceeded any of the service * limits for your account. * @throws PipelineNotFoundException * The specified pipeline was not found. Verify that you used the * correct user and account identifiers. * @throws PipelineDeletedException * The specified pipeline has been deleted. * @sample DataPipeline.RemoveTags */ RemoveTagsResult removeTags(RemoveTagsRequest removeTagsRequest); /** *

* Task runners call ReportTaskProgress when assigned a task to * acknowledge that it has the task. If the web service does not receive * this acknowledgement within 2 minutes, it assigns the task in a * subsequent PollForTask call. After this initial acknowledgement, * the task runner only needs to report progress every 15 minutes to * maintain its ownership of the task. You can change this reporting time * from 15 minutes by specifying a reportProgressTimeout field * in your pipeline. *

*

* If a task runner does not report its status after 5 minutes, AWS Data * Pipeline assumes that the task runner is unable to process the task and * reassigns the task in a subsequent response to PollForTask. Task * runners should call ReportTaskProgress every 60 seconds. *

* * @param reportTaskProgressRequest * Contains the parameters for ReportTaskProgress. * @return Result of the ReportTaskProgress operation returned by the * service. * @throws InternalServiceErrorException * An internal service error occurred. * @throws InvalidRequestException * The request was not valid. Verify that your request was properly * formatted, that the signature was generated with the correct * credentials, and that you haven't exceeded any of the service * limits for your account. * @throws TaskNotFoundException * The specified task was not found. * @throws PipelineNotFoundException * The specified pipeline was not found. Verify that you used the * correct user and account identifiers. * @throws PipelineDeletedException * The specified pipeline has been deleted. * @sample DataPipeline.ReportTaskProgress */ ReportTaskProgressResult reportTaskProgress( ReportTaskProgressRequest reportTaskProgressRequest); /** *

* Task runners call ReportTaskRunnerHeartbeat every 15 minutes * to indicate that they are operational. If the AWS Data Pipeline Task * Runner is launched on a resource managed by AWS Data Pipeline, the web * service can use this call to detect when the task runner application has * failed and restart a new instance. *

* * @param reportTaskRunnerHeartbeatRequest * Contains the parameters for ReportTaskRunnerHeartbeat. * @return Result of the ReportTaskRunnerHeartbeat operation returned by the * service. * @throws InternalServiceErrorException * An internal service error occurred. * @throws InvalidRequestException * The request was not valid. Verify that your request was properly * formatted, that the signature was generated with the correct * credentials, and that you haven't exceeded any of the service * limits for your account. * @sample DataPipeline.ReportTaskRunnerHeartbeat */ ReportTaskRunnerHeartbeatResult reportTaskRunnerHeartbeat( ReportTaskRunnerHeartbeatRequest reportTaskRunnerHeartbeatRequest); /** *

* Requests that the status of the specified physical or logical pipeline * objects be updated in the specified pipeline. This update might not occur * immediately, but is eventually consistent. The status that can be set * depends on the type of object (for example, DataNode or Activity). You * cannot perform this operation on FINISHED pipelines and * attempting to do so returns InvalidRequestException. *

* * @param setStatusRequest * Contains the parameters for SetStatus. * @return Result of the SetStatus operation returned by the service. * @throws PipelineNotFoundException * The specified pipeline was not found. Verify that you used the * correct user and account identifiers. * @throws PipelineDeletedException * The specified pipeline has been deleted. * @throws InternalServiceErrorException * An internal service error occurred. * @throws InvalidRequestException * The request was not valid. Verify that your request was properly * formatted, that the signature was generated with the correct * credentials, and that you haven't exceeded any of the service * limits for your account. * @sample DataPipeline.SetStatus */ SetStatusResult setStatus(SetStatusRequest setStatusRequest); /** *

* Task runners call SetTaskStatus to notify AWS Data Pipeline * that a task is completed and provide information about the final status. * A task runner makes this call regardless of whether the task was * sucessful. A task runner does not need to call SetTaskStatus * for tasks that are canceled by the web service during a call to * ReportTaskProgress. *

* * @param setTaskStatusRequest * Contains the parameters for SetTaskStatus. * @return Result of the SetTaskStatus operation returned by the service. * @throws InternalServiceErrorException * An internal service error occurred. * @throws TaskNotFoundException * The specified task was not found. * @throws InvalidRequestException * The request was not valid. Verify that your request was properly * formatted, that the signature was generated with the correct * credentials, and that you haven't exceeded any of the service * limits for your account. * @throws PipelineNotFoundException * The specified pipeline was not found. Verify that you used the * correct user and account identifiers. * @throws PipelineDeletedException * The specified pipeline has been deleted. * @sample DataPipeline.SetTaskStatus */ SetTaskStatusResult setTaskStatus(SetTaskStatusRequest setTaskStatusRequest); /** *

* Validates the specified pipeline definition to ensure that it is well * formed and can be run without error. *

* * @param validatePipelineDefinitionRequest * Contains the parameters for ValidatePipelineDefinition. * @return Result of the ValidatePipelineDefinition operation returned by * the service. * @throws InternalServiceErrorException * An internal service error occurred. * @throws InvalidRequestException * The request was not valid. Verify that your request was properly * formatted, that the signature was generated with the correct * credentials, and that you haven't exceeded any of the service * limits for your account. * @throws PipelineNotFoundException * The specified pipeline was not found. Verify that you used the * correct user and account identifiers. * @throws PipelineDeletedException * The specified pipeline has been deleted. * @sample DataPipeline.ValidatePipelineDefinition */ ValidatePipelineDefinitionResult validatePipelineDefinition( ValidatePipelineDefinitionRequest validatePipelineDefinitionRequest); /** * Shuts down this client object, releasing any resources that might be held * open. This is an optional method, and callers are not expected to call * it, but can if they want to explicitly release any open resources. Once a * client has been shutdown, it should not be used to make any more * requests. */ void shutdown(); /** * Returns additional metadata for a previously executed successful request, * typically used for debugging issues where a service isn't acting as * expected. This data isn't considered part of the result data returned by * an operation, so it's available through this separate, diagnostic * interface. *

* Response metadata is only cached for a limited period of time, so if you * need to access this extra diagnostic information for an executed request, * you should use this method to retrieve it as soon as possible after * executing a request. * * @param request * The originally executed request. * * @return The response metadata for the specified request, or null if none * is available. */ ResponseMetadata getCachedResponseMetadata(AmazonWebServiceRequest request); }





© 2015 - 2025 Weber Informatics LLC | Privacy Policy