org.hpccsystems.spark.package-info Maven / Gradle / Ivy
Go to download
Show more of this group Show more artifacts with this name
Show all versions of spark-hpcc Show documentation
Show all versions of spark-hpcc Show documentation
Spark connector for reading files residing in an HPCC cluster environment
/*******************************************************************************
* HPCC SYSTEMS software Copyright (C) 2018 HPCC Systems®.
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
* You may obtain a copy of the License at
*
* http://www.apache.org/licenses/LICENSE-2.0
*
* Unless required by applicable law or agreed to in writing, software
* distributed under the License is distributed on an "AS IS" BASIS,
* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
* See the License for the specific language governing permissions and
* limitations under the License.
*******************************************************************************/
/**
* Provides access to data residing in HPCC Systems or Spark environments.
*
* The DFSClient from HPCC Systems is used to used to access
* HPCC Systems data files' metadata including the location and layout of the file, and
* also requests data file access privilages.
* An RDD is provided to read the file in parallel by file part.
*
* The main classes are:
*
* - Content is the abstract class defining field content. There are concrete
* classes for each of the different content types.
* - FieldType is an enumeration type listing the types of content.
* - HpccPart implements the Spark Partition interface.
* - HpccFile is the metadata for a file on an HPCC THOR cluster.
* - HpccFileException is the general exception class.
* - HpccRDD extends RDD(Record) class for Spark.
* - HpccRemoteFileReader is the facade for the type of file reader.
* - Record is the container class holding the data for a record from THOR.
*
*
*/
package org.hpccsystems.spark;
© 2015 - 2025 Weber Informatics LLC | Privacy Policy