public class ParquetStreamFactory extends AvroStreamFactory
LOGGER| Constructor and Description |
|---|
ParquetStreamFactory() |
ParquetStreamFactory(org.apache.hadoop.conf.Configuration configuration) |
| Modifier and Type | Method and Description |
|---|---|
static org.apache.hadoop.conf.Configuration |
cloneDefaultConfiguration() |
org.apache.hadoop.conf.Configuration |
getConfiguration() |
protected org.apache.parquet.filter2.compat.FilterCompat.Filter |
makeFilter() |
protected IGenericRecordConsumer |
prepareRecordConsumer(org.apache.avro.Schema schema,
URI uri) |
static Stream<Map<String,?>> |
readParquetAsStream(URI uriToParquet,
Map<String,?> exampleTypes) |
Stream<org.apache.avro.generic.GenericRecord> |
stream(InputStream rawInputStream) |
Stream<org.apache.avro.generic.GenericRecord> |
stream(URI uri) |
protected org.apache.hadoop.fs.Path |
toHadoopPath(URI uri) |
protected Stream<org.apache.avro.generic.GenericRecord> |
toStream(org.apache.parquet.hadoop.ParquetReader<org.apache.avro.generic.GenericRecord> reader) |
Stream<org.apache.avro.generic.GenericRecord> |
toStream(org.apache.hadoop.fs.Path hadoopPath) |
outputStream, prepareRecordConsumer, serialize, serialize, transcodeclone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waittoStream, toStream, writeToPath, writeToPathpublic ParquetStreamFactory()
public ParquetStreamFactory(org.apache.hadoop.conf.Configuration configuration)
public static org.apache.hadoop.conf.Configuration cloneDefaultConfiguration()
public org.apache.hadoop.conf.Configuration getConfiguration()
public Stream<org.apache.avro.generic.GenericRecord> stream(URI uri) throws IOException
stream in interface IAvroStreamFactorystream in class AvroStreamFactoryIOExceptionpublic Stream<org.apache.avro.generic.GenericRecord> stream(InputStream rawInputStream) throws IOException
rawInputStream - a stream of bytes associated to a Parquet file. It is sub-optimal as Parquet require a
SeekableInputStream (i.e. an InputStream with RandomAccess). We will then copy the file in local FSIOExceptionpublic Stream<org.apache.avro.generic.GenericRecord> toStream(org.apache.hadoop.fs.Path hadoopPath) throws IOException
IOExceptionprotected org.apache.parquet.filter2.compat.FilterCompat.Filter makeFilter()
protected Stream<org.apache.avro.generic.GenericRecord> toStream(org.apache.parquet.hadoop.ParquetReader<org.apache.avro.generic.GenericRecord> reader)
public static Stream<Map<String,?>> readParquetAsStream(URI uriToParquet, Map<String,?> exampleTypes) throws FileNotFoundException, IOException
FileNotFoundExceptionIOExceptionprotected IGenericRecordConsumer prepareRecordConsumer(org.apache.avro.Schema schema, URI uri) throws IOException
prepareRecordConsumer in class AvroStreamFactoryIOExceptionprotected org.apache.hadoop.fs.Path toHadoopPath(URI uri)
Copyright © 2014–2019. All rights reserved.