Class ParquetLoader

java.lang.Object
org.dflib.parquet.ParquetLoader

public class ParquetLoader extends Object
  • Constructor Details

    • ParquetLoader

      public ParquetLoader()
  • Method Details

    • cols

      public ParquetLoader cols(String... columns)
      Configures the loader to only process the specified columns, and include them in the DataFrame in the specified order.
      Returns:
      this loader instance
    • cols

      public ParquetLoader cols(int... columns)
      Returns:
      this loader instance
    • colsExcept

      public ParquetLoader colsExcept(String... columns)
      Returns:
      this loader instance
    • colsExcept

      public ParquetLoader colsExcept(int... columns)
      Returns:
      this loader instance
    • compactCol

      public ParquetLoader compactCol(int column)
      Configures a Parquet column to be loaded with value compaction. Should be used to save memory for low-cardinality columns. Note that Parquet already does compaction on String columns by default, but some other column types can take advantage of an explicit compaction.
    • compactCol

      public ParquetLoader compactCol(String column)
      Configures a Parquet column to be loaded with value compaction. Should be used to save memory for low-cardinality columns. Note that Parquet already does compaction on String columns by default, but some other column types can take advantage of an explicit compaction.
    • load

      public DataFrame load(File file)
    • load

      public DataFrame load(String filePath)
    • load

      public DataFrame load(Path filePath)
    • load

      public DataFrame load(ByteSource src)
      Since:
      1.1.0
    • loadAll

      public Map<String,DataFrame> loadAll(ByteSources src)
      Since:
      1.1.0