Class ParquetLoader


  • public class ParquetLoader
    extends Object
    • Constructor Detail

      • ParquetLoader

        public ParquetLoader()
    • Method Detail

      • cols

        public ParquetLoader cols​(String... columns)
        Configures the loader to only process the specified columns, and include them in the DataFrame in the specified order.
        Returns:
        this loader instance
      • cols

        public ParquetLoader cols​(int... columns)
        Returns:
        this loader instance
      • colsExcept

        public ParquetLoader colsExcept​(String... columns)
        Returns:
        this loader instance
      • colsExcept

        public ParquetLoader colsExcept​(int... columns)
        Returns:
        this loader instance
      • compactCol

        public ParquetLoader compactCol​(int column)
        Configures a Parquet column to be loaded with value compaction. Should be used to save memory for low-cardinality columns. Note that Parquet already does compaction on String columns by default, but some other column types can take advantage of an explicit compaction.
      • compactCol

        public ParquetLoader compactCol​(String column)
        Configures a Parquet column to be loaded with value compaction. Should be used to save memory for low-cardinality columns. Note that Parquet already does compaction on String columns by default, but some other column types can take advantage of an explicit compaction.