Packages

  • package root
    Definition Classes
    root
  • package org
    Definition Classes
    root
  • package apache
    Definition Classes
    org
  • package spark
    Definition Classes
    apache
  • package sql
    Definition Classes
    spark
  • package catalyst

    Catalyst is a library for manipulating relational query plans.

    Catalyst is a library for manipulating relational query plans. All classes in catalyst are considered an internal API to Spark SQL and are subject to change between minor releases.

    Definition Classes
    sql
  • package expressions

    A set of classes that can be used to represent trees of relational expressions.

    A set of classes that can be used to represent trees of relational expressions. A key goal of the expression library is to hide the details of naming and scoping from developers who want to manipulate trees of relational operators. As such, the library defines a special type of expression, a NamedExpression in addition to the standard collection of expressions.

    Standard Expressions

    A library of standard expressions (e.g., Add, EqualTo), aggregates (e.g., SUM, COUNT), and other computations (e.g. UDFs). Each expression type is capable of determining its output schema as a function of its children's output schema.

    Named Expressions

    Some expression are named and thus can be referenced by later operators in the dataflow graph. The two types of named expressions are AttributeReferences and Aliases. AttributeReferences refer to attributes of the input tuple for a given operator and form the leaves of some expression trees. Aliases assign a name to intermediate computations. For example, in the SQL statement SELECT a+b AS c FROM ..., the expressions a and b would be represented by AttributeReferences and c would be represented by an Alias.

    During analysis, all named expressions are assigned a globally unique expression id, which can be used for equality comparisons. While the original names are kept around for debugging purposes, they should never be used to check if two attributes refer to the same value, as plan transformations can result in the introduction of naming ambiguity. For example, consider a plan that contains subqueries, both of which are reading from the same table. If an optimization removes the subqueries, scoping information would be destroyed, eliminating the ability to reason about which subquery produced a given attribute.

    Evaluation

    The result of expressions can be evaluated using the Expression.apply(Row) method.

    Definition Classes
    catalyst
  • package aggregate
    Definition Classes
    expressions
  • package codegen

    A collection of generators that build custom bytecode at runtime for performing the evaluation of catalyst expression.

    A collection of generators that build custom bytecode at runtime for performing the evaluation of catalyst expression.

    Definition Classes
    expressions
  • package objects
    Definition Classes
    expressions
  • package xml
    Definition Classes
    expressions
  • UDFXPathUtil
  • XPathBoolean
  • XPathDouble
  • XPathExtract
  • XPathFloat
  • XPathInt
  • XPathList
  • XPathLong
  • XPathShort
  • XPathString

package xml

Type Members

  1. class UDFXPathUtil extends AnyRef

    Utility class for all XPath UDFs.

    Utility class for all XPath UDFs. Each UDF instance should keep an instance of this class.

    This is based on Hive's UDFXPathUtil implementation.

  2. case class XPathBoolean(xml: Expression, path: Expression) extends XPathExtract with Product with Serializable
    Annotations
    @ExpressionDescription()
  3. case class XPathDouble(xml: Expression, path: Expression) extends XPathExtract with Product with Serializable
    Annotations
    @ExpressionDescription()
  4. abstract class XPathExtract extends BinaryExpression with ExpectsInputTypes with CodegenFallback

    Base class for xpath_boolean, xpath_double, xpath_int, etc.

    Base class for xpath_boolean, xpath_double, xpath_int, etc.

    This is not the world's most efficient implementation due to type conversion, but works.

  5. case class XPathFloat(xml: Expression, path: Expression) extends XPathExtract with Product with Serializable
    Annotations
    @ExpressionDescription()
  6. case class XPathInt(xml: Expression, path: Expression) extends XPathExtract with Product with Serializable
    Annotations
    @ExpressionDescription()
  7. case class XPathList(xml: Expression, path: Expression) extends XPathExtract with Product with Serializable
    Annotations
    @ExpressionDescription()
  8. case class XPathLong(xml: Expression, path: Expression) extends XPathExtract with Product with Serializable
    Annotations
    @ExpressionDescription()
  9. case class XPathShort(xml: Expression, path: Expression) extends XPathExtract with Product with Serializable
    Annotations
    @ExpressionDescription()
  10. case class XPathString(xml: Expression, path: Expression) extends XPathExtract with Product with Serializable
    Annotations
    @ExpressionDescription()

Ungrouped