The Relay IR is a pure, expression-oriented language. The sections below
describe the different expressions in Relay and detail their semantics.
Dataflow and Control Fragments
==============================
For the purposes of comparing Relay to traditional computational graph-based IRs, it
can be useful to consider Relay expressions in terms of dataflow and control fragments.
Each portion of a Relay program containing expressions that only affect the dataflow can
be viewed as a traditional computation graph when writing and expressing transformations.
The dataflow fragment covers the set of Relay expressions that do not involve
control flow. That is, any portion of a program containing only the following
constructs corresponds to a pure computation graph:
- `Variables`_
- Tuple `Construction`_ and `Projection`_
- `Let Bindings`_
- `Graph Bindings`_
- Calls to `Operators`_
Control flow expressions allow the graph topology to change
based on the value of previously executed expressions. The control
fragment in Relay includes the following constructs:
- `If-Then-Else`_ Expressions
- Recursive Calls in Functions
From the point of view of a computation graph, a function is a subgraph and a function call inlines the subgraph, substituting its arguments for the free variables in the subgraph with corresponding names.
Thus, if a function's body uses only dataflow constructs, a call to that function is in
the dataflow fragment; conversely, if the function's body contains control flow, a call
to that function is not part of the dataflow fragment.
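As a minimal sketch using the Python API (the variable names are illustrative), the first
expression below uses only dataflow constructs and corresponds to a computation graph,
while the second contains an if expression and therefore lies outside the dataflow fragment:

.. code-block:: python

   from tvm import relay

   x = relay.var("x", shape=(10, 10), dtype="float32")

   # Dataflow fragment: only tuples, projections, and operator calls.
   pair = relay.Tuple([x, x])
   dataflow = relay.add(relay.TupleGetItem(pair, 0), x)

   # An if expression introduces control flow, so this expression is not
   # part of the dataflow fragment.
   c = relay.var("c", shape=(), dtype="bool")
   control = relay.If(c, relay.add(x, x), relay.subtract(x, x))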
Variables
=========
Inspired by LLVM, Relay explicitly distinguishes between local and
global variables both in the AST and in the text format. In the text format,
global and local variables are distinguished by prefixes, or *sigils*.
Global variables are prefixed with :code:`@` and local variables with :code:`%`.
This explicit distinction makes certain optimizations easier to implement.
For example, inlining a global definition requires no analysis: simply
substituting the definition suffices.
Global Variable
~~~~~~~~~~~~~~~~~~
Global identifiers are prefixed by the :code:`@` sigil, such as ":code:`@global`".
A global identifier always references a globally visible definition contained in the
globally visible environment, known as the `module`__.
Global identifiers must be unique.
__ `Module and Global Functions`_
See :py:class:`~tvm.relay.expr.GlobalVar` for its implementation
and documentation.
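As a minimal sketch using the Python API (the name :code:`@double` is purely illustrative),
a global definition is added to a module under a :py:class:`~tvm.relay.expr.GlobalVar`,
and call sites refer to the definition through that same variable:

.. code-block:: python

   import tvm
   from tvm import relay

   mod = tvm.IRModule()
   double = relay.GlobalVar("double")   # written @double in the text format
   x = relay.var("x", shape=(10,), dtype="float32")
   mod[double] = relay.Function([x], relay.add(x, x))

   # A call refers to the global definition through the GlobalVar, so
   # inlining it only requires substituting the definition for the variable.
   y = relay.var("y", shape=(10,), dtype="float32")
   call = relay.Call(double, [y])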
Local Variable
~~~~~~~~~~~~~~
Local identifiers are prefixed by the :code:`%` sigil,
such as ":code:`%local`". A local identifier always references
a function argument or a variable bound in a :code:`let` expression,
and will be scoped to the function where it appears or the :code:`let`
expression where it is bound, respectively.
In the below code segment, notice that :code:`%a` is defined twice. This is
permitted, as in most functional languages; in the scope of the second
:code:`let` expression, the name :code:`%a` is "shadowed," meaning all
references to :code:`%a` in the inner scope refer to the later definition, while
references to :code:`%a` in the outer scope continue to refer to
the first one.
.. code-block:: python

   let %a = 1;
   let %b = 2 * %a;  // %b = 2
   let %a = %a + %a; // %a = 2. %a is shadowed
   %a + %b           // has value 2 + 2 = 4
(Note that in Relay's implementation, each definition of a local variable
creates a new :py:class:`~tvm.relay.expr.Var`, so a shadowed local variable,
despite having the same name as one in an outer scope, will be a different
object. This allows local variables to be compared by pointer identity with the
knowledge that the same :code:`Var` object always corresponds to the same binding site.)
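A minimal sketch of this behavior in the Python API (names are illustrative): two
:code:`Var` nodes may share a name hint yet remain distinct objects, and hence
distinct binding sites.

.. code-block:: python

   from tvm import relay

   a_outer = relay.var("a")   # first binding of %a
   a_inner = relay.var("a")   # a shadowing binding with the same name hint

   # Identity, not the name hint, determines the binding site.
   assert not a_outer.same_as(a_inner)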
See :py:class:`~tvm.relay.expr.Var` for its implementation
and documentation.
Functions
=========
Functions in Relay act similarly to procedures or functions in
other programming languages and serve to generalize the concept
of a named subgraph.
Functions are first class in Relay, which means they are expressions just like variables, constants, and tuples.
Additionally, functions in Relay are higher-order, which means that a function can be passed as an argument to a
function or returned by a function, as function expressions evaluate to closures (see the `Closures`_ subsection),
which are values like tensors and tuples.
See :py:class:`~tvm.relay.expr.Function` for the definition and documentation of function nodes.
Syntax
~~~~~~
A definition minimally consists of the keyword :code:`fn`, an empty set of
parameters, and a body expression (:py:class:`~tvm.relay.expr.Expr`)
contained by curly braces.
.. code-block:: python

   fn() { body }
A definition may contain any number of parameters. For example, a
simple function that invokes the :code:`add` operator:
.. code-block:: python

   fn(%x, %y) { add(%x, %y) }
Notice that within the function's body, the parameters are local
variables, just like those bound in a :code:`let` expression.
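The same function can be built with the Python API (a minimal sketch; the shapes and
dtypes are illustrative), with each parameter created as a :py:class:`~tvm.relay.expr.Var`:

.. code-block:: python

   from tvm import relay

   x = relay.var("x", shape=(10, 10), dtype="float32")
   y = relay.var("y", shape=(10, 10), dtype="float32")

   # Corresponds to fn(%x, %y) { add(%x, %y) }
   f = relay.Function([x, y], relay.add(x, y))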
One may also annotate explicit types on functions.
For example, we can restrict the above function to only work
on certain types, as sketched below:
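.. code-block:: python

   fn(%x: Tensor[(10, 10), float32], %y: Tensor[(10, 10), float32])
          -> Tensor[(10, 10), float32] {
       add(%x, %y)
   }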
We briefly introduced types while detailing Relay's expression language,
but have not yet described its type system. Relay is
a statically typed and type-inferred language, allowing programs to
be fully typed while requiring just a few explicit type annotations.
Static types are useful when performing compiler optimizations because they
communicate properties about the data a program manipulates, such as runtime
shape, data layout, and storage, without needing to run the program.
Relay's type system features a form of *dependent typing* for shapes. That is, its type system keeps track of the shapes of tensors in a Relay program. Treating tensor
shapes as types allows Relay to perform more powerful reasoning at compile time;
in particular, Relay can statically reason about operations whose output shapes
vary based on the input shapes in complex ways. Casting shape inference as a type
inference problem allows Relay to infer the shapes of all tensors at compile time,
including in programs that use branching and function calls.
Statically reasoning about shapes in this manner allows
Relay to be ahead-of-time compiled and provides much more information about
tensors for optimizations further in the compilation pipeline. Such optimizations
can be implemented as passes, which are Relay-to-Relay AST transformations, and
may use the inferred types (e.g., shape information) for making decisions about
program transformations. For instance, :code:`src/relay/pass/fuse_ops.cc` gives
an implementation of a pass that uses inferred tensor shapes to replace invocations
of operators in a Relay program with fused operator implementations.
Reasoning about tensor types in Relay is encoded using *type relations*, which means
that the bulk of type checking in Relay is constraint solving (ensuring that all
type relations are satisfied at call sites). Type relations offer a flexible and
relatively simple way of making the power of dependent typing available in Relay
without greatly increasing the complexity of its type system.
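As a minimal sketch of type inference in action (using the Python API; the example
program is illustrative), running the :code:`InferType` pass fills in the types of
every expression in a module, including an unannotated return type:

.. code-block:: python

   import tvm
   from tvm import relay

   x = relay.var("x", shape=(10, 10), dtype="float32")
   f = relay.Function([x], relay.add(x, x))

   mod = tvm.IRModule.from_expr(f)
   mod = relay.transform.InferType()(mod)

   # The return type was never annotated but is now known statically.
   print(mod["main"].ret_type)   # Tensor[(10, 10), float32]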
Types
=====
Below we detail the language of types in Relay and how they are assigned to Relay expressions.
Type
~~~~
The base type for all Relay types. All Relay types are sub-classes of this base type.
See :py:class:`~tvm.relay.ty.Type` for its definition and documentation.
Tensor Type
~~~~~~~~~~~
A concrete tensor type in Relay.
Tensors are typed according to data type and shape. At present, these use TVM's
data types and shapes, but in the future, Relay may include a separate AST for
shapes. In particular, data types include :code:`bool`, :code:`float32`, :code:`int8` and various
other bit widths and numbers of lanes. Shapes are given as tuples of dimensions (TVM :code:`IndexExpr`),
such as :code:`(5, 5)`; scalars are also given tuple types and have a shape of :code:`()`.
Note, though, that TVM shapes can also include variables and arithmetic expressions
involving variables, so Relay's constraint-solving phase will attempt to find
assignments for all shape variables to ensure that all shapes are concrete before
a program runs.
For example, here is a simple concrete tensor type corresponding to a 10-by-10 tensor of 32-bit floats:
.. code-block:: python

   Tensor[(10, 10), float32]
See :py:class:`~tvm.relay.ty.TensorType` for its definition and documentation.
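The same type can be constructed directly with the Python API (a minimal sketch):

.. code-block:: python

   from tvm import relay

   t = relay.TensorType((10, 10), "float32")   # Tensor[(10, 10), float32]
   s = relay.TensorType((), "bool")            # a scalar type, with shape ()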
Tuple Type
~~~~~~~~~~
A type of a tuple in Relay.
Just as a tuple is simply a sequence of values of statically known length, the type
of a tuple consists of a sequence of the types corresponding to each member of the tuple.
Because a tuple type is of statically known size, the type of a tuple projection
is simply the type at the corresponding index in the tuple type.
For example, in the below code, :code:`%t` is of type
:code:`(Tensor[(), bool], Tensor[(10, 10), float32])`
and :code:`%c` is of type :code:`Tensor[(10, 10), float32]`.
.. code-block:: python

   let %t = (False, Constant(1, (10, 10), float32));
   let %c = %t.1;
   %c
See :py:class:`~tvm.relay.ty.TupleType` for its definition and documentation.
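A minimal sketch of the corresponding Python API (the constant values are illustrative):
a tuple value is built with :code:`relay.Tuple` and projected with
:code:`relay.TupleGetItem`, while its type is a :code:`TupleType` over the member types.

.. code-block:: python

   import numpy as np
   from tvm import relay

   t = relay.Tuple([relay.const(False),
                    relay.const(np.ones((10, 10), dtype="float32"))])
   c = relay.TupleGetItem(t, 1)   # has type Tensor[(10, 10), float32]

   # The type of %t, written out explicitly:
   t_type = relay.TupleType([relay.TensorType((), "bool"),
                             relay.TensorType((10, 10), "float32")])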
Type Parameter
~~~~~~~~~~~~~~
Type parameters represent placeholder types used for polymorphism in functions.
Type parameters are specified according to *kind*, corresponding to the types
those parameters are allowed to replace:
- :code:`Type`, corresponding to top-level Relay types like tensor types, tuple types, and function types
- :code:`BaseType`, corresponding to the base type of a tensor (e.g., :code:`float32`, :code:`bool`)
- :code:`Shape`, corresponding to a tensor shape
- :code:`ShapeVar`, corresponding to variables within a tensor shape
Relay's type system enforces that type parameters are only allowed to appear where their kind permits them,
so if type variable :code:`t` is of kind :code:`Type`, :code:`Tensor[t, float32]` is not a valid type.
.. *Note: At present, only type parameters of kind :code:`Type` are supported.*
Like normal parameters, concrete arguments must be given for type parameters at call sites.
.. *Note: type parameter syntax is not yet supported in the text format.*
For example, :code:`s` below is a type parameter of kind :code:`Shape`, and it will
be substituted with :code:`(10, 10)` at the call site, as sketched below:
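.. code-block:: python

   def @plus<s: Shape>(%t1: Tensor[s, float32], %t2: Tensor[s, float32]) {
       add(%t1, %t2)
   }

   @plus<(10, 10)>(%a, %b)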