* Support for batch ComputeOp * Support for batch ComputeOp * Fix CrossThreadReduction * Fix lint * Add UpdateArray, remove support for batch reduce * Tuple input support for reduce * rfactor works with multiple reducer; support multiple reducers with different types * Small fix * Small fix * Change return type of rfactor to Array<Expr> * Fix lint * Improve * Add tutorial * Improve tutorial * Improve tutorial