User Documentation for Apache MADlib
Data Types and Transformations

Detailed Description

Data types and operations that transform and shape data.


 Arrays and Matrices
 Mathematical operations for arrays and matrices.
 Encoding Categorical Variables
 Functions to encode categorical variables to prepare data for input into predictive algorithms.
 A function to perform complex pattern matching across rows and extract useful information about the matches.
 Pivoting and data summarization tools for preparing data for modeling operations.
 Session reconstruction of data consisting of a time stampled sequence of events.
 Provides porter stemmer operations supporting other MADlib modules.