A Versatile Linear Regression Tree Algorithm, PaLM-tree


Background

@In the research fields, such as computer vision and intelligent robot control, there have been various researches where the relation between phenomena can be regarded as causal relationship based on a physical law, and expressed by a mathematical model.  However, if we try to describe various phenomena in the real world based on the such approach, we face the following problems:

As a result, we have not yet been able to build the system which realizes intelligent behaviors in the real world.  Though causal relationship between phenomena is mostly unknown, we can only estimate the correlation between phenomena from real examples of the phenomena. Therefore, we can realize intelligent behavior of the system in the real world to a certain amount of level, by learning correlation between phenomena rather than expressing those causal relationship with a mathematical model. Generally, such correlation can be regared as a nonlinear mapping between arbitrary dimensional spaces.

Drawbacks in the previous work

There have been many methods for learning the nonlinear mapping, such as the neural network (NN) method, the Group Method of DataHandling (GMDH) method, the RBF network (RBFnet) method and the NGnet method. In addition to these methods, there have been learning methods based on memorizing all events caused by a phenomenon, such as k-nearest neighbor  (k-NN) method. However, we have faced the following problems so far:

Basic idea and Advantages

We developed a versatile regression tree algorithm which can represents a nonlinear mapping between arbitrary dimensional spaces, and can achieve the reliable estimation of the objective mapping. Providing a feasible error threshold and training samples, our method automatically divides the objective mapping into partially linear mappings. In order to estimate the most reliable linear mappings satisfying the feasible error criterion, we employ split-and-merge strategy for the decomposition.  Due to such strategy, our method doesn't spend long time to split an input hyperspace, and to construct a regression tree for estimating a complicated mapping. Since decomposed mappings are maintained by a binary tree, the linear mapping corresponding to an input is quickly calculated. We call this method a ''Partially Linear Mapping  tree (PaLM-tree).''


PaLM-tree iteratively divides the input hyperspace into smaller local hyperspaces with equal dimension each of which is called gdomainh based on an estimation error. Hereafter, let  and  be a domain and the range corresponding to the domain , respectively. Then, it estimates each linear mapping between  and  by principal components regression using a training data set  within each domain  and range . If the estimation error of linear mapping is more than a feasible threshold, it proceeds to divide the domain. Otherwise, it stops to divide.  As a result, it represents the nonlinear mapping  between  and  by multiple linear mappings

In the PaLM-tree, decomposition of the input space is maintained by a binary tree. The root of the tree does not have any incoming edges. Every other node has one incoming edge and have two outgoing edges. We call a node without outgoing edges a leaf node, otherwise is called an internal node. PaLM-tree divides an input hyperspace  into regions defined by axis-aligned hyperrectangles. A hyperrectangle ( dimensional rectangle) is recursively subdivided into two hyperrectangles with equal dimension. Each node of PaLM-tree is associated with a hyperrectangle, and therefore it is implicitly associated with a set of training samples lying within this hyperrectangle. In particular, each leaf node represents a final subdivision of the input and output hyperspace.


The PaLM-tree has attractive properties as follows:

Example result

The following graph shows an examples of function approximation by PaLM-tree. In this graph, the green points show the training data that are generated by , a red patch shows the estimated local linear mapping.

Application area

It can be applied to various problems.

Reference

  1. T. Nakamura, E. Kato and T. Wada, hA novel nonlinear mapping method (PaLM-tree) and its applicationhCProc. Of 9th Robotics SymposiumCpp.360-366, 2004. [PDF]  (written in Japanese).
  2. T. Kawahara, T. Nakamura and T. Wada, hMotion planning for wheeled mobile robots based on PaLM-treehCProc. Of  Robotics Mechatronics 2004.  2P1-L1-27, 2004. [PDF]  (written in Japanese).

Software
@If you are interested in using the PaLM-tree, please contact me (ntakayuk@sys.wakayama-u.ac.jp) by e-mail.

Attention !
This software is designed for the research communities. @We do not make any warranty as to the results that may be obtained from the use of this software and do not take any responsibility for them.