Background
@In the research fields, such as computer vision and intelligent robot control, there have been various researches where the relation between phenomena can be regarded as causal relationship based on a physical law, and expressed by a mathematical model. However, if we try to describe various phenomena in the real world based on the such approach, we face the following problems:
As a result, we have not yet been able to build the system which realizes intelligent behaviors in the real world. Though causal relationship between phenomena is mostly unknown, we can only estimate the correlation between phenomena from real examples of the phenomena. Therefore, we can realize intelligent behavior of the system in the real world to a certain amount of level, by learning correlation between phenomena rather than expressing those causal relationship with a mathematical model. Generally, such correlation can be regared as a nonlinear mapping between arbitrary dimensional spaces.
Drawbacks in the previous work
There have been many methods for learning the nonlinear mapping, such as the neural network (NN) method, the Group Method of DataHandling (GMDH) method, the RBF network (RBFnet) method and the NGnet method. In addition to these methods, there have been learning methods based on memorizing all events caused by a phenomenon, such as k-nearest neighbor (k-NN) method. However, we have faced the following problems so far:
Basic idea and Advantages
We developed a versatile regression tree algorithm which can represents a nonlinear mapping between arbitrary dimensional spaces, and can achieve the reliable estimation of the objective mapping. Providing a feasible error threshold and training samples, our method automatically divides the objective mapping into partially linear mappings. In order to estimate the most reliable linear mappings satisfying the feasible error criterion, we employ split-and-merge strategy for the decomposition. Due to such strategy, our method doesn't spend long time to split an input hyperspace, and to construct a regression tree for estimating a complicated mapping. Since decomposed mappings are maintained by a binary tree, the linear mapping corresponding to an input is quickly calculated. We call this method a ''Partially Linear Mapping tree (PaLM-tree).''
PaLM-tree iteratively divides the input hyperspace into smaller local hyperspaces with equal dimension each of which is called gdomainh based on an estimation error. Hereafter, let and be a domain and the range corresponding to the domain , respectively. Then, it estimates each linear mapping between and by principal components regression using a training data set within each domain and range . If the estimation error of linear mapping is more than a feasible threshold, it proceeds to divide the domain. Otherwise, it stops to divide. As a result, it represents the nonlinear mapping between and by multiple linear mappings
In the PaLM-tree, decomposition of the input space is maintained by a binary tree. The root of the tree does not have any incoming edges. Every other node has one incoming edge and have two outgoing edges. We call a node without outgoing edges a leaf node, otherwise is called an internal node. PaLM-tree divides an input hyperspace into regions defined by axis-aligned hyperrectangles. A hyperrectangle ( dimensional rectangle) is recursively subdivided into two hyperrectangles with equal dimension. Each node of PaLM-tree is associated with a hyperrectangle, and therefore it is implicitly associated with a set of training samples lying within this hyperrectangle. In particular, each leaf node represents a final subdivision of the input and output hyperspace.
The PaLM-tree has attractive properties as follows:
Example result
The following graph shows an examples of function approximation by PaLM-tree. In this graph, the green points show the training data that are generated by , a red patch shows the estimated local linear mapping.
Application area
It can be applied to various problems.
Reference
Software
@If you are interested in using the PaLM-tree, please contact me (ntakayuk@sys.wakayama-u.ac.jp)
by e-mail.
Attention !
This software is designed for the research communities. @We do not make
any warranty as to the results that may be obtained from the use of this
software and do not take any responsibility for them.