Background
@In the research fields, such as computer vision and intelligent robot control, there have been various researches where the relation between phenomena can be regarded as causal relationship based on a physical law, and expressed by a mathematical model. However, if we try to describe various phenomena in the real world based on the such approach, we face the following problems:
As a result, we have not yet been able to build the system which realizes intelligent behaviors in the real world. Though causal relationship between phenomena is mostly unknown, we can only estimate the correlation between phenomena from real examples of the phenomena. Therefore, we can realize intelligent behavior of the system in the real world to a certain amount of level, by learning correlation between phenomena rather than expressing those causal relationship with a mathematical model. Generally, such correlation can be regared as a nonlinear mapping between arbitrary dimensional spaces.
Drawbacks in the previous work
There have been many methods for learning the nonlinear mapping, such as the neural network (NN) method, the Group Method of DataHandling (GMDH) method, the RBF network (RBFnet) method and the NGnet method. In addition to these methods, there have been learning methods based on memorizing all events caused by a phenomenon, such as k-nearest neighbor (k-NN) method. However, we have faced the following problems so far:
Basic idea and Advantages
We developed a versatile regression tree algorithm which can represents a nonlinear mapping between arbitrary dimensional spaces, and can achieve the reliable estimation of the objective mapping. Providing a feasible error threshold and training samples, our method automatically divides the objective mapping into partially linear mappings. In order to estimate the most reliable linear mappings satisfying the feasible error criterion, we employ split-and-merge strategy for the decomposition. Due to such strategy, our method doesn't spend long time to split an input hyperspace, and to construct a regression tree for estimating a complicated mapping. Since decomposed mappings are maintained by a binary tree, the linear mapping corresponding to an input is quickly calculated. We call this method a ''Partially Linear Mapping tree (PaLM-tree).''
PaLM-tree iteratively
divides the input hyperspace into smaller local hyperspaces with equal dimension each of
which is called gdomainh based on an estimation error. Hereafter, let
and
be a domain and
the range corresponding to the domain
, respectively. Then, it estimates each linear mapping
between
and
by principal
components regression using a training data set
within each
domain
and range
. If the estimation error of linear mapping is more than a
feasible threshold, it proceeds to divide the domain. Otherwise, it stops to
divide. As a result, it represents
the nonlinear mapping
between
and
by multiple
linear mappings
In the PaLM-tree, decomposition of the input space is maintained by a binary
tree. The root of the tree does not have any incoming edges. Every other node
has one incoming edge and have two outgoing edges. We call a node without
outgoing edges a leaf node, otherwise is called an internal node. PaLM-tree divides an input hyperspace into regions
defined by axis-aligned hyperrectangles. A hyperrectangle (
dimensional rectangle) is recursively subdivided into two hyperrectangles
with equal dimension. Each node of PaLM-tree is associated with a
hyperrectangle, and therefore it is implicitly associated with a set of
training samples lying within this hyperrectangle. In particular, each leaf node represents a final subdivision
of the input and output hyperspace.
The PaLM-tree has attractive properties as follows:
Example result
The following graph shows an examples of function
approximation by PaLM-tree. In this graph, the green points show the training
data that are generated by , a red patch shows the estimated local linear mapping.
Application area
It can be applied to various problems.
Reference
Software
@If you are interested in using the PaLM-tree, please contact me (ntakayuk@sys.wakayama-u.ac.jp)
by e-mail.
Attention !
This software is designed for the research communities. @We do not make
any warranty as to the results that may be obtained from the use of this
software and do not take any responsibility for them.