Jano's Homepage
Personal
Publications
 · Constraint Satisfaction
 · Data Mining
 · Dynamic Problems
 · Evolutionary Art
 · Problem Evolving  
Presentations
Projects
Photos
Past



 
Publications » Data Mining


Matching Spatial Regions with Combinations of Interacting Gene Expression Patterns
inproceedings van Hemert, J.I. and R.A. Baldock @ 2008/07/07
Proceedings of the 2nd International Conference on BioInformatics Research and Development, pages 347-361.

The Edinburgh Mouse Atlas aims to capture in-situ gene expression patterns in a common spatial framework. In this study, we construct a grammar to define spatial regions by combinations of these patterns. Combinations are formed by applying operators to curated gene expression patterns from the atlas, thereby resembling gene interactions in a spatial context. The space of combinations is searched using an evolutionary algorithm with the objective of finding the best match to a given target pattern. We evaluate the method by testing its robustness and the statistical significance of the results it finds.



Mining spatial gene expression data for association rules
inproceedings van Hemert, J.I. and R.A. Baldock @ 2007/03/12
Proceedings of the 1st International Conference on BioInformatics Research and Development, pages 66-76.
[ pdf | url ]

We analyse data from the Edinburgh Mouse Atlas Gene-Expression Database (EMAGE) which is a high quality data source for spatio-temporal gene expression patterns. Using a novel process whereby generated patterns are used to probe spatially-mapped gene expression domains, we are able to get unbiased results as opposed to using annotations based predefined anatomy regions. We describe two processes to form association rules based on spatial configurations, one that associates spatial regions, the other associates genes.



Application of Evolutionary Computation to Constraint Satisfaction and Data Mining
phdthesis van Hemert, J.I. @ 2002/11/28

A limited number of hardcopies is available for those who are interested, drop me an e-mail. Contents (chapter level):

  • 1. Introduction
  • 2. Evolutionary Computation
  • Part I: Constraint Satisfaction
  • 3. Constraint Satisfaction problems
  • 4. Solving Constraint Satisfaction Problems
  • 5. Empirical Research on Constraint Satisfaction
  • 6. Measuring the Resampling Ratio
  • 7. Constraint Satisfaction: Conclusions
  • Part II: Data Mining
  • 8. Introduction
  • 9. Classification
  • 10. Symbolic Regression
  • 11. Data Mining Conclusions
  • 12. Dynamic Behaviour
  • 13. Bridging the Gap
  • A. RandomCSP Library
  • B. Library for Evolutionary Algorithm Programming
  • C. Case Study: Scheduling a Telescope

Application of Evolutionary Computation to Constraint Satisfaction and Data Mining



Evolutionary Computation in Constraint Satisfaction and Machine Learning - An abstract of my PhD.
inproceedings van Hemert, J.I. @ 2001/05/01
Proceedings of the Brussels Evolutionary Algorithms Day (BEAD-2001).
[ pdf | ps.gz ]


Adaptive Genetic Programming Applied to New and Existing Simple Regression Problems
inproceedings J. Eggermont and van Hemert, J.I. @ 2001/05/01
Genetic Programming, pages 23-35.
[ pdf | ps.gz ]

In this paper we continue our study on adaptive genetic pro-gramming. We use Stepwise Adaptation of Weights to boost performance of a genetic programming algorithm on simple symbolic regression problems. We measure the performance of a standard GP and two variants of SAW extensions on two different symbolic regression prob-lems from literature. Also, we propose a model for randomly generating polynomials which we then use to further test all three GP variants.



Stepwise Adaptation of Weights for Symbolic Regression with Genetic Programming
inproceedings J. Eggermont and van Hemert, J.I. @ 2000/11/01
Proceedings of the Twelfth Belgium/Netherlands Conference on Artificial Intelligence (BNAIC'00), pages 259-266.
[ pdf | ps.gz ]

In this paper we continue study on the Stepwise Adaptation of Weights (SAW) technique. Previous studies on constraint satisfaction and data clas-sification have indicated that SAW is a promising technique to boost the performance of evolutionary algorithms. Here we use SAW to boost per-formance of a genetic programming algorithm on simple symbolic regression problems. We measure the performance of a standard GP and two variants of SAW extensions on two different symbolic regression problems.



Adapting the Fitness Function in GP for Data Mining
inproceedings J. Eggermont and A.E. Eiben and van Hemert, J.I. @ 1999/05/26
Genetic Programming, pages 195-204.
[ ps.gz ]

In this paper we describe how the Stepwise Adaptation of Weights (SAW) technique can be applied in genetic programming. The SAW-ing mechanism has been originally developed for and successfully used in EAs for constraint satisfaction problems. Here we identify the very basic underlying ideas behind SAW-ing and point out how it can be used for different types of problems. In particular, SAW-ing is well suited for data mining tasks where the fitness of a candidate solution is composed by `local scores' on data records. We evaluate the power of the SAW-ing mechanism on a number of benchmark classification data sets. The results indicate that extending the GP with the SAW-ing feature increases its performance when different types of misclassifications are not weighted differently, but leads to worse results when they are.



A comparison of genetic programming variants for data classification
inproceedings J. Eggermont and A.E. Eiben and van Hemert, J.I. @ 1999/08/09
Advances in Intelligent Data Analysis, pages 281-290.
[ ps.gz ]

In this paper we report the results of a comparative study on different variations of genetic programming applied on binary data classification problems. The first genetic programming variant is weighting data records for calculating the classification error and modifying the weights during the run. Hereby the algorithm is defining its own fitness function in an on-line fashion giving higher weights to `hard' records. Another novel feature we study is the atomic representation, where `Booleanization' of data is not performed at the root, but at the leafs of the trees and only Boolean functions are used in the trees' body. As a third aspect we look at generational and steady-state models in combination of both features.



Comparing genetic programming variants for data classification
inproceedings J. Eggermont and A.E. Eiben and van Hemert, J.I. @ 1999/11/03
Proceedings of the Eleventh Belgium/Netherlands Conference on Artificial Intelligence (BNAIC'99), pages 253-254.
[ ps.gz ]

This article is a combined summary of two papers written by the authors. Binary data classification problems (with exactly two disjoint classes) form an important application area of machine learning techniques, in particular genetic programming (GP). In this study we compare a number of different variants of GP applied to such problems whereby we investigate the effect of two significant changes in a fixed GP setup in combination with two different evolutionary models



Applying Adaptive Evolutionary Algorithms to Hard Problems
mastersthesis van Hemert, J.I. @ 1998/08/31
[ ps.gz ]

Supervised by A.E. Eiben and E. Marchiori