For decades, many researchers have proposed and developed methods of content-based image retrieval using image features such as color, shape, texture, and spatial relationship. In particular, shape-based image retrieval has received considerable attention recently with interest in establishment of large biological databases. If images contain similar color or texture, shape-based image retrieval method is more effective than other methods using color or texture. For instance, leaves of most plants are green or brown; but the leaf shapes are distinctive and thus can be used for identification.
In this thesis, we present an effective and robust leaf image retrieval system based on shape feature. For the effective measurement of leaf similarity, we consider contour-based shape feature as well as region-based venation feature which corresponds to the blood vessel of organisms. In the contour-based shape domain, a matrix of interest points is constructed to model the similarity between two leaf images. In order to improve the retrieval performance, an improved Minimum Perimeter Polygons (MPP) and an adaptive grid-based matching algorithm are developed for the shape representation and data matching, respectively. In the region-based venation domain, an adjacency matrix is constructed from intersection and end points of a venation to model the similarity between two leaf images.
Additionally, we propose a novel leaf image classification scheme which first analyzes the venation for leaf categorization and then extracts and utilizes shape feature to find similar ones from the categorized group in the database. The venations are represented using points selected by the curvature scale scope corner detection method on the venation image, and categorized by calculating the density of feature points using non-parametric estimation density.
In order to reduce the matching time, we have proposed an adaptive matching algorithm using leaf symmetric distribution. Using this property, matching scope on the shape can be reduced. We also implemented a grid-based matching algorithm on our system to fast access to records and efficient process range queries. Several tests have shown that the algorithms are suitable for image databases with a varying number of attributes.
We implemented a prototype system and performed various experiments to show its effectiveness. Its performance is compared with other methods including Centroid Contour Distance (CCD), Fourier Descriptor, Curvature Scale Space Descriptor (CSSD), Moment Invariants, and MPP. Experimental results on one thousand leaf images show that our approach achieves a better performance than other methods