tp-apprentissage/clustering-benchmark-master/README.md
Titouan Labourdette e3009c62af 1er commit
2021-09-28 15:35:45 +02:00

139 lines
33 KiB
Markdown

# Clustering benchmarks
## Datasets
This project contains collection of labeled clustering problems that can be found in the literature. Most of datasets were artificially created.
The benchmark includes:
* [artificial datasets](https://github.com/deric/clustering-benchmark/tree/master/src/main/resources/datasets/artificial)
* [real world datasets](https://github.com/deric/clustering-benchmark/tree/master/src/main/resources/datasets/real-world)
### Artificial data
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/2d-10c.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/2d-10c.png" alt="2d-10c" title="2d-10c" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/2d-20c-no0.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/2d-20c-no0.png" alt="2d-20c-no0" title="2d-20c-no0" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/2d-3c-no123.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/2d-3c-no123.png" alt="2d-3c-no123" title="2d-3c-no123" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/2d-4c-no4.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/2d-4c-no4.png" alt="2d-4c-no4" title="2d-4c-no4" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/2d-4c-no9.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/2d-4c-no9.png" alt="2d-4c-no9" title="2d-4c-no9" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/2d-4c.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/2d-4c.png" alt="2d-4c" title="2d-4c" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/2sp2glob.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/2sp2glob.png" alt="2sp2glob" title="2sp2glob" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/3-spiral.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/3-spiral.png" alt="3-spiral" title="3-spiral" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/3MC.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/3mc.png" alt="3MC" title="3MC" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/D31.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/d31.png" alt="D31" title="D31" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/DS577.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/ds577.png" alt="DS577" title="DS577" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/DS850.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/ds850.png" alt="DS850" title="DS850" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/R15.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/r15.png" alt="R15" title="R15" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/aggregation.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/aggregation.png" alt="aggregation" title="aggregation" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/atom.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/atom.png" alt="atom" title="atom" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/banana.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/banana.png" alt="banana" title="banana" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/birch-rg1.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/birch-rg1.png" alt="birch-rg1" title="birch-rg1" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/birch-rg2.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/birch-rg2.png" alt="birch-rg2" title="birch-rg2" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/birch-rg3.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/birch-rg3.png" alt="birch-rg3" title="birch-rg3" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/chainlink.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/chainlink.png" alt="chainlink" title="chainlink" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/cluto-t4.8k.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/cluto-t4.8k.png" alt="cluto-t4.8k" title="cluto-t4.8k" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/cluto-t5.8k.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/cluto-t5.8k.png" alt="cluto-t5.8k" title="cluto-t5.8k" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/cluto-t7.10k.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/cluto-t7.10k.png" alt="cluto-t7.10k" title="cluto-t7.10k" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/cluto-t8.8k.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/cluto-t8.8k.png" alt="cluto-t8.8k" title="cluto-t8.8k" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/complex8.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/complex8.png" alt="complex8" title="complex8" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/complex9.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/complex9.png" alt="complex9" title="complex9" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/compound.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/compound.png" alt="compound" title="compound" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/cure-t0-2000n-2D.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/cure-t0-2000n-2d.png" alt="cure-t0-2000n-2D" title="cure-t0-2000n-2D" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/cure-t1-2000n-2D.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/cure-t1-2000n-2d.png" alt="cure-t1-2000n-2D" title="cure-t1-2000n-2D" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/cure-t2-4k.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/cure-t2-4k.png" alt="cure-t2-4k" title="cure-t2-4k" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/curves1.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/curves1.png" alt="curves1" title="curves1" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/curves2.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/curves2.png" alt="curves2" title="curves2" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/dartboard1.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/dartboard1.png" alt="dartboard1" title="dartboard1" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/dartboard2.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/dartboard2.png" alt="dartboard2" title="dartboard2" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/dense-disk-3000.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/dense-disk-3000.png" alt="dense-disk-3000" title="dense-disk-3000" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/dense-disk-5000.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/dense-disk-5000.png" alt="dense-disk-5000" title="dense-disk-5000" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/diamond9.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/diamond9.png" alt="diamond9" title="diamond9" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/disk-1000n.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/disk-1000n.png" alt="disk-1000n" title="disk-1000n" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/disk-3000n.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/disk-3000n.png" alt="disk-3000n" title="disk-3000n" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/disk-4000n.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/disk-4000n.png" alt="disk-4000n" title="disk-4000n" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/disk-4500n.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/disk-4500n.png" alt="disk-4500n" title="disk-4500n" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/disk-4600n.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/disk-4600n.png" alt="disk-4600n" title="disk-4600n" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/disk-5000n.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/disk-5000n.png" alt="disk-5000n" title="disk-5000n" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/disk-6000n.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/disk-6000n.png" alt="disk-6000n" title="disk-6000n" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/donut1.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/donut1.png" alt="donut1" title="donut1" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/donut2.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/donut2.png" alt="donut2" title="donut2" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/donut3.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/donut3.png" alt="donut3" title="donut3" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/donutcurves.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/donutcurves.png" alt="donutcurves" title="donutcurves" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/ds2c2sc13.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/ds2c2sc13.png" alt="ds2c2sc13" title="ds2c2sc13" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/ds3c3sc6.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/ds3c3sc6.png" alt="ds3c3sc6" title="ds3c3sc6" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/ds4c2sc8.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/ds4c2sc8.png" alt="ds4c2sc8" title="ds4c2sc8" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/elliptical_10_2.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/elliptical_10_2.png" alt="elliptical_10_2" title="elliptical_10_2" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/elly-2d10c13s.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/elly-2d10c13s.png" alt="elly-2d10c13s" title="elly-2d10c13s" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/engytime.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/engytime.png" alt="engytime" title="engytime" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/flame.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/flame.png" alt="flame" title="flame" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/fourty.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/fourty.png" alt="fourty" title="fourty" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/golfball.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/golfball.png" alt="golfball" title="golfball" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/hepta.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/hepta.png" alt="hepta" title="hepta" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/insect.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/insect.png" alt="insect" title="insect" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/jain.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/jain.png" alt="jain" title="jain" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/long1.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/long1.png" alt="long1" title="long1" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/long2.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/long2.png" alt="long2" title="long2" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/long3.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/long3.png" alt="long3" title="long3" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/longsquare.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/longsquare.png" alt="longsquare" title="longsquare" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/lsun.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/lsun.png" alt="lsun" title="lsun" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/mopsi-finland.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/mopsi-finland.png" alt="mopsi-finland" title="mopsi-finland" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/mopsi-joensuu.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/mopsi-joensuu.png" alt="mopsi-joensuu" title="mopsi-joensuu" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/pathbased.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/pathbased.png" alt="pathbased" title="pathbased" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/rings.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/rings.png" alt="rings" title="rings" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/s-set1.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/s-set1.png" alt="s-set1" title="s-set1" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/s-set2.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/s-set2.png" alt="s-set2" title="s-set2" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/s-set3.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/s-set3.png" alt="s-set3" title="s-set3" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/s-set4.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/s-set4.png" alt="s-set4" title="s-set4" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/sizes1.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/sizes1.png" alt="sizes1" title="sizes1" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/sizes2.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/sizes2.png" alt="sizes2" title="sizes2" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/sizes3.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/sizes3.png" alt="sizes3" title="sizes3" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/sizes4.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/sizes4.png" alt="sizes4" title="sizes4" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/sizes5.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/sizes5.png" alt="sizes5" title="sizes5" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/smile1.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/smile1.png" alt="smile1" title="smile1" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/smile2.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/smile2.png" alt="smile2" title="smile2" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/smile3.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/smile3.png" alt="smile3" title="smile3" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/spherical_4_3.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/spherical_4_3.png" alt="spherical_4_3" title="spherical_4_3" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/spherical_5_2.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/spherical_5_2.png" alt="spherical_5_2" title="spherical_5_2" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/spherical_6_2.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/spherical_6_2.png" alt="spherical_6_2" title="spherical_6_2" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/spiral.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/spiral.png" alt="spiral" title="spiral" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/spiralsquare.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/spiralsquare.png" alt="spiralsquare" title="spiralsquare" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/square1.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/square1.png" alt="square1" title="square1" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/square2.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/square2.png" alt="square2" title="square2" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/square3.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/square3.png" alt="square3" title="square3" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/square4.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/square4.png" alt="square4" title="square4" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/square5.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/square5.png" alt="square5" title="square5" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/st900.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/st900.png" alt="st900" title="st900" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/target.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/target.png" alt="target" title="target" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/tetra.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/tetra.png" alt="tetra" title="tetra" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/triangle1.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/triangle1.png" alt="triangle1" title="triangle1" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/triangle2.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/triangle2.png" alt="triangle2" title="triangle2" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/twenty.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/twenty.png" alt="twenty" title="twenty" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/twodiamonds.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/twodiamonds.png" alt="twodiamonds" title="twodiamonds" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/wingnut.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/wingnut.png" alt="wingnut" title="wingnut" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/xclara.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/xclara.png" alt="xclara" title="xclara" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/zelnik1.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/zelnik1.png" alt="zelnik1" title="zelnik1" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/zelnik2.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/zelnik2.png" alt="zelnik2" title="zelnik2" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/zelnik3.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/zelnik3.png" alt="zelnik3" title="zelnik3" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/zelnik4.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/zelnik4.png" alt="zelnik4" title="zelnik4" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/zelnik5.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/zelnik5.png" alt="zelnik5" title="zelnik5" width="239px" style="max-width: 100%;float:left;"/></a>
<a href="https://github.com/deric/clustering-benchmark/blob/master/src/main/resources/datasets/artificial/zelnik6.arff"><img src="https://github.com/deric/clustering-benchmark/blob/images/fig/artificial/zelnik6.png" alt="zelnik6" title="zelnik6" width="239px" style="max-width: 100%;float:left;"/></a>
## Experiments
This project contains set of clustering methods benchmarks on various dataset. The project is dependent on [Clueminer project](https://github.com/deric/clueminer).
in order to run benchmark compile dependencies into a single JAR file:
mvn assembly:assembly
# Consensus experiment
allows running repeated runs of the same algorithm:
```
./run consensus --dataset "triangle1" --repeat 10
```
by default k-means algorithm is used.
For available datasets see [resources folder](https://github.com/deric/clustering-benchmark/tree/master/src/main/resources/datasets/artificial).