Using five variables and four clusters, here's what k-means found (click for big picture).
This includes all y=50, nothing normalized etc. This seems to work. I suspect normalizing the data will help sort things out. Goal is to automate clusters of 2,3,4 and 5 to find best fit, apply it, and go from there.
Sunday, September 16, 2007
Ted Lilly Clusters
Subscribe to:
Post Comments (Atom)





0 comments:
Post a Comment