Parallel k-most similar neighbor classifier for mixed data Conference Paper uri icon

abstract

  • This paper presents a paralellization of the incremental algorithm inc-k-msn, for mixed data and similarity functions that do not satisfy metric properties. The algorithm presented is suitable for processing large data sets, because it only stores in main memory the k-most similar neighbors processed in step t, traversing only once the training data set. Several experiments with synthetic and real data are presented. © 2012 Springer-Verlag.

publication date

  • 2012-01-01