Commit daec1e47 authored by He Guanlin's avatar He Guanlin
Browse files

Update README.md

parent 835de4ff
**CPU-GPU-kmeans**\\
**CPU-GPU-kmeans**
Optimized parallel implementations of the k-means clustering algorithm:
1. on multi-core CPU with vector units: thread parallelization using OpenMP, auto-vectorization using AVX units
2. on NVIDIA GPU: using shared memory, dynamic parallelism, and multiple streams\\
2. on NVIDIA GPU: using shared memory, dynamic parallelism, and multiple streams
In particular, for both implementations we use a two-step summation method with package processing to handle the effect of rounding errors that may occur during the phase of updating cluster centroids.
【Makefile Configuration】
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment