Project 2: Yu Sun #4

sunfish2010 · 2018-09-18T02:29:04Z

Features

Implemented CPU, naive, work-efficient, and Thrust versions of scan and stream compaction
Answered the questions and did the performance analysis
Tried to optimize the work-efficient algorithm by launching its kernels with different grid sizes (extra credit? not working so well)
Implemented naive and work-efficient scan with shared memory (extra credit)
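
For readers new to the two primitives named in the list above, here is a minimal CPU sketch of an exclusive scan and of stream compaction built from map, scan, and scatter steps. It is not taken from this PR: the function names `exclusiveScan` and `compactNonZero` are hypothetical, and "keep the non-zero elements" is an assumed compaction predicate.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Exclusive prefix sum: out[i] = in[0] + ... + in[i-1], and out[0] = 0.
// (Hypothetical helper name, for illustration only.)
std::vector<int> exclusiveScan(const std::vector<int>& in) {
    std::vector<int> out(in.size());
    int running = 0;
    for (std::size_t i = 0; i < in.size(); ++i) {
        out[i] = running;   // sum of everything strictly before i
        running += in[i];
    }
    return out;
}

// Stream compaction in three steps: map -> scan -> scatter.
// Assumed predicate: keep elements that are non-zero.
std::vector<int> compactNonZero(const std::vector<int>& in) {
    if (in.empty()) return {};

    // Map: 1 where the element is kept, 0 where it is dropped.
    std::vector<int> flags(in.size());
    for (std::size_t i = 0; i < in.size(); ++i) {
        flags[i] = (in[i] != 0) ? 1 : 0;
    }

    // Scan: the exclusive scan of the flags gives each kept element
    // its index in the output.
    const std::vector<int> idx = exclusiveScan(flags);

    // Number of kept elements = last index + last flag.
    const std::size_t kept =
        static_cast<std::size_t>(idx.back() + flags.back());
    std::vector<int> out(kept);

    // Scatter: write every kept element to its scanned index.
    for (std::size_t i = 0; i < in.size(); ++i) {
        if (flags[i]) {
            assert(static_cast<std::size_t>(idx[i]) < out.size());
            out[static_cast<std::size_t>(idx[i])] = in[i];
        }
    }
    return out;
}
```

On a GPU, the map and scatter steps are one thread per element, so the scan is the only step that needs a parallel algorithm. That is why the naive, work-efficient, and Thrust variants differ only in how the scan is done.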

The naive shared-memory scan as given in the CUDA book is not correct: it leads to an invalid memory access (a read of uninitialized memory).
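
The note above does not say where the book listing goes wrong. The sketch below shows one way a double-buffered naive scan can avoid reading unwritten memory. It assumes the problem is of the kind where the output half of the shared array is accumulated into before anything has been written to it. It is a host-side C++ emulation, not the kernel from this PR: `temp` stands in for a shared array of `2 * n` elements, each inner loop over `thid` stands in for the threads of one block, and the end of each inner loop stands in for `__syncthreads()`. The name `naiveScanDoubleBuffered` and the `defined` bookkeeping are hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// CPU emulation of a single-block naive (Hillis-Steele) exclusive scan
// with ping-pong buffers. Hypothetical name, for illustration only.
std::vector<int> naiveScanDoubleBuffered(const std::vector<int>& in) {
    const std::size_t n = in.size();
    std::vector<int> temp(2 * n);             // stand-in for the shared array
    std::vector<bool> defined(2 * n, false);  // which slots hold written data
    std::size_t pout = 0, pin = 1;            // output half, input half

    // Load the input shifted right by one, which makes the scan exclusive.
    for (std::size_t thid = 0; thid < n; ++thid) {
        temp[pout * n + thid] = (thid > 0) ? in[thid - 1] : 0;
        defined[pout * n + thid] = true;
    }

    for (std::size_t offset = 1; offset < n; offset *= 2) {
        std::swap(pout, pin);  // the half just written becomes the input
        for (std::size_t thid = 0; thid < n; ++thid) {
            // Read only from the input half, which is fully written.
            assert(defined[pin * n + thid]);
            int value = temp[pin * n + thid];
            if (thid >= offset) {
                value += temp[pin * n + thid - offset];
            }
            // Plain assignment: the output half is never read in this pass.
            temp[pout * n + thid] = value;
            defined[pout * n + thid] = true;
        }
    }

    // The last half written holds the result.
    const auto first = temp.begin() + static_cast<std::ptrdiff_t>(pout * n);
    return std::vector<int>(first, first + static_cast<std::ptrdiff_t>(n));
}
```

Each pass computes its value entirely from the input half and then assigns it to the output half. Writing the update as an accumulation into the output half (`+=`) would instead read a slot that has not been written yet on the first pass, and a stale slot on later passes.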


Yu Sun and others added 12 commits September 11, 2018 16:00

cpu scan done on mac

cace42a

naive scan completed on mac

dded004

working on compaction on mac

8965c15

finish coding on mac, waiting for debugging on windows

373d4a6

adding shared mem file

99e8c93

normal function done

a823af4

naive shared done

33dd869

implmentation of efficient shared memory

699f879

shared memory done, doing performance analysis

22605d5

done

9fc37e3

correct image source

894f6d6

Update README.md

18637d6

explain it more clearly