Show simple item record

Proximity coherence for chip-multiprocessors

dc.contributorMoore, Simon
dc.creatorBarrow-Williams, Nick
dc.date.accessioned2018-11-24T13:11:24Z
dc.date.available2012-01-10T14:35:56Z
dc.date.available2018-11-24T13:11:24Z
dc.date.issued2011-11-08
dc.identifierhttp://www.dspace.cam.ac.uk/handle/1810/241042
dc.identifierhttps://www.repository.cam.ac.uk/handle/1810/241042
dc.identifier.urihttp://repository.aust.edu.ng/xmlui/handle/123456789/2947
dc.description.abstractMany-core architectures provide an efficient way of harnessing the growing numbers of transistors available in modern fabrication processes; however, the parallel programs run on these platforms are increasingly limited by the energy and latency costs of communication. Existing designs provide a functional communication layer but do not necessarily implement the most efficient solution for chip-multiprocessors, placing limits on the performance of these complex systems. In an era of increasingly power limited silicon design, efficiency is now a primary concern that motivates designers to look again at the challenge of cache coherence. The first step in the design process is to analyse the communication behaviour of parallel benchmark suites such as Parsec and SPLASH-2. This thesis presents work detailing the sharing patterns observed when running the full benchmarks on a simulated 32-core x86 machine. The results reveal considerable locality of shared data accesses between threads with consecutive operating system assigned thread IDs. This pattern, although of little consequence in a multi-node system, corresponds to strong physical locality of shared data between adjacent cores on a chip-multiprocessor platform. Traditional cache coherence protocols, although often used in chip-multiprocessor designs, have been developed in the context of older multi-node systems. By redesign- ing coherence protocols to exploit new patterns such as the physical locality of shared data, improving the efficiency of communication, specifically in chip-multiprocessors, is possible. This thesis explores such a design – Proximity Coherence – a novel scheme in which L1 load misses are optimistically forwarded to nearby caches via new dedicated links rather than always being indirected via a directory structure.
dc.languageen
dc.publisherUniversity of Cambridge
dc.publisherFaculty of Computer Science and Technology
dc.publisherComputer Laboratory
dc.publisherTrinity Hall
dc.subjectComputer science
dc.subjectHardware design
dc.subjectProximity Coherence
dc.subjectCMP
dc.subjectCache design
dc.subjectNetwork-on-chip
dc.subjectPhysical locality
dc.titleProximity coherence for chip-multiprocessors
dc.typeThesis


Files in this item

FilesSizeFormatView
nick_barrow_williams_phd_thesis.pdf5.028Mbapplication/pdfView/Open

This item appears in the following Collection(s)

Show simple item record