Title: | Scheduling Schemes for Reducing the False Sharing in Cache Coherent Multiprocessors |
Authors: | Jou, Ji-Ping (周治平); Shieh, Shiuh-Pyng (谢续平); Institute of Computer Science and Engineering |
Keywords: | false sharing; multiprocessors; thread scheduling; cache coherence; parallel computing |
Issue Date: | 1995 |
Abstract: | With the rapid development of computers, parallel computing has become a common approach to solving complex problems, and its importance keeps growing because it speeds up program execution. Cache coherent multiprocessor systems were developed to meet this need. Although these systems have many advantages, they suffer from the sharing problem, which covers the problems caused by both true sharing data and false sharing data. These problems frequently cause cache misses and increase bus contention, which degrades overall system performance. Existing solutions rely purely on the compiler to reallocate data so that false sharing is removed, but these techniques have drawbacks. Moreover, conventional thread scheduling schemes do not consider the sharing problem. We therefore propose new thread scheduling schemes that reduce the overhead caused by the sharing problem, especially false sharing. Threads that share data are dispatched into different time slices, so that even though true or false sharing data still exist, they no longer cause the sharing problem. We also construct a simulated cache coherent multiprocessor environment and run parallel programs on it to compare our scheduling schemes with a conventional scheduling scheme. The results show that our schemes resolve most of the sharing problem and improve overall system performance. |
URI: | http://140.113.39.130/cdrfb3/record/nctu/#NT840392002 http://hdl.handle.net/11536/60342 |
Appears in Collections: | Thesis |
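
The abstract describes dispatching threads that share data into different time slices. The following is only a minimal sketch of that general idea, not the algorithm from the thesis, which this record does not give. It assumes each thread's accessed byte addresses are known in advance, treats two threads as conflicting when they touch a common cache line, and places threads greedily. The names `LINE_SIZE`, `cache_lines`, `build_conflicts`, and `assign_time_slices`, as well as the 32-byte line size, are hypothetical choices made for illustration.

```python
from itertools import combinations

LINE_SIZE = 32  # assumed cache line size in bytes (hypothetical value)


def cache_lines(addresses, line_size=LINE_SIZE):
    """Return the set of cache-line indices covered by the byte addresses."""
    return {addr // line_size for addr in addresses}


def build_conflicts(thread_addrs, line_size=LINE_SIZE):
    """Two threads conflict if they touch at least one common cache line.

    This covers true sharing (same address) and false sharing
    (different addresses that fall in the same line).
    """
    lines = {t: cache_lines(a, line_size) for t, a in thread_addrs.items()}
    conflicts = {t: set() for t in thread_addrs}
    for a, b in combinations(thread_addrs, 2):
        if lines[a] & lines[b]:
            conflicts[a].add(b)
            conflicts[b].add(a)
    return conflicts


def assign_time_slices(thread_addrs, num_cpus, line_size=LINE_SIZE):
    """Greedy placement (an assumption, not the thesis's scheme).

    Each thread goes into the first time slice that still has a free
    processor and contains no conflicting thread; otherwise a new
    slice is opened. Threads with more conflicts are placed first.
    """
    conflicts = build_conflicts(thread_addrs, line_size)
    slices = []
    for t in sorted(thread_addrs, key=lambda t: -len(conflicts[t])):
        for s in slices:
            if len(s) < num_cpus and not (conflicts[t] & set(s)):
                s.append(t)
                break
        else:
            slices.append([t])
    return slices


# T0 and T1 use different addresses inside the same 32-byte line (false sharing).
example = {"T0": [0, 4], "T1": [8, 12], "T2": [64], "T3": [96]}
print(assign_time_slices(example, num_cpus=2))
# prints [['T0', 'T2'], ['T1', 'T3']]
```

In this example T0 and T1 never run in the same time slice, so the cache line they share is not invalidated back and forth between processors, while the unrelated threads T2 and T3 fill the remaining processors.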