在一個動態轉譯引擎中優化SIMD指令之生成

Full metadata record

DC Field	Value	Language
dc.contributor.author	傅勝余	en_US
dc.contributor.author	Fu, Sheng-Yu	en_US
dc.contributor.author	徐慰中	en_US
dc.contributor.author	Hsu, Wei-Chung	en_US
dc.date.accessioned	2014-12-12T02:41:51Z	-
dc.date.available	2014-12-12T02:41:51Z	-
dc.date.issued	2013	en_US
dc.identifier.uri	http://140.113.39.130/cdrfb3/record/nctu/#GT070156049	en_US
dc.identifier.uri	http://hdl.handle.net/11536/74892	-
dc.description.abstract	Modern processors are increasingly enhanced with SIMD instructions. For examples, the MMX, SSE, and AVX instructions in the x86 architecture, and the Neon instruction set in the ARM architecture are all SIMD instructions. Using these SIMD instructions could significantly increase the performance of applications, hence application binaries are likely to have a greater fraction of instructions that are SIMD instructions. However, SIMD instruction translation has not attacked much attention in Dynamic Binary Translation (DBT). For example, in the popular QEMU system emulator, guest SIMD instructions are often emulated with a sequence of scalar instructions even when the host machines do have SIMD instructions to support such parallel computation, leaving a large potential for performance enhancement. In this thesis, we propose two approaches, one to leverage the existing helper function implementation in QEMU, and the other to use a newly introduced vector IR (Intermediate Representation) to enhance the performance of SIMD instructions translation in DBT of QEMU. The two approaches have been implemented in the QEMU with ARM frontend and x86-64 backend. In our experiment, the vector IR QEMU is 1.01 to 5.55 times faster than original QEMU with benchmark SPEC2006 CFP and 7.61 times faster than original QEMU with benchmark Linpack.	zh_TW
dc.description.abstract	Modern processors are increasingly enhanced with SIMD instructions. For examples, the MMX, SSE, and AVX instructions in the x86 architecture, and the Neon instruction set in the ARM architecture are all SIMD instructions. Using these SIMD instructions could significantly increase the performance of applications, hence application binaries are likely to have a greater fraction of instructions that are SIMD instructions. However, SIMD instruction translation has not attacked much attention in Dynamic Binary Translation (DBT). For example, in the popular QEMU system emulator, guest SIMD instructions are often emulated with a sequence of scalar instructions even when the host machines do have SIMD instructions to support such parallel computation, leaving a large potential for performance enhancement. In this thesis, we propose two approaches, one to leverage the existing helper function implementation in QEMU, and the other to use a newly introduced vector IR (Intermediate Representation) to enhance the performance of SIMD instructions translation in DBT of QEMU. The two approaches have been implemented in the QEMU with ARM frontend and x86-64 backend. In our experiment, the vector IR QEMU is 1.01 to 5.55 times faster than original QEMU with benchmark SPEC2006 CFP and 7.61 times faster than original QEMU with benchmark Linpack.	en_US
dc.language.iso	en_US	en_US
dc.subject	模擬器	zh_TW
dc.subject	QEMU	en_US
dc.title	在一個動態轉譯引擎中優化SIMD指令之生成	zh_TW
dc.title	Improvement of SIMD Code Generation in a Dynamic Binary Translator	en_US
dc.type	Thesis	en_US
dc.contributor.department	資訊科學與工程研究所	zh_TW
Appears in Collections:	Thesis

APA	傅., Fu, S., 徐., & Hsu, W. (2013). 在一個動態轉譯引擎中優化SIMD指令之生成. http://hdl.handle.net/11536/74892.
Bibtex	@article{傅勝余 and Fu2013, title={在一個動態轉譯引擎中優化SIMD指令之生成}, author={傅勝余 and Fu, Sheng-Yu and 徐慰中 and Hsu, Wei-Chung}, journal={http://hdl.handle.net/11536/74892}, year={2013}, url={https://ir.lib.nycu.edu.tw/handle/11536/74892?mode=full}, }