完整後設資料紀錄
DC 欄位語言
dc.contributor.authorYu, JSKen_US
dc.contributor.authorHwang, JKen_US
dc.contributor.authorTang, CYen_US
dc.contributor.authorYu, CHen_US
dc.date.accessioned2014-12-08T15:39:31Z-
dc.date.available2014-12-08T15:39:31Z-
dc.date.issued2004-03-01en_US
dc.identifier.issn0095-2338en_US
dc.identifier.urihttp://hdl.handle.net/11536/26978-
dc.description.abstractA number of recently released numerical libraries including Automatically Tuned Linear Algebra Subroutines (ATLAS) library, Intel Math Kernel Library (MKL), GOTO numerical library, and AMD Core Math Library (ACML) for AMD Opteron processors, are linked against the executables of the Gaussian 98 electronic structure calculation package, which is compiled by updated versions of Fortran compilers such as Intel Fortran compiler (ifc/efc) 7.1 and PGI Fortran compiler (pgf77/pgf90) 5.0. The We 7.1 delivers about 3% of improvement on 32-bit machines compared to the former version 6.0. Performance improved from pgf77 3.3 to 5.0 is also around 3% when utilizing the original unmodified optimization options of the compiler enclosed in the software. Nevertheless, if extensive compiler tuning options are used, the speed can be further accelerated to about 25%. The performances of these fully optimized numerical libraries are similar. The double-precision floating-point (FP) instruction sets (SSE2) are also functional on AMD Opteron processors operated in 32-bit compilation, and Intel Fortran compiler has performed better optimization. Hardware-level tuning is able to improve memory bandwidth by adjusting the DRAM timing, and the efficiency in the CL2 mode is further accelerated by 2.6% compared to that of the CL2.5 mode. The FP throughput is measured by simultaneous execution of two identical copies of each of the test jobs. Resultant performance impact suggests that IA64 and AMD64 architectures are able to fulfill significantly higher throughput than the IA32, which is consistent with the SpecFPrate2000 benchmarks.en_US
dc.language.isoen_USen_US
dc.titleNumerical performance and throughput benchmark for electronic structure calculations in PC-Linux systems with new architectures, updated compilers, and librariesen_US
dc.typeArticleen_US
dc.identifier.journalJOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCESen_US
dc.citation.volume44en_US
dc.citation.issue2en_US
dc.citation.spage635en_US
dc.citation.epage642en_US
dc.contributor.department生物科技學系zh_TW
dc.contributor.departmentDepartment of Biological Science and Technologyen_US
dc.identifier.wosnumberWOS:000220320000045-
dc.citation.woscount0-
顯示於類別:期刊論文


文件中的檔案:

  1. 000220320000045.pdf

若為 zip 檔案,請下載檔案解壓縮後,用瀏覽器開啟資料夾中的 index.html 瀏覽全文。