文件名称:smaller_and_faster_intel_sse_code
介绍说明--下载内容均来自于网络,请自行研究使用
Codes exhibiting suitable data-flow parallelism can often
profit from using Intel SSE, a SIMD extension to the CISC style In-tel 64 instruction set architecture. As SSE instructions are, on average,
larger than scalar instructions, they exhibit a heavier load on instruc-tion pre-decoding, decoding, and caching hardware. For long straight-line
SSE codes, instruction lengths become an obstacle to high performance
that is not adequately handled by available optimizing compilers
profit from using Intel SSE, a SIMD extension to the CISC style In-tel 64 instruction set architecture. As SSE instructions are, on average,
larger than scalar instructions, they exhibit a heavier load on instruc-tion pre-decoding, decoding, and caching hardware. For long straight-line
SSE codes, instruction lengths become an obstacle to high performance
that is not adequately handled by available optimizing compilers
(系统自动生成,下载前可以参看下载内容)
下载文件列表
smaller_and_faster_intel_sse_code.pdf