Windows 7/64上的VirtualAllocExNuma内存访问时间较慢

Question

Windows 7/64上的VirtualAllocExNuma内存访问时间较慢

pho*_*tom 7 c++ numa visual-studio-2008 windows-7

在我们的应用程序中,我们运行在一个双Xeon服务器上,每个处理器的内存配置为12gb,连接两个Xeon的内存总线.出于性能原因,我们希望控制分配大(> 6gb)内存块的位置.以下是简化代码 -

DWORD processorNumber = GetCurrentProcessorNumber();
UCHAR   nodeNumber = 255;
GetNumaProcessorNode((UCHAR)processorNumber, &nodeNumber );
// get amount of physical memory available of node.
ULONGLONG availableMemory = MAXLONGLONG;
GetNumaAvailableMemoryNode(nodeNumber, &availableMemory )
// make sure that we don't request too much.  Initial limit will be 75% of available memory
_allocateAmt = qMin(requestedMemory, availableMemory * 3 / 4);
// allocate the cached memory region now.
HANDLE handle = (HANDLE)GetCurrentProcess ();
cacheObject = (char*) VirtualAllocExNuma (handle, 0, _allocateAmt, 
            MEM_COMMIT | MEM_RESERVE ,
            PAGE_READWRITE| PAGE_NOCACHE , nodeNumber);

Run Code Online (Sandbox Code Playgroud)

代码原样,在Win 7/64上使用VS2008正常工作.

在我们的应用程序中,这块内存用作静态对象(1-2mb ea)的缓存存储,通常存储在硬盘驱动器上.我的问题是,当我们使用memcpy将数据传输到缓存区域时,它比我们使用分配内存的时间长10倍new char[xxxx].并且没有其他代码更改.

我们无法理解为什么会这样.关于在哪里看的任何建议？

Answer 1

Han*_*ant 7

PAGE_NOCACHE是perf的谋杀案,它会禁用CPU缓存.这是故意的吗？

不，那不是我的本意。我原以为禁用了内存块的磁盘缓存，而不是 CPU。这确实解决了我的大部分性能问题。谢谢。 (2认同)

归档时间：	15 年，4 月前
查看次数：	1106 次
最近记录：	15 年，4 月前