MMX moves (Developers)
There's no speed to gain simply by using MMX registers instead of the standard ones. At least you'll have to use the MOVNTQ instruction to get a significant boost.
A pretty good pdf about this topic, which you hopefully can find via Google, is: gdc_2002_amd.pdf
It shows how to achieve faster memcopy speed by using several "tricks" (MMX, XMM, "prefetch" ). IIRC I once wrote a small tool which implemented most of the strategies mentioned in this document. There's a small chance that I'll be able to remember how I named it.
---
MS-DOS forever!
Complete thread:
Mix view