comparison arm/dsputil_vfp.S @ 11032:01bd040f8607 libavcodec

Unroll the main loop so that the edge==0 case is separate. This allows many things to be simplified away. The h264 decoder is overall 1% faster with an MBAFF sample and 0.1% slower with the cathedral sample, probably because the slow loop filter code must be loaded into the code cache for the first MB of each row but isn't used for the following MBs.
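A minimal sketch of the kind of transformation the message describes, assuming the loop walks edges 0..edges-1 and treats edge 0 specially. This is not the code from the changeset: filter_strong, filter_normal, filter_rolled and filter_peeled are hypothetical stand-ins that only count calls, so that the two loop shapes can be compared.

```c
/* Hypothetical stand-ins, not FFmpeg functions: they only count calls. */
static int strong_calls, normal_calls;

static void filter_strong(int edge) { (void)edge; strong_calls++; }
static void filter_normal(int edge) { (void)edge; normal_calls++; }

/* Before: every iteration tests for the edge == 0 special case. */
static void filter_rolled(int edges)
{
    for (int edge = 0; edge < edges; edge++) {
        if (edge == 0)
            filter_strong(edge);
        else
            filter_normal(edge);
    }
}

/* After: the edge == 0 iteration is pulled out of the loop, so the
 * remaining iterations need no special-case branch. */
static void filter_peeled(int edges)
{
    if (edges <= 0)
        return;
    filter_strong(0);
    for (int edge = 1; edge < edges; edge++)
        filter_normal(edge);
}
```

Both versions make the same calls in the same order. In the second one the special-case code lives outside the loop body, which is consistent with the message's guess that the slow code is fetched for the first MB of a row and then goes unused.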
author michael
date Thu, 28 Jan 2010 01:24:25 +0000
parents bdcc1c52f223
children 361a5fcb4393
comparison
11031:b37a53d101c3 11032:01bd040f8607