2011-04-19, 10:06
McGeagh Wrote:after a bit of research, vfpv3-d16 is no different to standard vfpv3 (except has half the available double-registers... of which we are only using 1) so it should work for tg2 too.
i am unsure if vfpv3 code will work on vfpv2 though (says it is backwards compatible)
Also there is no need to do a NEON version, as that will only give a performance increase if it were vectorised... and currently that is not the case if changing mathutils only.
I will try test my vfpv3 and commit later.
send me the code I will test it on my board for vfpv2 compatibility
P.S. It seems that VFP code run much slower on NEON capable CPU-s. e.g there is test of VFP matrix multiplications on iPhone 3GS and VFP code is slower than ordinary C code and much slower than NEON code ... go figure it out.