Bug 4988 - [PATCH] micro-optimize VectorNormalize
Status: RESOLVED FIXED
Alias: None
Product: ioquake3
Classification: Unclassified
Component: Platform
Version: GIT MASTER
Hardware: PC Linux
: P3 normal
Assignee: Zachary J. Slater
QA Contact: ioquake3 bugzilla mailing list
URL:
Depends on:
Blocks:
 
Reported: 2011-05-14 18:27 EDT by Matt Turner
Modified: 2011-05-15 10:55:59 EDT
1 user (show)

See Also:


Attachments
Micro-optimize VectorNormalize{,2} (1.24 KB, patch)
2011-05-14 18:27 EDT, Matt Turner
c.c (331 bytes, text/plain)
2011-05-14 18:29 EDT, Matt Turner

Description Matt Turner 2011-05-14 18:27:58 EDT
Created attachment 2713 [details]
Micro-optimize VectorNormalize{,2}

The current VectorNormalize{,2} functions call sqrt unnecessarily.

vec_t VectorNormalize( vec3_t v ) {
    // NOTE: TTimo - Apple G4 altivec source uses double?
    float   length, ilength;

    length = v[0]*v[0] + v[1]*v[1] + v[2]*v[2];
    length = sqrt (length);

    if ( length ) {
        ilength = 1/length;
        v[0] *= ilength;
        v[1] *= ilength;
        v[2] *= ilength;
    }

    return length;
}

sqrt(length) == 0.0f if and only if length = 0.0f. Thus, the sqrt should be moved inside the if statement.

Additionally, gcc is unable to recognize that it may use reciprocal-sqrt when available to calculate ilength. As such, it's more efficient to do this

ilength = 1/sqrtf(length);
length *= ilength;

rather than

length = sqrt(length);
ilength = 1/length;

since the first consists of a reciprocal-sqrt and multiply and the second is a sqrt and a reciprocal or divide.

Also note, sqrt_f_ should be used instead of sqrt. Otherwise gcc wants to convert to and from double-precision which is silly. sqrtf isn't available to QVMs though, so casting (float)sqrt(length) is used since it generates identical code to sqrtf.

Attached is the final patch.
Comment 1 Matt Turner 2011-05-14 18:29:16 EDT
Created attachment 2714 [details]
c.c

If you're worried about whether floating-point rounding errors cause sqrt(x) == 0.0 but x != 0.0, try to make this program assert().
Comment 2 Thilo Schulz 2011-05-15 10:55:59 EDT
committed r1990