FFTW

For version 2.1.3, 7 November 1999


(1)

The output for the multi-dimensional rfftw is a more-conventional array of fftw_complex values, but the format here permitted us greater efficiency in one dimension.

(2)

The basic problem is the resolution of the clock: FFTW needs to run for a certain time for the clock to be reliable.

(3)

fftwnd actually may use some temporary storage (hidden in the plan), but this storage space is only the size of the largest dimension of the array, rather than being as big as the entire array. (Unless you use fftwnd to perform one-dimensional transforms, in which case the temporary storage required for in-place transforms is as big as the entire array.)

(4)

The etymologically-correct spelling would be frftw_, but it is hard to remember.

(5)

There is one exception: when performing one-dimensional in-place transforms, the out parameter is always ignored by the multi-threaded routines, instead of being used as a workspace if it is non-NULL as in the uniprocessor routines. The multi-threaded routines always allocate their own workspace (the size of which depends upon the number of threads).

(6)

The 1D transforms require much more communication. All the communication in our FFT routines takes the form of an all-to-all communication: the multi-dimensional transforms require two all-to-all communications (or one, if you use FFTW_TRANSPOSED_ORDER), while the one-dimensional transforms require three (or two, if you use scrambled input or output).

(7)

An FFT is particularly hard on communications systems, as it requires an all-to-all communication, which is more or less the worst possible case.

(8)

Technically, Fortran 77 identifiers are not allowed to have more than 6 characters, nor may they contain underscores. Any compiler that enforces this limitation doesn't deserve to link to FFTW.

(9)

Each version of cc seems to have its own magic incantation to get the fastest code most of the time--you'd think that people would have agreed upon some convention, e.g. "-Omax", by now.


This document was generated on 7 November 1999 using the texi2html translator version 1.52. (properly hacked by athena@theory.lcs.mit.edu)