AI Engine Intrinsics User Guide
(v2023.2)
|
This intrinsic function is used to perform Direct Digital Synthesis (DDS) interpolation between two points on a unit circle on the IQ plane.
Carrier extraction needs 8 sin/cos pairs equally spaced in phase. This function ensures that only every 8th sin/cos must be calculated. It uses an interpolating FIR filter to obtain the other sin/cos pairs.
The function is a 16 tap FIR filter with conjugate symmetric taps taking two sin/cos points as inputs and producing 8 as outputs.
Functions | |
v8cacc48 | dds_ipol (v16cint16 xbuf, int xidx1, int xidx2, int swap, v16int16 zbuf) |
This intrinsic function interpolates the input by 8, using a 16 tap complex symmetric FIR filter. | |
v8cacc48 | dds_ipol (v32cint16 xbuf, int xidx1, int xidx2, int swap, v16int16 zbuf) |
v8cacc48 dds_ipol | ( | v16cint16 | xbuf, |
int | xidx1, | ||
int | xidx2, | ||
int | swap, | ||
v16int16 | zbuf | ||
) |
This intrinsic function interpolates the input by 8, using a 16 tap complex symmetric FIR filter.
This intrinsic function is used by the Direct Digital Synthesis (DDS) implementation on the AIE to perform an interpolation by 8 between two points on a unit circle on the IQ plane.
xbuf | Vector of 16 complex values containing the two points on the IQ unit circle to interpolate between | Valid bits: all. |
xidx1 | Index to first interpolate value in d. This must be a compile time constant. | Valid bits: 5b LSB. |
xidx2 | Index to second interpolate value in d. This must be a compile time constant. | Valid bits: 3b LSB. |
swap | Parameter which swaps real and imaginary values of the data. If the swap flag is set, the bits representing the imaginary part of the data will come first. This must be a compile time constant. | Valid bits: 1b LSB. |
zbuf | Vector of 16 integers containing the pregenerated filter taps to be used. | Valid bits: all. |
The two input points are taken from the vector xbuf at the points indicated by the two index parameters given, xidx1 and xidx2.
The output of the function is a Vector of 8 complex values stored in an accumulator.
The effective calculation is as follows:
and produces 8 outputs in two cycles since it requires 64 multiplications in 16 bit, where x0 = xbuf[xidx1 % dsz ] and x1 = xbuf[(xidx1 & (dsz-1)) | (xidx2 % 8)]
The complex values that are part of the effective calculation shown above are converted into another set of real coefficients that are passed into this function as the v16int16 z. It is this calculation that exploits the symmetric nature of the filter to reduce the number of real multiplications to 32.
This zbuf vector is pre-calculated and stored in AIE memory as follows:
where z[0]-z[15] is the input zbuf
These z values are then used in the following formulation of intermediate values m0a-m3a and m0b-m3b
These intermediate terms are then used to generate the final output as follows:
NB: it should be noted that in order to facilitate more efficient movement of values between registers, o(4)->o(7) are reversed as the contents of the adders do not need to changed between the two operations of this function.
because of this the next function after the dds_ipol must reverse this.
v8cacc48 dds_ipol | ( | v32cint16 | xbuf, |
int | xidx1, | ||
int | xidx2, | ||
int | swap, | ||
v16int16 | zbuf | ||
) |