FPGA Implementation of Arcsine Function Using CORDIC Algorithm

Anurup Saha, Archisman Ghosh, K. Gaurav Kumar

ADES Lab, Dept. of E.T.C.E, Jadavpur University, Kolkata 700032, India
(sahaanurup24@gmail.com, archismanghosh12@gmail.com, kgauravkumar35@gmail.com)

Abstract

This paper presents finite state machine based hardware to calculate arcsine function using CORDIC algorithm. CORDIC algorithms provide an effective methodology to compute a large range of transcendental functions, since it only requires addition, shift and subtraction operations. The design has been done using verilog HDL and it has been tested in SPARTAN-3 FPGA. Since different applications may require different accuracy, the focus has been on how an FPGA implementation of arcsine function can be easily reconfigured when higher precision is required.

Key words
CORDIC, FPGA, Computer arithmetic, Arcsine, ASM.

1. Introduction

CORDIC, abbreviated from Coordinate Rotation Digital Computer, was introduced by Jack E. Volder [1]. CORDIC algorithms can be used to evaluate elementary functions like trigonometric functions, hyperbolic functions etc. [2] [3]

CORDIC algorithms are iterative. So number of iterations is chosen according to required precision.

There are 2 other approaches to compute such functions- (a) Look-Up Table (LUT) based implementation and (b) Polynomial Approximation. [4] [5] LUT-based designs can be error-prone, unless the size of LUT is sufficiently large. But, storing large LUT is a cost-inefficient solution to the problem.
Polynomial approximation, derived from Taylor series, has the disadvantage that it requires multipliers. On the other hand, CORDIC algorithms can be implemented using adders, subtractors and shifters only.

Due to its simplicity, CORDIC algorithms are used in a wide range of applications like the HP-35 calculator, 8087 mathcoprocessor [6], radar signal processors [7] and robotics.

2. CORDIC Algorithm

In 2-D plane, when a vector P(x,y) is rotated by an angle $\alpha$ as shown in figure-1, new coordinates of the vector $P'(x',y')$ can be obtained by,

\[
x' = x \cos \alpha - y \sin \alpha = \cos \alpha (x - y \tan \alpha) \tag{1}
\]
\[
y' = y \cos \alpha + x \sin \alpha = \cos \alpha (y + x \tan \alpha) \tag{2}
\]

If $\tan \alpha = 2^d$, the expression within bracket can be calculated using shift, subtraction and add operation. By choosing a proper sequence of rotations $\alpha_0, \alpha_1, \ldots, \alpha_n$, necessary trigonometric functions can be evaluated.

![Fig.1. Rotating Vector on 2-D Plane](image)

3. Steps to Compute Arcsine

By starting from the X-axis with a vector of pre-computed length, arcsine function is determined. To compute the arcsine of a given number $z$, i.e $\sin^{-1}(z)$, the following equations are used:
\[ x_{i+1} = x_i - m \cdot \frac{y_i}{2^i} \]
\[ y_{i+1} = y_i + m \cdot \frac{x_i}{2^i} \]
\[ \theta_{i+1} = \theta_i + m \cdot \tan^{-1}\left(\frac{1}{2^i}\right) \]

where, \( m = +1 \) if, \( y_i < z \) otherwise.

Based on these equations, an Algorithmic State Machine with Datapath (ASMD) chart [8] can be developed as shown below:

Fig. 2. ASMD Chart for Evaluation of Arcsine Function

Here \( N \) represents the number of iterations. As \( N \) increases, accuracy of the system improves.

The hardware architecture is as follows:
Here state_reg represents the present state of the finite state machine. A global clock has been used for synchronous operations.

4. Results and System Performance

Verilog hardware description language has been used for implementation. The design was tested in SPARTAN-3 FPGA (XC3S50). Hardware resources required for the design can be summarized as:
Computation time and accuracy depends on N. If larger N is chosen, it results in higher precision and more computation time. For example, \( \sin^{-1}(0.80) = 53.13^\circ \). Percentage of error, and number of clock cycles required for computation have been shown as a function of N in Table.2.

<table>
<thead>
<tr>
<th>N</th>
<th>Number of clock cycles</th>
<th>Error (%)</th>
</tr>
</thead>
<tbody>
<tr>
<td>8</td>
<td>26</td>
<td>0.51%</td>
</tr>
<tr>
<td>9</td>
<td>29</td>
<td>0.13%</td>
</tr>
<tr>
<td>10</td>
<td>32</td>
<td>0.06%</td>
</tr>
</tbody>
</table>

It is to be noted that the amount of computation in each iteration is constant. As a result number of clock cycles required is a linear function of N.

**Conclusions**

In this paper, a novel FSM-based hardware to evaluate arcsine function has been proposed using CORDIC algorithm. Albeit CORDIC algorithms are relatively slower, simplicity of the algorithm ensures minimum requirement of hardware resources. As a result, these algorithms are being used in various applications. Also, depending on the necessity of the application, the architecture can be easily modified to operate under higher accuracy.

The concept can be used to determine arccosine and arctangent functions by choosing appropriate direction of rotations.

**Acknowledgment**

The authors would like to thank Prof. M. K. Naskar, Asim Maiti for their constant help, guidance, and support.
References
8. P.P. Chu, FPGA prototyping by verilog examples, John Wiley & Sons, pp 139-141.