## Exam solutions ASIC (TSTE81) 000504

1. a) Yes. Example: 2 nd order recursive halfband direct form filter structure. The algorithm has 2 operations. If $\mathrm{T}_{\text {mult }}=3$ time units and $\mathrm{T}_{\text {add }}=1$ time unit is $\mathrm{T}_{\min }=(3+1) / 2=2$ time units. A schedule with $\mathrm{T}_{\text {sample }}=\mathrm{T}_{\min }$ requires 3 concurrent operations, thus three processing elements (assuming an execution time equal to the latency of the PE).
b) Lookahead FSM, concurrent block processing
c) No, the value of $\mathrm{T}_{\text {min }}$ may be any quotient. Example: $2^{\text {nd }}$ order halfband recursive direct form filter. Assume latency of multiplier to be 2 and adder to be 1 . Then $\mathrm{T}_{\min }=(2+1) / 2=1.5$.
d) CSDC (Canonic Sign Digit Code) is a non-redundant number representation. Each value has one unique representation in this number system.
e) Standard-Cell Design, Gate Array Design, Sea-of-Gates Design, Unconstrained Design.
2. a) Connect variables that can share memory cells. Find minimal number of cliques (fully connected subgraphs):

b) Sort variables by start time: h, c, g, a, e, d, f, b. Allocate memory cell and search for assignable variables starting from left in list. Remove variable from list when assigned. When list not empty and no variable possible to assign to current cell then allocate a new cell.

3. a)


## Exam solutions ASIC (TSTE81) 000504

b) $\mathrm{u}_{1}:=\mathrm{x}(\mathrm{n})+\mathrm{v}_{1}$;
$\mathrm{u}_{3}:=\mathrm{b}_{1}$;
$\mathrm{u}_{2}:=\mathrm{a}_{1} ;$
$\mathrm{u}_{4}:=\mathrm{u}_{2}+\mathrm{u}_{3} ;$
$\mathrm{u}_{5}:=\mathrm{x}(\mathrm{n})+\mathrm{u}_{4} ;$
$\mathrm{u}_{6}:=\mathrm{c}_{4}$;
$\mathrm{y}(\mathrm{n}):=\mathrm{u}_{6}+\mathrm{v}_{2} ;$
$\mathrm{v}_{2}:=\mathrm{v}_{1} ;$
$\mathrm{v}_{1}:=\mathrm{u}_{5} ;$
c)
$\mathrm{u}_{4}:=\mathrm{a}\left(\mathrm{x}(\mathrm{n})+\mathrm{v}_{1}\right)+\mathrm{bv}_{1} ;$
$\mathrm{y}(\mathrm{n}):=\mathrm{cu}_{4}+\mathrm{v}_{2} ;$
$\mathrm{v}_{2}:=\mathrm{v}_{1} ;$
$\mathrm{v}_{1}:=\mathrm{x}(\mathrm{n})+\mathrm{u}_{4} ;$
4. a) Multiplication latency is equal to number of fractional bits in the coefficient plus one due to model 1 logic.
$\mathrm{T}_{\mathrm{a}}=2+1=3$ clock cycles. $\mathrm{T}_{\mathrm{b}}=3+1=4$ clock cycles. $\mathrm{T}_{\mathrm{c}}=3+1=4$ clock cycles.
b) Number of clock cycles per sample $=150 \mathrm{MHz} / 10 \mathrm{MSamples} / \mathrm{s}=15$ clock cycles/sample. Replace all T-element with 15 flipflops and add 15 flipflops to the output. Propagate them so all operations gets computation time.

5. a) $\mathrm{T}_{\min }=\max \left(\mathrm{T}_{\text {opi }} / \mathrm{N}_{\mathrm{i}}\right)=(4+1) / 2=2.5$ clock cycles. $\mathrm{f}_{\max }=\mathrm{f}_{\mathrm{clk}} / \mathrm{T}_{\min }=70 / 2.5 \approx 28 \mathrm{MSamples} / \mathrm{s}$.
b) The multiplication operation is longer than the sample period and the critical loop contains two delay elements. Must therefore schedule over more than one sample period. Use two sample periods.

## Exam solutions ASIC (TSTE81) 000504


6. a) Reset all flipflops before starting the computation. Signbit is one when signbit of $x_{i}$ is input, zero otherwise.

b) The largest number is larger than 1 , so the number range must be increased from [-1,1[ to [-2,2[.

| $\mathrm{x}_{1} \mathrm{x}_{2} \mathrm{x}_{3}$ | ROM value |
| :---: | :--- |
| 000 | $00.000_{2 \mathrm{C}}\left(0_{10}\right)$ |
| 000 | $00.111_{2 \mathrm{C}}\left(0.875_{10}\right)$ |
| 0110 | $11.101_{2 \mathrm{C}}\left(-0.375_{10}\right)$ |
| 011 | $00.100_{2 \mathrm{C}}\left(0.5_{10}\right)$ |
| 100 | $00.010_{2 \mathrm{C}}\left(0.25_{10}\right)$ |
| 10 | $01.001_{2 \mathrm{C}}\left(1.125_{10}\right)$ |
| 11 | $11.111_{2 \mathrm{C}}\left(-0.125_{10}\right)$ |
| 11 | $00.110_{2 \mathrm{C}}\left(0.75_{10}\right)$ |

