## Modeling Custom Hardware in VHDL

Technical Report ICS-99-29 July 6, 1999

Heiko Lehr

Supervised by Prof. Daniel D. Gajski

Research Paper "Studienarbeit"

Department of Information and Computer Science University of California, Irvine Irvine, CA 92697-3425, USA (949) 824-8059

#### Abstract

This report focuses on models for describing Hardware at different refinement levels within High Level Synthesis flow: SFSMD, FSMD and FSM Controlling Datapath. Simple data interfaces are often needed in these models. The hardware description language VHDL is used and tested for implementing these models and interfaces. Problems inherent in the models as well as problems caused by VHDL are discussed. A model related main difficulty is the one-cycle delay of data processing, when introducing a datapath. VHDL descriptions of abstract models (SFSMD) can become complicated. Templates for general problems, examples and VHDL guidelines are provided in this report to help to minimize design errors significantly.

# Contents

| 1 |      | meaning of the models                                | 1   |
|---|------|------------------------------------------------------|-----|
|   | 1.1  | SFSMD                                                | ]   |
|   |      | 1.1.1 Behavioral Description                         | 2   |
|   |      | 1.1.2 Structural Description                         | 2   |
|   | 1.2  | FSMD                                                 | 2   |
|   |      | 1                                                    | 3   |
|   |      |                                                      |     |
|   | 1.3  | 0 1                                                  | 9   |
|   |      | 1.3.1 Behavioral Description                         | 4   |
|   |      | 1.3.2 Behavioral-Structural Description              | 4   |
|   |      | 1.3.3 Structural Description                         | 5   |
|   | 1.4  | FSM                                                  | 5   |
|   | 1.5  | Conclusion                                           | 5   |
| 2 | Sch  | duling, Allocation and Binding                       | 6   |
| 3 | VHI  | OL Programming Structure                             | e   |
|   |      |                                                      |     |
| 4 |      | ribing the Models in VHDL                            | 8   |
|   | 4.1  | Abstract and Introduction                            | 8   |
|   |      | 4.1.1 The State Machines Graphically                 | 8   |
|   |      | 4.1.2 Translation into VHDL                          | 8   |
|   | 4.2  | Examples and Guidelines                              |     |
|   |      | 4.2.1 SFSMD Behavioral                               |     |
|   |      | 4.2.2 FSMD Behavioral                                | 2   |
|   |      | 4.2.3 Discussion: FSMD Behavioral                    |     |
|   |      | without time or with clock                           | 3   |
|   |      | 4.2.4 FSM Behavioral Controlling Datapath Structural | 4   |
|   |      | 4.2.5 FSM Structural Controlling Datapath Structural | 4   |
|   |      | 4.2.6 Compensation of Datapath Delay                 | 4   |
| 5 | Con  | munication                                           |     |
|   | of S | rate Machines 2                                      | 5   |
|   | 5.1  | Single Handshake                                     | 5   |
|   | 5.2  | Double Handshake                                     |     |
|   | 5.3  | Examples:                                            |     |
|   |      | Data-Transfer Constructions in VHDL                  | 2   |
|   |      | 5.3.1 Direct Data-Transfer                           |     |
|   |      | 5.3.2 Data-Transfer via Queue                        |     |
|   |      | 5.3.3 Data-Transfer via Memory                       |     |
| 6 | Des  | ribing the Models in VHDL                            |     |
| Ŭ |      | mmary of Problems                                    |     |
|   |      | Conclusions 4                                        | Δ   |
|   | 6.1  | Single Models in VHDL                                |     |
|   | U.1  | 6.1.1 SFSMD and FSMD Without Time                    |     |
|   |      | 6.1.2 FSMD With Clock                                |     |
|   |      |                                                      |     |
|   | 6.9  | 0 1                                                  |     |
|   | 6.2  | Communicating Models in VHDL                         |     |
|   |      | 6.2.1 SFSMD and FSMD Without Time                    |     |
|   |      | 6.2.2 FSMD With Clock                                | . 0 |

|              | $\begin{array}{c} 6.3 \\ 6.4 \end{array}$ |                             | Controlling Datapath                                                                              |     |
|--------------|-------------------------------------------|-----------------------------|---------------------------------------------------------------------------------------------------|-----|
| Re           | efere                                     | $_{ m nces}$                |                                                                                                   | 46  |
| A            | Qu€                                       | eue wit                     | h Single Handshake                                                                                | 47  |
|              | <b>A</b> .1                               | FSMD                        |                                                                                                   | 47  |
|              | A.2                                       | FSM f                       | or "FSM Controlling Datapath"                                                                     | 48  |
| В            |                                           |                             | v                                                                                                 | 49  |
|              | B.1                                       | FSM f                       | or Queue with Simultaneous Read/Write                                                             | 49  |
|              | B.2                                       | Extend                      | led Timing for Memory                                                                             | 50  |
| $\mathbf{C}$ | VH                                        | DL exa                      | amples                                                                                            | 51  |
|              | C.1                                       |                             | Finite State Machines                                                                             | 51  |
|              |                                           | C.1.1                       | Single SFSMD                                                                                      |     |
|              |                                           | C.1.2                       | Single FSMD without time                                                                          | 54  |
|              |                                           | C.1.3                       | Single FSMD using clock                                                                           |     |
|              |                                           | C.1.4                       | Single FSM Controlling Datapath                                                                   | 58  |
|              |                                           | $\overline{\mathrm{C.1.5}}$ | 9 -                                                                                               |     |
|              |                                           | C.1.6                       |                                                                                                   |     |
|              | C.2                                       | Double                      | e Handshake                                                                                       | 64  |
|              |                                           | C.2.1                       |                                                                                                   |     |
|              |                                           |                             |                                                                                                   |     |
|              |                                           |                             |                                                                                                   |     |
|              |                                           | C.2.4                       | A (FSMD using clock) $\rightarrow$ B (FSMD using clock)                                           |     |
|              |                                           | _                           | FSMD A                                                                                            |     |
|              |                                           | C.2.6                       | FSMD B                                                                                            |     |
|              |                                           | C.2.7                       | A (FSM Controlling Datapath) → B (FSM Controlling Datapath)                                       |     |
|              |                                           | C.2.8                       | FSM Controlling Datapath A                                                                        |     |
|              |                                           | C.2.9                       | FSM of A                                                                                          |     |
|              |                                           | C.2.10                      | Datapath of A (identical B)                                                                       |     |
|              |                                           |                             | FSM Controlling Datapath B                                                                        |     |
|              |                                           |                             | FSM of B                                                                                          |     |
|              |                                           |                             | Datapath of B                                                                                     |     |
|              | C.3                                       |                             | er via Queue with Double Handshake                                                                |     |
|              |                                           |                             | A (FSMD without time) $\rightarrow$ Queue (FSMD without time) $\rightarrow$ B (FSMD without time) |     |
|              |                                           |                             | FSMD A                                                                                            |     |
|              |                                           | C.3.3                       | FSMD Queue                                                                                        |     |
|              |                                           | C.3.4                       | FSMD B                                                                                            |     |
|              |                                           | C.3.5                       | A (FSMD using clock) $\rightarrow$ Queue (FSMD using clock) $\rightarrow$ B (FSMD using clock)    |     |
|              |                                           | C.3.6                       | FSMD A                                                                                            |     |
|              |                                           | C.3.7                       | FSMD Queue                                                                                        |     |
|              |                                           | C.3.8                       | FSMD B                                                                                            |     |
|              |                                           | C.3.9                       | A (FSM Controlling Datapath) → Queue (FSM Controlling Datapath)                                   |     |
|              |                                           | 0.0.0                       | → B (FSM Controlling Datapath)                                                                    | 110 |
|              |                                           | C.3 10                      | FSM Controlling Datapath A                                                                        |     |
|              |                                           |                             | FSM of A                                                                                          |     |
|              |                                           |                             | Datapath of A (identical B)                                                                       |     |
|              |                                           |                             | FSM Controlling Datapath Queue                                                                    |     |
|              |                                           |                             | FSM of Queue                                                                                      |     |
|              |                                           |                             | Datapath of Queue (Buffered)                                                                      |     |
|              |                                           |                             | Datapath of Unbuffered Queue                                                                      |     |

|     | C.3.17 FSM Controlling Datapath B                            | 127 |
|-----|--------------------------------------------------------------|-----|
|     | C.3.18 FSM of B                                              | 129 |
|     | C.3.19 Datapath of B identical to A (see A)                  | 132 |
| C.4 | Transfer via Memory with Double Handshake                    | 133 |
|     | C.4.1 A (FSM Controlling Datapath) → Memory    Arbiter (FSM) |     |
|     | → B (FSM Controlling Datapath)                               | 134 |
|     | C.4.2 FSM Controlling Datapath A                             | 136 |
|     | C.4.3 FSM of A                                               | 138 |
|     | C.4.4 Datapath of A (identical B)                            | 142 |
|     | C.4.5 Memory                                                 | 144 |
|     | C.4.6 Arbiter (FSM)                                          | 145 |
|     | C.4.7 FSM Controlling Datapath B                             | 147 |
|     | C.4.8 FSM of B                                               | 149 |
|     | C.4.9 Datapath of B identical to A (see A)                   | 154 |

# List of Figures

| 1  | Models for HL-Synthesis                                                               |
|----|---------------------------------------------------------------------------------------|
| 2  | System Exploration Chart                                                              |
| 3  | One-State-SFSMD                                                                       |
| 4  | SFSMD                                                                                 |
| 5  | FSMD Behavioral                                                                       |
| 6  | FSM Controlling Datapath (1)                                                          |
| 7  | Multi purpose datapath                                                                |
| 8  | FSM Controlling Datapath (2)                                                          |
| 9  | General VHDL program structure                                                        |
| 10 | A state machine inside a procedure                                                    |
| 11 | A state machine inside a process                                                      |
| 12 | A state machine for controlling a separate datapath                                   |
| 13 | Example: SFSMD Behavioral in VHDL                                                     |
| 14 | Example: FSMD Behavioral Without Time in VHDL                                         |
| 15 | Example: FSMD Behavioral With Clock in VHDL                                           |
| 16 | Example: Datapath of "FSM Behavioral Controlling Datapath Structural"                 |
| 17 | Example: FSM of "FSM Behavioral Controlling Datapath Structural"                      |
| 18 | Example: FSM of "FSM Structural Controlling Datapath Structural"                      |
| 19 | Warning: Omitting separation of next-state-process causes additional output latches   |
| 20 | Embedding the example SFSMD (fig. 13) in VHDL                                         |
| 21 | Embedding the example FSMD Without Time (fig. 14) in VHDL                             |
| 22 | Embedding the example FSMD With Clock (fig. 15) in VHDL                               |
| 23 | Embedding the example "FSM Controlling Datapath" (fig. 16 and 17) in VHDL             |
| 24 | Single Handshake                                                                      |
| 25 | Cases, when using Single Handshake with two FSMD                                      |
| 26 | Cases, when using Single Handshake with two FSM                                       |
| 27 | Cases, when using Single Handshake with a FSM transmitting to a FSMD                  |
| 28 | Cases, when using Single Handshake with a FSMD transmitting to a FSM                  |
| 29 | Insertion of a "General State" in the receiving state machine of figure 30            |
| 30 | Double Handshake                                                                      |
| 31 | Timing problem when writing                                                           |
| 32 | Timing problems when receiving                                                        |
| 33 | Direct data-transfer with Double Handshake: Block structure for FSMDs                 |
| 34 | Direct data-transfer with Double Handshake: Refinement to FSMs and datapaths          |
| 35 | Data-transfer via queue with Double Handshake: Block structure for FSMDs              |
| 36 | Data-transfer via queue with Double Handshake: FSMDs                                  |
| 37 | Data-transfer via queue with Double Handshake: Refinement to FSMs and Datapaths       |
| 38 | Datapath of buffered queue                                                            |
| 39 | Data-transfer via queue with Double Handshake: FSMs                                   |
| 40 | Data-transfer via memory with Double Handshake: Structure                             |
| 41 | Data-transfer via memory with Double Handshake: FSMs                                  |
| 42 | Statemachines for Single Handshake via Queue (FSMD)                                   |
| 43 | Statemachines for Single Handshake via Queue (FSM Controlling Datapath)               |
| 44 | Statemachines for Doublehandshake via Queue with simultaneous Read/Write              |
| 45 | Statemachines with Extended Timing Properties for Doublehandshake via Memory (FSM) 50 |

### Modeling Custom Hardware in VHDL

#### Heiko Lehr

#### Abstract

This report focuses on models for describing Hardware at different refinement levels within High Level Synthesis flow: SFSMD, FSMD and FSM Controlling Datapath. Simple data interfaces are often needed in these models. The hardware description language VHDL is used and tested for implementing these models and interfaces. Problems inherent in the models as well as problems caused by VHDL are discussed. A model related main difficulty is the one-cycle delay of data processing, when introducing a datapath. VHDL descriptions of abstract models (SFSMD) can become complicated. Templates for general problems, examples and VHDL guidelines are provided in this report to help to minimize design errors significantly.

### 1 The meaning of the models

In order to discuss High Level Synthesis, different specification models are important.



Figure 1: Models for HL-Synthesis

Figure 1 shows the refinement flow of the models. The models used for high level synthesis are connected by the solid arrows. The FSMD to FSM flow is not useful (indicated by a dotted arrow). The horizontal arrow indicates, that the connected models are on the same refinement level. They are related strongly, but their ideas and possibilities for usage are differently.

Theoretically, each model could be described behaviorally and structurally. However depending on the model, such a description may be impractical or even meaningless because of an exploding size or complexity of the description.

Figure 2 shows the system exploration chart (Y-Chart). It consists of refinement levels (circles) and description domains (lines). The significant description models are marked in the chart.

The following sections explain these models. Additionally, description models of little or no use are mentioned briefly.



Figure 2: System Exploration Chart

#### 1.1 SFSMD

The Super Finite State Machine with Datapath (SF-SMD) is the most abstract level of description.

The whole algorithm can be considered to be a SF-SMD with one superstate. But generally, the algorithm can be divided into any number of parts of any size. These parts are the superstates.

The algorithm parts consist of reading inputs (I), writing outputs (O) and executing expressions (EXP), which use intermediate storage variables (V). Using these elements, SFSMD can be described syntactically in the same way as FSMD. (See FSMD below.) The

basic difference between the two models is, that SF-SMD does not restrict the size of the algorithm-parts assigned to a state, whereas FSMD does. This is, because SFSMD does not correspond to hardware at all, whereas the states of FSMD correspond to clock cycles.

#### 1.1.1 Behavioral Description

For behavioral description, the algorithm which is chosen to be implemented in hardware must be correct and executable on any platform, but no restrictions concerning a special structure apply. The whole algorithm could be assigned to one single big superstate (figure 3). But in most cases several superstates are recommended. When describing the algorithm in a high level programming language, it is typical to map the procedures to superstates.

Figure 4 shows an example with several superstates. Like in every state machine, in SFSMD a superstate can have several possible successors. The valid one is determined by transition conditions as shown in state  $Part\ \mathcal{J}$  of the figure. The left-hand side "cond" can be a variable, but it also could be a function.



Figure 3: One-State-SFSMD



Figure 4: SFSMD

#### 1.1.2 Structural Description

A structural description is not used for this model. Hardware allocation and scheduling are not done at this point (see section 2). Therefore a structural description just could be something like a general purpose computer which is capable of executing the appropriate program which is stored in some memory (Y-Chart, fig. 2: structural domain, outer circle). This is not helpful for implementing a specialized algorithm.

#### 1.2 FSMD

In the model Finite State Machine with Datapath (FSMD) scheduling is performed (see section 2). That means, algorithm is divided into small pieces which are assigned to cycles using cycle based states. FSMD-cycles will be mapped to the cycles of the hardware clock. However hardware cycle time is not finally set yet. There may also be a multiplication factor in this mapping (e.g. each FSMD cycle lasts two clock cycles).

In each state two things are done:

- The operation scheduled to this state is executed within one clock cycle.
- The next state for the next clock cycle is determined.

For an exact description of FSMD, some sets have to be defined:

```
S: set of states I: set of inputs
```

V: set of storage variables

O: set of outputs EXP: set of expressions:

functions which give results depending on storage variables V and operators OP:

 $EXP = \{V, OP\} = \{f(x, y, z, \ldots) | x, y, z \in V\}$ 

OP: set of operators used in EXP
STAT: set of status expressions:
logic relations between

two expressions from the set EXP:  $STAT = \{Rel(a,b)|a,b \in EXP\}$ 

Referring to the above definitions, the data processing (first item in above list) is described by function h:

$$h: S \times (I \cup STAT \cup EXP) \rightarrow (V \cup O); \qquad V0, S0$$

There needs to be a set of initializing values V0 for the variables V when starting the FSMD, because the expressions in EXP read them. There also has to be

an initial value S0 for the state.

The next state (item 2 in above list) is determined in a similar way:

$$f: S \times (I \cup STAT) \rightarrow S;$$
  $V0, S0$ 

V0 and S0 are needed again, because STAT depends on EXP, and EXP depends on V.

With these definitions, a FSMD is described by:

$$\langle S, S0, (I \cup STAT \cup OP), V, V0, O, h, f \rangle$$

#### 1.2.1 Behavioral Description

The FSMD Behavioral description is shown in figure 5. Behavioral description is preferred for this model, because of better readability. It clearly illustrates how the code pieces are scheduled in the states. It is located at the RTL Behavior point of the Y-Chart. The term "RTL" (Register Transfer Level) indicates the cycle accuracy of this description and the corresponding register transfers of the final hardware:

Each variable used in this description will be assigned to elements of the final hardware (register, memory, input, output, delay-wire). Several variables may be assigned to the same element as long as their lifetimes do not overlap. However, the assignment itself is not done in this description. It is just assumed that each variable will have a location somewhere in the final hardware.



Figure 5: FSMD Behavioral

#### 1.2.2 Structural Description

A structural description is not used for this model. The structural description would look like figure 6, except that control and datapath are not separated into distinct blocks.

The following model is more suitable for a structural description.

#### 1.3 FSM Controlling Datapath

Here, the hardware is described on the same refinement level as the previous model. In contrast to FSMD, there is splitting into a *control*-part, described by a *Finite State Machine (FSM)* without datapath, and a separate datapath now.

The control block controls the execution of the operations while the datapath actually performs them. That means, compared to FSMD, the computation of the expression STAT and EXP are done in the datapath. Storage variables V, inputs I and outputs O are also covered by the datapath.

For the control block, the set of states S remains. Additionally, there are connections between control block and datapath:

C: Control signals (output of control FSM)

M: Message signals (input of control FSM)

The functions for the datapath look similar to the FSMD model. The difference is, that message and control signals are used instead of the state:

$$h_M: C \times (I \cup STAT \cup EXP) \to (V \cup O); \qquad V0, C0$$
  
 $f_M: C \times (I \cup STAT) \to M; \qquad V0, C0$ 

$$< C, C0, (I \cup STAT \cup OP), V, V0, O, h_M, f_M >$$

The control block defines the relationships among message signals M, control signals C and states S:

$$h_C: S \times M \to C$$
  
 $f_C: S \times M \to S$   
 $\langle S, M, C, h_C, f_C \rangle$ 

Depending on the current state, the control block sets control signals for the datapath. These signals tell the datapath which data are to be processed and how they are to be processed. The datapath can inform the control block about special results via message signals. See figures 6 and 8. Depending on the message signals, the control block determines the next state which becomes valid in the next clock cycle.

The separation into these two blocks offers new possibilities: Each can be independently described behaviorally or structurally.

#### 1.3.1 Behavioral Description

A pure behavioral description of this model is not used. When trying to describe this, there would be a second state machine for the datapath. This would cause lot of trouble:

If NC is the number of control signals from the control block to the datapath, then  $T = 2^{NC}$  is the number of different conditions for state transitions, which is an "exploding" expression.

Conclusion: A behavioral description of a separate datapath is not suitable. The model FSMD is used instead.

#### 1.3.2 Behavioral-Structural Description

The big advantage of the split model is the possibility of still using a behavioral description for the control, while describing the datapath structurally. See figure 6. The first block illustrates the states in accordance with the Register Transfer Level while the second contains the data structure: memory, registers, ALU and so on, as well as their connections and data inputand output ports. For example, a multiplication is implemented by the ALU and the result is stored in its output register. The ALU consists of pure combinational logic. The command signals from the FSM control the datapath: ALU = "11" chooses multiplication and 1d\_0 = '1' stores the result in register O.



Figure 6: FSM Controlling Datapath (1)



Figure 7: Multi purpose datapath

This structural representation also illustrates the possible data flows. In figure 6, variables A and B can be multiplied by the ALU and the result can be stored in variable O which resides in register O.

Instead of the very specific datapath in figure 6, also a more general datapath can be used, like the one in figure 7. A general datapath often avoids the necessity of creating or changing the datapath when the control FSM is replaced or changed. On the other hand, a general datapath is less optimized and may contain unused overhead.

The huge amount of combinational logic is represented well in a datapath picture. For example, multiplier and comparator are purely combinational. That means, they do not use the clock and they do not contain any latches.

This pure structural description grows just linearly when inserting additional components. The size of a behavioral datapath description would have exploded. There would also be complicated transition conditions due to the huge amount of combinational logic.

In the control block, the situation is different: There are no "control data" which are modified and then passed to other operations. There is exactly one variable and it has special meaning: the state variable. It needs to be written and to be read. As its value is the basis for cycle accurate control, it has to be defined by a FSM in each clock cycle. For pointing out the state transitions and the state based interaction with the datapath, the behavioral description on the left-hand side in figure 6 is the best representation.

It is to say, that the split model with the control part described behaviorally and the datapath described structurally is more difficult to read by humans than a behavioral description of the model FSMD. However, this approach to the final hardware has to be taken. Physical hardware finally is obtained by implementing a structural description. Therefore we finally need a pure structural description. The datapath, which is by far the largest part of the design, is structural already. The behavioral control description is simple and can be converted to a structural description by a simple tool.

Conclusion: The split model with behavioral control and structural datapath is a good basis for synthesis tools which have some intelligence about how to convert behavioral description into structure. The model is still readable by humans.

#### 1.3.3 Structural Description

Synthesis tools with no intelligence may need a pure structural description. In figure 8, even the control part is structural. The control block shown here, implements the control FSM, too. However it reveals neither the states nor the associated actions any longer but hides them within the logic blocks. Even looking at the internal description of the logic blocks, pure structure is quite less readable.



Figure 8: FSM Controlling Datapath (2)

Conclusion: A pure structural description should be used only if the synthesis tool or the input language does not permit a behavioral description of the

control block.

#### 1.4 FSM

A Finite State Machine (FSM) without Datapath for describing the whole design (and not just the control) causes an explosion of the number of states:

Using no datapath means that for all possible data a separate state is needed. For example, a 16-bit integer variable causes 65536 different states. This has to be done for all other variables, too. As there is no datapath, a formal description of this model is simple. There are no storage variables and expressions using them. The only value which is stored is the state.

The functions for the next state and output are simplified to:

$$h: S \times I \to O$$
$$f: S \times I \to S$$

The FSM defining tuple is:

It is obvious, that the model FSM can only be used for problems which are dominated by boolean variables. They should not use variables which are larger than about 3 or 4 bits. Thus, this model is only suited for control algorithms, which are a special case. The FSM within the model "FSM Controlling Datapath" is an example.

However, for the general case, which includes data processing, this model can not be used reasonably.

Note: Synthesis tools may use this model for a hidden, automatically generated intermediate format in order to further refine the synthesis flow. But this is only necessary if low-level IPs (Intellectual Properties) like adder, multiplier and so on can not be provided as plug-ins for the datapath (see previous model).

#### 1.5 Conclusion

The above discussion points out the following models and descriptions. They are the best in most cases for performing High Level Synthesis with machine-human interaction.

#### 1. SFSMD Behavioral

(a) One-state SFSMD (general mathematical description or program)

- (b) Multi-state SFSMD (problem separation, structured program)
- 2. FSMD Behavioral
- 3. FSM Behavioral Controlling Datapath Structural
- 4. (FSM Structural Controlling Datapath Structural, if needed)

## 2 Scheduling, Allocation and Binding

The assignment of code slices to states is called *scheduling*. A slice of code from a FSMD Behavioral description should be as small to be executable within one cycle.

Exceptions can be made for some complex computations which cannot meet this constraint. They can also use multiple states.

Parallel execution of several incremental code pieces within one clock cycle can reduce the number of needed cycles and thus speed up the design. However there are two problems: One problem are the data dependencies. The simplest example is the impossibility of computing two values in parallel, if one is needed as an input for computing the other one. Another constraint is, that the datapath should not get too large exceeding restrictions in size or cost. For example, parallelizing two multiplications causes two multipliers in the datapath which significantly increases complexity.

So, when scheduling is performed for the behavioral description of the FSMD model, the *allocation* of data and actions partially should be planed already. It is accomplished in the structural description of the datapath however. The control block for the datapath points out the real requirements in a much more obvious way. The datapath has to be equipped with appropriate registers, memory and so on. This is the real execution of allocation.

Thereafter, binding has to be performed. The control block is adjusted to assign data to the right registers, correct memory addresses and other components.

### 3 VHDL Programming Structure

Generally, the "programming" language VHDL can be used for all the models and descriptions: Behavioral description of SFSMD, FSMD and for the description of a behavioral FSM controlling a structural datapath.

This section is a summary about the structure of VHDL programs. Those, who know VHDL well, may skip this section and proceed with section 4. For others, it helps to remember the structure of VHDL programs and to understand the following sections and examples better. However, this section does not teach VHDL.

Figure 9 shows a simple general template for the structure of VHDL-code from which the description of the models will be derived. Some optional parts are omitted, because they are seldom used and not needed here

VHDL is block-oriented and in that way uses hierarchy. The hierarchy structure usually looks like the one shown in figure 7: A testbench forms the top level. It instantiates the design block which may have several subblocks. Subblocks can have subblocks themselves.

Except the testbench, the other blocks need to have data ports. The ports of the top level design block implement the inputs and outputs of the algorithm. The are independent of the currently used model and description. Therefore VHDL separates them: The entity covers the port declarations whereas the architecture implements the computation part.

The figure shows that the architecture consists of a declaration part and an implementation part. The latter one contains processes, subblock-instantiations and concurrent statements. They are executed parallelly, indicated by grayed boxes in the figure 9. Simple concurrent statements usually are a simple logic block. Processes behave like infinite loops of code. They temporarily have to be suspended before each run of the loop. This can be done by wait commands or by a so-called sensitivity list. Figure 9 uses sensitivity lists in the process heads, causing the process to wait on a change of the listed signals.

Subblock-components, functions, procedures and additional internal signals have to be declared in the declaration part before they can be used in the implementation part.

A process can also use its own local variables, declared inside the process which uses them (schematically indicated by "Variable\_Declaration").

This causes the question whether to declare a global signal or a local variable for holding a value.



Figure 9: General VHDL program structure

- If the value is needed outside the process, it can only be held by a signal.
- Sequential computations within one cycle need variables. Sequential computations means: A process writes a value. Thereafter, the same process reads this updated value in the same cycle.
   Signals own the property of having a delta-delay, which delays the update until the next cycle starts. Variables do not have this property and

therefore can be used in this case.

#### • Otherwise:

Declare a signal, if a physical equivalent (physical signal or wire) in the final hardware is expected.

Declare a variable, if the description is too abstract to have an obvious correspondence to hardware (variables in SFSMD).

An exception to the rule of leaving the entity unchanged is needed: When switching the models for the algorithm to deeper refinement levels, some synchronization signals like *start*, *done* and *clk* (clock) need to be added in the entity. They are inevitable ports of the final, hardware but they are not *data* ports.

In the figure, the words "Design\_Input" and "Design\_Output" are abbreviations. They stand for the range of all inputs and outputs of the design.

# 4 Describing the Models in VHDL

Section 1 gave a view of the meaning of the hardware description models. The following sections explain how to model them in VHDL and discuss some significant problems.

#### 4.1 Abstract and Introduction

#### 4.1.1 The State Machines Graphically

The graphical representation of the state machines in figures 10 and 11 is according to the following rules: The actions are attached to the states they are assigned or scheduled to. The kind and size of the actions is dependent on the model. The new data results caused by the actions are not available before the next state transition.

Among the arrows starting at the current state there must be exactly one with a true condition. Thisone determines the next state. All possibilities have to be covered by the conditions and ambiguity is not allowed.

Whereas state transitions are bound to conditions, actions are bound to states only but not to conditions. Such state machines are called *Moore Machines*.

#### 4.1.2 Translation into VHDL

The right-hand side of figures 10 and 11 show possible VHDL descriptions, which implement the graphical representation of the right-hand side. Figure 10 shows how to implement a state machine inside a procedure. Figure 11 describes the implementation inside a process. Both versions use the VHDL-case statement for modeling the state machine.

In the figures, the typewriter font is used for direct VHDL code. Proportional font is used for parts which can either be a procedure (resp. function) or an abbreviation for a piece of code, that is:

#### Actions\_i:

Parts of the algorithm assigned to the state i. Actions\_i can be a piece of code or a procedure. It reads the signals/ variables represented by IN\_i and writes the variables represented by OUT\_i.

#### Cond\_i\_j:

Condition for the state transition from state i to j. Cond\_i\_j may be either a function or a piece of code. It reads variables and signals from IN\_i\_j. The result value of the condition is represented by its name itself.

#### Next\_State\_i:

Sets the value of **state** after state i. Next\_State\_i is an expression or a procedure. In the used syntax, it has the following parameters: inputs: list of conditions, list of possible next states; output: next state (STATE).

**Procedure versus Process.** The "procedure version" should be used for models at a high level of abstraction (SFSMD) and for models which do not use timing (SFSMD, FSMD Without Time).

Procedures are well-suited for sequential programming style with preferred use of variables instead of signals. This especially meets the requirements of SFSMD. Additionally, SFSMDs can be simply concatenated by calling one procedure after the other. This means, no allusions to a hardware protocol are needed.

A process on the other hand does not have these advantages. Additionally, a process is an endless loop which is developed for use with a clock.



Figure 10: A state machine inside a procedure

```
behavior1: Process(Clock)
                            Variable Declarations;
                            begin
                              if (Clock'event and Clock='1') then
                                if (Reset = '1') then
                                 Initialization( OUT_1 );
                                 next_STATE <= S_BEGIN;</pre>
                                else
                                 case next_STATE is
                                    when S_BEGIN => Done <= '0';
Initialization S_BEGIN Start = '0'
                                                    Initialization( OUT_1 );
                                                    if Start = '1'
                                                      then next_STATE<= S_1;
                                                      else next_STATE<= S_BEGIN;</pre>
     Start = '1'
                                                    end if;
                                    when S_1 =>
                                                  Actions_1( IN_1, OUT_1 );
   Actions_1
                                                  Next_State_1( Cond_1_2, Cond_1_8,
                                                            S_2,
                                                                    S_{8}
     Cond_1_2
                Cond 1 8
                                                            next_STATE );
                                    when S_2 =>
   Actions 2
                                    when S_END => Done <= '1';
           S END Start = '1'
 Done:='1'
                                                  if Start = '0'
                                                   then next_STATE<= S_BEGIN;
 to S_BEGIN
                                                   else next_STATE<= S_END;</pre>
                                                  end if;
                                  end case;
 Start = '0'
                                end if;
                              end if;
                              end process;
```

Figure 11: A state machine inside a process



Figure 12: A state machine for controlling a separate datapath

The "process version" is preferable if the model uses a clock (FSMD With Clock, FSM controlling datapath).

Although a procedure could be used with wait statements, this is not recommended. Such a bad style of hidden and scattered wait statements could confuse humans and synthesis machines. According to that, the even worse way of using wait statements inside a function is forbidden by the language already. Thus processes are better for use with a clocked description. The use of a sensitivity list in the process heads expose the wait conditions very well. In figure 11 the list has one entry: Clock.

Special Case: State Machine and Separate Datapath. Figure 12 shows some changes to figure 11: The activation of the next state is done by a newly introduced process. The second process, which contains the state machine, is made sensitive to the state and all input signals (process(STATE,Start,DP\_msg,IN\_1)). This modification is needed for a model which uses a separate datapath. It compensates delays which are caused by the datapath: In contrast to a single state machine with "build-in" datapath, a separate control-FSM just gives commands and the actual computations are ex-

ecuted by the datapath one cycle delayed.

The modification in figure 12, however, compensates this: The clock has been removed from the state machine. Instead, datapath feedback messages now appear in the sensitivity list of the state machine's process. Thus the state machine can react to changes on these signals without waiting for the next clock cycle. In this way the delay is "compensated". See section 4.2.6 for more.

#### 4.2 Examples and Guidelines

This section shows how to use and code the models with VHDL with the help of examples. Examples are much clearer than the abstract descriptions of the previous section. The examples can be used like templates. Figures 13 to 17 show the descriptions of the state machines in either a procedure or a process. Figures 20 to 23 show their embedding into a complete VHDL description. In this latter set of figures, the declaration of all variables and signals, needed for the examples, can be seen too.

Important rules are generalized and described. Executable files of the examples are available and listed in the appendix.

#### 4.2.1 SFSMD Behavioral

Figure 13 shows a very simple SFSMD: The superstates S\_1 and S\_2 perform expensive power operations each. In superstate S\_3, the results are subtracted.

This is just one way of describing the algorithm. The abstract model SFSMD provides a lot of freedom:

- The three superstates in the example may be merged into a single one.
- Actions and conditions (conditions are not used here, but see figure 10) can be of any complexity.
- In this example even the loop and the case statement may be omitted, because there is just a sequential computation:

```
begin
-- State S_1:
-- S:= In1 ** In2;
                     -- (power)
S := 1;
for i in In2 downto 1 loop
   S := S * In1;
end loop;
-- State S_2:
-- R:= In3 ** In4;
                      -- (power)
R := 1:
for i in In4 downto 1 loop
   R := R * In3;
end loop;
-- State S_3:
-- Output
Out1:=S-R;
end behavior1;
```

Rules. Almost all constructs of the programming language are allowed for use in the description of the SFSMD model, but the following rules have to be obeyed when modeling an SFSMD in VHDL:

• All inputs and outputs have to be declared clearly.

In VHDL, do not introduce "unofficial" data ports by reading from or writing to a file within a procedure. All inputs and outputs should be declared in the entity.

• Data ports have to be read and set accurately.

Other ports (control, timing) should be avoided.

An abstract description never means, that abstract results are allowed. To be a valid model, it

must occupy the data ports completely in accordance to the specifications, except that time is not handled. That means outputs are set without delay.

Synchronization ports and other timing related ports (like the clock input) are implementation specific. They are omitted in this model because this one is abstract and non-timing accurate.

In this model parallelism and timing are not used.

This means, in VHDL use only one process and prefer to use variables instead of signals.

Use of several parallel processes would mean switching to the model CHSFSMD (Concurrent Hierarchic SFSMD). In that model hierarchically higher processes start other processes parallelly. But this is not discussed here.

Signals should not be used because signal assignments used without appropriate timing delays are not executed in the desired sequential order (see section 3). It is difficult and dangerous to use signals in this model, because a large amount of sequential code is assigned to each superstate. The state is a variable, too.

#### 4.2.2 FSMD Behavioral

There are two variants of description for the model FSMD: FSMD Behavioral Without Time (figure 14) and FSMD Behavioral with clock (figure 15).

Timing and synchronization originally do not belong to this model, but introducing clock at this level often is an advantage when using VHDL. (See section "Discussion" below.)

The VHDL-implementations of both behavioral variants look similar: In this example, figures 14 and 15 implement superstate S\_1 of the SFSMD of figure 13. An FSMD also may implement several superstates.

**Steps.** The steps to convert a SFSMD into a FSMD are the following:

• Divide the SFSMD into slices which are meant to become implemented by a FSMD each.

A FSMD can be the implementation of just one superstate up to the whole SFSMD. After testing and/or refining into the model "FSM Controlling Datapath", the state machines can be joined together again by concatenating the

states (and merging the datapaths). Alternatively the state machines can remain separate when adding communication (see section 5).

• Choose way of description: FSMD Without Time or FSMD With Clock.

Usually the description with clock is preferable, when modeling in VHDL (see section "Discussion" below).

According to section 4.1.2, FSMD Without Time should be implemented in a procedure (figure 14). FSMD With Clock has to be implemented in a process (figure 15).

Version without time in a procedure:

- There doe not need to be a "when"statement for the state S\_BEGIN in this model.
  - The reason is the sequential programming style: The state machine is alive only for the time it is needed, else it does not exist. If the code of the state machine is not executed, this means that it rests in the state S\_BEGIN.
- Inside the procedure only variables are used (assignment operator ":=").
  (However, signals become inevitable in communicating state machines.)
- The simple data types are adopted from the SFSMD-model.

The example uses "integer" for instance.

Version using clock in a process:

- The states S\_BEGIN and S\_END as well as "start"- and "done"-signals are needed for synchronization.
  - In a process, a state machine is always alive. The state machine waits in state S\_BEGIN for its "start"-signal. When the machine finishes, it reaches the state S\_END and sets a "done" signal.
- In the process, signals are used generally.
   (Variables may be used exceptionally, if they are not expected to have a corresponding wire or register in the final hardware, e.g. variables which model delay.)
- The hardware related data type std\_logic\_vector is used in the example.

This seems likely because this description

uses clock which is close to hardware, too. Both aspects provide the possibility of easy translation into the model "FSM Controlling Datapath".

#### • Schedule the operations:

A superstate is broken into several states for FSMD. Each state of a FSMD lasts exactly one cycle time. A state of a FSMD can contain as many computations as the data dependencies allow. However a lot of computations in a state cause the datapath to become large (see section scheduling).

In the example, the superstate S\_1 in figure 13 is scheduled to the states S\_1 to S\_3 in figures 14 and 15. The "power"-operation of the superstate cannot be executed in one cycle. Hardware usually performs this operation by running a loop of multiplications. So does this FSMD example.

# 4.2.3 Discussion: FSMD Behavioral without time or with clock

The constraint of using variables for behavior input and output is not nice. Behavior input and outputs as well as many other value-holders<sup>1</sup> can be expected to have a physical equivalence (wire) in the final hardware. Therefore the use of signals would be more accurate.

A more unpleasant circumstance is the fact, that in a description without clock, a VHDL Simulator determines the execution time equal to zero. Therefore the possibilities for debugging the code are limited to the old fashioned way of running the code step by step. When simulating hardware, displaying the data and the clock cycles over the time may be desired. That is not possible, because there is no time. This disadvantage can be removed by using FSMD Behavioral with clock. In this description a clock accurate simulation shows much better how the design will work. (Example: The "Waveform Viewer" provided by "Synopsys" can be employed now.) Debugging becomes easier. Additionally, the designer already sees, where to pay attention to possible timing problems.

Using the appropriate subset of VHDL, the model FSMD Behavioral with clock can be even directly synthesizable already (with bad results though, compared to "FSM Controlling Datapath").

A theoretical disadvantage of this model is the fact, that hardware details like clock normally belong to a

 $<sup>^1\,</sup>Value\text{-}h\,older$  is used as a generic term for both VHDL- signal and VHDL- variable .

structural description instead of to a behavioral one, because it is an implementation detail. However, this step has to be performed anyway and besides the advantages for simulation, this makes the step to the description "FSM Controlling Datapath" closer.

Finally, such allusions to hardware implementations are just *suggestions* and *not definite*. It still has to be regarded as being a *behavioral* description. The used clock time may change or become a multiple of the real system clock.

It also is possible to use both behavioral models, first the one without clock and then refine it to the one with clock.

#### 4.2.4 FSM Behavioral Controlling Datapath Structural

After the FSMD Behavioral description with clock is finished, the step to a structural datapath controlled by a FSM is easily described, but tedious to perform.

- A VHDL datapath file has to be created, which binds the library components<sup>2</sup> and connects the entity signals (data in- and outputs, control inputs and message outputs). See figure 16.
- All computations and data assignments are removed from the state machines, including computations within conditions. Instead, the appropriate control signals, which cause the data processing in the datapath, are set. See example in figure 17.
- The state machine is notified of the result of conditions by message signals, only few bits wide ("CMP" in figure 16).
- A separate process for setting the next state has to be introduced.
  - The remaining process for the FSM is not clock-triggered any longer, but sensitive to all inputs of the FSM. (See sensitivity list in the process's head.) Thus this process can react to results of the datapath *before* the next clock edge arrives and "compensate" the delay of the datapath (see section 4.2.6 below).
- A datapath structural description has to be got. This is a deep refinement level. All wires and hardware blocks are specified. Therefore only signals have to be used and the use of variables is forbidden here. Components of the datapath are instantiated as shown in figure 16 by an example.

#### 4.2.5 FSM Structural Controlling Datapath Structural

In most cases, the step from behavioral FSM to structural FSM does not need to be supervised by humans. Figure 18 shows the conversion into a structural description. Each bit of the output of the FSM is a boolean function of all bits.

The example shows the unoptimized coding. The size of the boolean functions should be minimized before synthesis.

#### 4.2.6 Compensation of Datapath Delay

An FSMD does computation directly. For example, in figure 14, the state S\_2 contains Mult\_temp := 0 \* A. In contrast to that, a control-FSM just gives commands.

The actual computations are executed by the datapath one clock cycle delayed. As a result, a FSMD could not simply be translated into "FSM Controlling Datapath". The solution is a modification in figure 12 which is shown in figure 17. As soon as the datapath gives its result, the FSM is employed again within the same clock cycle because of the feedback messages appearing in the sensitivity list. In figure 17 there is just one feedback signal: CMP.

Thus the datapath delay is "compensated" and computation results can be used one cycle after they were ordered. The behavior of "FSM Controlling Datapath" is back to the expected behavior of a state machine. It is equivalent to the behavior of the FSMD, from which it is derived and in accordance with the FSM architecture on the left-hand side of figure 8 in section 1.3.3.

Note on "compensation":

When talking about "compensation" here, it does not mean a real removal of the delay. Delays and actions are just moved closer together. This causes a decrease of the allowed clock period time!

Warnings:

• Omitting the separation of next-state process and state-machine process means, the delay through the datapath will not be compensated. This is equivalent to introducing additional output latches as shown in figure 19. Doing this is possible, but discouraged. It means a change of common architecture agreements and the behavior is not equivalent to the FSMD-model.

<sup>&</sup>lt;sup>2</sup>from user libraries and /or propriatary libraries

• Introducing a separate next-state process in the FSMD-model itself is strictly forbidden! Trying to compensate non-existent delays means removing necessary latches. Unlatched, indeterministic feedback loops may occur. Also, such a description can never be translated into "FSM Controlling Datapath" any more.



Figure 13: Example: SFSMD Behavioral in VHDL



Figure 14: Example: FSMD Behavioral Without Time in VHDL

```
behavior1: Process(Clock)
                               begin
                                if (Clock'event and Clock='1') then
                                  if (Reset = '1') then
                                   0 <= (others=>'0');
                                   next_STATE <= S_BEGIN;</pre>
                                  else
                                    case next_STATE is
                                      when S_BEGIN => Done <= '0';
            S_BEGIN Start = '0'
  Done<='0'
                                                      if Start = '1'
                                                        then next_STATE<= S_1;
                                                        else next_STATE<= S_BEGIN;</pre>
      Start = '1'
                                                      end if;
     A \leq In1;
                                      when S_1 =>
                                                      A <= In1;
     B \leq In2;
                                                      B <= In2;
      O \le 1;
                                                      0 <= conv_std_logic_vector(1, 32);</pre>
                                                      next_STATE<= S_2;
                                                   _____.
Mult_temp<=
                                      when S_2 =>
                                                      Mult_temp <= O(15 downto 0) * A;</pre>
        O*R;
                                                      B <= B - conv_std_logic_vector(1, 1</pre>
     B \leq In2;
                                                      next_STATE<= S_3;</pre>
                     B > 0
                                      when S_3 =>
                                                      O <= Mult_temp;</pre>
O<=Mult_temp;
                                                      if (B > conv_std_logic_vector(0, 16
                                                        then next_STATE<= S_2;
         B \le 0
                                                        else next_STATE<= S_END;</pre>
                                                      end if;
                                                      Done <= '1';
                                      when S_END =>
                   Start = '1'
  Done<='1';
             S END
                                                      if Start = '0'
  to S_BEGIN
                                                        then next_STATE<= S_BEGIN;
                                                        else next_STATE<= S_END;</pre>
                                                      end if;
                                   end case;
                                 end if;
   Start = '0'
                                end if;
                               end process;
```

Figure 15: Example: FSMD Behavioral With Clock in VHDL



Figure 16: Example: Datapath of "FSM Behavioral Controlling Datapath Structural"



Figure 17: Example: FSM of "FSM Behavioral Controlling Datapath Structural"

```
Clock
Reset
Enable
State Register
STATE
```

#### State\_Register: Process(Clock, Enable)

```
begin
  if (Clock'event and Clock = '1') then
   if (Reset = '1') then STATE <= 000;
  elsif (Enable = '1') then STATE <= next_STATE;
  end if;
  end if;
end Process;</pre>
```

|       | DP-Control and Status Output Logic                                 |               |  |  |  |
|-------|--------------------------------------------------------------------|---------------|--|--|--|
| Input | DP-Control Output                                                  | Status Output |  |  |  |
| STATE | DP_Reset<br>1d_A<br>1d_B<br>Count_M<br>Count_En<br>MUX_sel<br>1d_0 | Done          |  |  |  |
| 000   | 1 0 0 0 0                                                          | 0             |  |  |  |
| 000   | 1 0 0 - 0 - 0                                                      | 0             |  |  |  |
| 001   | 0 1 1 - 0 0 1                                                      | 0             |  |  |  |
| 010   | 0 0 0 1 1 1 0                                                      | 0             |  |  |  |
| 011   | 0 0 0 - 0 1 1                                                      | 0             |  |  |  |
| 111   | 0 0 0 - 0 - 0                                                      | 1             |  |  |  |
| rest  | don't care                                                         | don't care    |  |  |  |

#### -- Datapath-Control and Status Output Logic

```
-- Datapath Control Output
DP_Reset <=
              ( not STATE(2) and not STATE(1) and not STATE(0) );
ld_A
         <=
               ( not STATE(2) and not STATE(1) and
                                                      STATE(0));
ld_B
         <=
               ( not STATE(2) and not STATE(1) and
                                                      STATE(0));
Count_M
         <=
               ( not STATE(2) and
                                 STATE(1) and not STATE(0) );
Count_En <=
               ( not STATE(2) and
                                     STATE(1) and not STATE(0) );
MUX_sel
         <=
               ( not STATE(2) and
                                     {\tt STATE(1)} and not {\tt STATE(0)} )
          OR ( not STATE(2) and
                                     STATE(1) and
                                                      STATE(0));
ld 0
              ( not STATE(2) and not STATE(1) and
                                                      STATE(0))
           OR ( not STATE(2) and
                                    STATE(1) and
                                                      STATE(0));
-- Controller Status Output
                   STATE(2) and
                                    STATE(1) and
                                                      STATE(0));
Done
         <= (
```

| Input                           |                      |                       | Next-State Logic Output         |
|---------------------------------|----------------------|-----------------------|---------------------------------|
| STATE                           | CMP                  | Start                 | next_STATE                      |
| 000<br>000<br>001<br>010<br>011 | <br><br><br>-1<br>-0 | 0<br>1<br>-<br>-<br>- | 000<br>001<br>010<br>011<br>010 |
| 111<br>rest                     |                      | -                     | don't care                      |

```
-- Next-State Logic
next_STATE(2) <= ( not STATE(2) and STATE(1) and next_STATE(1) <= ( not STATE(2) and not STATE(1) and
                                                                 STATE(0) and not CMP(0) );
                                                                 STATE(0))
                  OR ( not STATE(2) and
                                              STATE(1) and not STATE(0) )
                  OR ( not STATE(2) and
                                              STATE(1) and
                                                                 STATE(0) and
                  OR ( not STATE(2) and
                                              STATE(1) and
                                                                 STATE(0) and not CMP(0));
next_STATE(0) <=
                   ( not STATE(2) and not STATE(1) and not STATE(0) and START )
                  OR ( not STATE(2) and
                                              STATE(1) and not STATE(0)
                  OR ( not STATE(2) and
                                              STATE(1) and
                                                                 STATE(0) and not CMP(0));
```

Figure 18: Example: FSM of "FSM Structural Controlling Datapath Structural"

a) Changes in FSM Behavioral Description when Using just One Process (Outputs Latched)



b) Equvalent Changes in FSM Structural Description / Changes of the former Architecture (Outputs Latched)

```
Next State Register: Process(Clock, Enable)
begin
  if (Clock'event and Clock='1') then
    if (Reset='1') then next_STATE <= "000";
elsif (Enable='1') then next_STATE <= next_STATE_logic;</pre>
     end if;
end Process;
Output_Registers: Process(Clock, Enable)
begin
                                                                                                Control Inputs (Start)
  if (Clock'event and Clock='1') then
            (Reset='1') then
          DP_Reset <= '1';
                                                                                          Next-State
          ld_A<='0';
                                                                                             Logic
          ld_B<='0';
                                                                           Clock
          Count_M<='0';
                                                                           Reset
                                                                                                                Data-
                                                                                      Next-State Reg
          Count_En<='0';
                                                                           Enable
                                                                                                                path
          MUX sel<='0';
                                                                                                                Control
          ld 0<='0';
                                                                                           Datapath
          Done <= '0';
                                                                                           Control
     else
                                                                                             and
          DP_Reset <= DP_Reset_Logic;</pre>
                                                                                        Output Logic
          ld_A <= ld_A_Logic;
          ld_B <= ld_B_logic;</pre>
                                                                                        Output Reg's
          Count_M <= Count_M_Logic;
          Count_En <= Count_En_Logic;
          MUX_sel <= MUX_sel_Logic;
                                                                                              Control Outputs (Done)
          ld_0 <= ld_0_Logic;</pre>
          Done <= Done_Logic;
     end if;
  end if;
end Process;
-- Datapath-Control and Status Output Logic
DP_Reset_Logic <= ( not STATE(2) and not STATE(1) and not STATE(0) );</pre>
ld_A_Logic <= ...
-- Next State Logic
\texttt{next\_STATE\_Logic(2)} \  \, <= \  \, (\texttt{not} \  \, \texttt{STATE(2)} \  \, \texttt{and} \  \, \texttt{STATE(1)} \  \, \texttt{and} \  \, \texttt{STATE(0)} \  \, \texttt{and} \  \, \texttt{not} \  \, \texttt{CMP(0)} \  \, );
next STATE Logic(1) <= ...
```

Figure 19: Warning: Omitting separation of next-state-process causes additional output latches

#### Design: SFSMD Example

```
-- no library needed
 Entity SFSMD_ex is
    port (Reset, Start:
                                in bit;
           In1, In2, In3, In4:in integer;
           Out1:
                                out integer );
 end SFSMD ex;
Architecture SFSMD_ex_behavioral
                            of SFSMD_ex is
 -- declarations for architecture
 -- no add. signal declarations needed
 Procedure behavior1
     (In1, In2, In3, In4: in integer;
     signal Out1: out integer)
 type State_Set is (S_1, S_2, S_3, S_END);
variable next_STATE: State_Set;
variable R, S: integer;
begin
 -- implementation of architecture
 P1: process
end SFSMD_ex_behavioral;
```

Figure 20: Embedding the example SFSMD (fig. 13) in VHDL

#### Design: FSMD "No Time" Example

```
- no library needed
 Entity FSMD_no_time_ex is
    port (Reset, Start: in bit;
          In1, In2:
                          in natural;
                          out natural );
          O Port:
 end FSMD_no_time_ex;
 Architecture FSMD_no_time_ex_behavioral
                     of FSMD_no_time_ex is
 -- declarations for architecture
 -- behavior outputs
 signal O: natural;
                     _____
 Procedure behavior1
     (In1, In2: in natural;
     signal O: inout natural)
 type State_Set is (S_1, S_2, S_3, S_END);
 variable next_STATE: State_Set;
variable A, B: natural;
 variable Mult_temp: natural;
 begin
 -- implementation of architecture
 P1: process
-- Entity Outputs
O_Port <= 0;
end FSMD_no_time_ex_behavioral;
```

Figure 21: Embedding the example FSMD Without Time (fig. 14) in VHDL

#### Design: FSMD "with Clock" Example

```
library IEEE;
 use IEEE.std_logic_1164.all;
 use IEEE.std_logic_arith.all;
 Entity FSMD_clock_ex is
   port (Clock: in std_logic;
        Reset, Start: in std_logic;
        Done: out std_logic;
        In1, In2: in unsigned(15 downto 0);
        O_Port: out unsigned(31 downto 0));
 end FSMD_clock_ex;
 Architecture FSMD_clock_ex_behavioral
                     of FSMD_clock_ex is
 -- declarations for architecture
 signal O: unsigned(31 downto 0);
 signal Mult_temp: unsigned(31 downto 0);
 type State_Set is (S_BEGIN, S_1, S_2,
                    S_3, S_END);
 signal next_STATE: State_Set;
 signal A, B: unsigned(15 downto 0);
 -- implementation of architecture
begin
 behavior1: process
                       (Clock)
-- Entity Outputs
O_Port <= 0;
end FSMD_clock_ex_behavioral;
```

Figure 22: Embedding the example FSMD With Clock (fig. 15) in VHDL



Figure 23: Embedding the example "FSM Controlling Datapath" (fig. 16 and 17) in VHDL

# 5 Communication of State Machines

State machines may have to exchange data. There can be different reasons for having state machines which need to communicate:

- A large super state machine can be divided into smaller state machines in order to get smaller problems.
- Separate state machines can be designed, which solve common problems. They may be required for use in connection with other state machines like "plug-ins".
- A state machine may have to be divided, when it contains problems with different demands of the implementation platform. Each piece then can run on the platform which is best.

In all these cases, data exchange between the several state machines is necessary and an appropriate protocol needs to come along with it. In the following subsections, the simple method of handshaking is discussed, which is sufficient for many problems. Single handshake can be used, if the communicating state machines have the same cycle time. However double handshake is recommended, since it works with all odd cycle times and even if there is no assigned cycle time (SFSMD and FSMD Without Time).

A data sending machine may produce bursts of data with delays between the bursts. Also the receiving machine may consume the data irregularly. In these cases a buffer comes in handy, which can be a queue or a memory for instance. The design of state machines, exchanging data via a queue or memory, is shown in the later sections.

#### 5.1 Single Handshake

A prerequisite for single handshake is that each state machine is either a FSMD, which uses clock, or a FSM, both working with the same clock.

Figure 24 shows the usage of signals for single handshake between two state machines. Set-output and read actions are adapted to FSMD or FSM according to the table below the figure. If the sender (A) is a FSMD, the Output is set in state S\_c directly. If it is a FSM, the associated datapath sets the output one cycle delayed, in state S\_d. The signal A\_Ready\_B is set to '1', when the state machine has set the output or just sets it. This means, one cycle after A\_Ready\_B was set, the data can be read by the receiver (B). The

receiver gives the acknowledge signal B\_Ackn\_A when it reads the data.

Note, that the receiver may reach state S\_w earlier than data is send. Then it reads invalid data. But, the signal A\_Ready is '0' as long as the data is invalid. This causes the receiver to read the data again and again until it finally reads valid data.

If the receiver is an FSM, reading is delayed by the datapath, too. However, reading the data in state S\_x is early enough, since both state machines work synchronously with the same clock.

Figures 25 to 28 show the order of events and switching of states for all possibilities: Figures for FSMD-FSMD, FSM-FSM, FSM-FSMD and FSMD-FSM each contain diagrams for the following three cases: A is earlier than B, A and B reach the communication states at the same time, B is earlier than A (where A is sender and B is receiver).

Black arrows show how ready and acknowledge signals cause state transitions. A FSMD model cannot recognize the setting of a signal within the same cycle but only in the next cycle. This causes the large "scissors" in figure 25.

In the model "FSM Controlling Datapath", a side effect of compensating the datapath-delay is the early reaction to the input signals. The results are the smaller, narrow "scissors" in figure 26.

Figures 27 and 28 show the connection of FSM with FSMD, resulting in a mixture of the two types of "scissors".

In the FSMs, thin gray arrows indicate the wait period for setting and reading data by the datapath. The compensation of the datapath delay, stated in section 4.2.6, does not apply for synchronous data-transfer via the ports. The compensation just compensates a delayed reaction of the FSM within the same design.

Thick gray arrows point out, how many cycles after the actual setting of the data, it is read. This number must be greater than or equal to one, meaning the arrow must point downwards. A horizontal arrow is not allowed. If the thick arrows point downwards through several states, the output data has to be kept in these states.

The timing diagrams show the functionality of the construct in figure 24.

The appendix contains figures with statemachines transmitting data via a queue by single handshake. This is not discussed further, because the focus is set on the more robust and adaptive double handshake. For explanation about a queue, refer to section 5.3.2.



|                    | FSMD (with clock)     | FSM (of FSM+Datapath) |
|--------------------|-----------------------|-----------------------|
|                    | Set / Read Ports      | Load Registers        |
| Set Output Action  | Data_AB <=            | Ld_Data_AB <= '1'     |
| Read Output Action | n B_Data_A <= Data_AB | Ld_B_Data_A <= '1'    |

Figure 24: Single Handshake



Figure 25: Cases, when using Single Handshake with two FSMD



Figure 26: Cases, when using Single Handshake with two FSM



Figure 27: Cases, when using Single Handshake with a FSM transmitting to a FSMD



Figure 28: Cases, when using Single Handshake with a FSMD transmitting to a FSM

#### 5.2 Double Handshake

Double handshaking makes the data exchange much more robust and works with all kinds of cycle times.

See figure 30. First, similar to the simple handshake, the sender puts data on its output port and sets the ready signal (A\_Ready\_B='1'). The receiver reads the data and acknowledges it by B\_Ackn\_A='1'. Then a second handshake follows: The sender sets A\_Ready\_B to '0' and waits for a B\_Ackn\_A='0' from the receiver. This assures that none of the state machines can run ahead of the other one:

If the receiver would not wait in state S\_w for A\_Ready\_B to become '0' again, it would not be able to know when the sender has noticed that the data has been received. There is the danger of removing the received confirmation signal B\_Ackn\_A too early. If the sender would not wait in state S\_d for B\_Ackn\_A to be '0' again, it may start another data exchange in some other state and read the old B\_Ackn\_A='1'.

Figure 30 also assures, that the signal A\_Ready\_B='1' at earliest is set one cycle after the output signal has been set. Both signals cannot be set at the same time, because in an asynchronous communication, it is unknown which one would be recognized first by the receiver.

In the same way, the output signal must remain valid until the next receiver-cycle after the read operation. In figure 30, this is *not* done by moving B\_Ackn\_A <= '1' to a later cycle. Instead the output of A is held until B\_Ackn\_A = '0'. Doing so saves additional states.

The reaction delay of a datapath makes it necessary to move the "set output operation" within state machine A from state S\_b to state S\_a. State S\_a can be used as a "generalized set output state", which works for all models.

The delay of the datapath has to be considered in the receiving state machine, too. Here, a "generalized" state is impossible without introducing another state (figure 29). If no extra state is introduced, the output action has to be moved, depending on the used model (see figure 30):

SFSMD, FSMD: use state S\_w: B\_Data\_A<=Data\_AB; FSM: use state S\_v: ld\_B\_Data\_A<='1'.

Figures 31 and 32 demonstrate the timing problems which may occur, if output action and input action are not scheduled at the right state.

Figure 32b also shows that a FSMD determines its next state when entering a state. An FSM, however, updates its "next state"-value continuously during the the current cycle. Thus, communication between FSMs uses less clock cycles. Double handshake

does not guarantee a specific number of cycles. It just assures a safe exchange of data.



Figure 29: Insertion of a "General State" in the receiving state machine of figure 30



|                    | SFSMD / FSMD        | FSM (of FSM+D)     |
|--------------------|---------------------|--------------------|
|                    | Set / Read Ports    | Load Registers     |
| Set Output Action  | Data_AB <=          | Ld_Data_AB <= '1'  |
| Read Output Action | B_Data_A <= Data_AB | Ld_B_Data_A <= '1' |

Figure 30: Double Handshake

a) For the state machines being FSMs, assume the write command is given in state S\_b instead of state S\_a. The following problem may occur, if the reading machine runs much faster than the writing machine:



Output may not be valid when it is read again for the final time.

Figure 31: Timing problem when writing

a) For the state machines being FSMs, assume the read command is given in state S\_w instead of state S\_v. The following problem is caused, if the writing machine runs much faster than the reading machine:



b) For the state machines being FSMDs assume reading is done in state S\_v instead of S\_w. The following problem is caused, if the writing machine runs much faster than the reading machine again:



Figure 32: Timing problems when receiving

#### **Examples:** 5.3Data-Transfer Constructions in VHDL

This section presents complete graphical descriptions of the three most important and simple constructions for data-transfer. For all these examples executable VHDL code is provided in the appendix.

In all cases, double handshake is used.

• Direct data-transfer:

A construction of two state machines with datatransfer

- Sender (FSMD)
  - $\rightarrow$  Receiver (FSMD)
- Sender (FSM Controlling Datapath)
  - → Receiver (FSM Controlling Datapath)
- Data-transfer via queue:

Data-transfer from sender to queue and from queue to receiver

- Sender (FSMD)
  - $\rightarrow$  Queue (FSMD)
  - $\rightarrow$  Receiver (FSMD)
- Sender (FSM Controlling Datapath)
  - → Queue (FSM Controlling Datapath)
  - → Receiver (FSM Controlling Datapath)
- Data-transfer via memory:

An arbiter first grants sender to store data in a memory, then it grants the receiver to read the data.

- Sender (FSM Controlling Datapath)
  - $\rightarrow$  Memory || Arbiter (FSM) <sup>3</sup>
  - → Receiver (FSM Controlling Datapath)

There are no examples of FSMD and "FSM Controlling Datapath" being mixed, just because giving examples for all possibilities would mean too many repetitions. The models can be mixed easily. They just have to be exchanged in the examples. For example refer to the example of sender, queue and receiver, all being FSMDs (figure 35 some pages forward). The FSMD-sender simply can be removed and replaced by the sender which is described by "FSM Controlling Datapath". There are no changes in the connection wires (e.g compare figure 35 and figure 37). In the figures, which show the state machines, the differences in FSMD and FSM are mentioned for sender and receiver.

For each FSMD, there are two versions of VHDL files in the appendix: FSMD described inside a procedure (meant for use without timing) and FSMD in a process (meant for use with clock). Refer to sections 4.1.2 and 4.2.3 for the meaning of these two versions.

The model SFSMD is not provided in the examples for data exchange, because double handshake for SFSMD is identical to double handshake for the procedure-version of FSMD.

Direct data-transfer (first item) implements the basic form of double handshake as described in section

When transferring data via a queue, the described double handshake is modified on the receiver side. This is, because the queue controller is passive and does not give a write request or an "output data ready" signal to the receiver. Instead the receiver transmits a read request signal to the queue.

In the data-transfer via memory, double handshake is used in a different way. Double handshake does not supervise the transfer of each word of data any more. This is done by sender and receiver themselves now, when the memory access has been granted to them via double handshake by an arbiter. The arbiter talks to sender and receiver. There is no connection to the memory. Thus, the arbiter always is a simple FSM and there is no FSMD version of it.

The following conventions apply for correspondence between the figures and the VHDL code:

The figures show the logical hierarchy of the constructions, using block elements with solid thick borders. They are given an explanatory name, printed in big letters (e.g. in figure 33 "FSMD A"), except the outer block which covers everything. Each block element corresponds to a VHDL entity. Its name is given in small letters enclosed in parenthesis. Also some signal names are given in small letters. A name is attached to each wire in the figures. In VHDL, the same name is used in the entity-/architecture pair which corresponds to the block in the figure, which has the signal name printed in<sup>4</sup>. If a wire is labeled outside a block, it has the same or a similar name inside the block. In the state machine figures, the real VHDL signal

names are used, too.

Figure 40 uses thick wires. They represent busses.

<sup>&</sup>lt;sup>3</sup>The arbiter is arranged parallely to the memory. It is connected to sender and receiver but not to the memory.

<sup>&</sup>lt;sup>4</sup>Except sometimes the trailing word "...Port" is omitted in the signal names in the figures.

### 5.3.1 Direct Data-Transfer

The following two figures show the block diagrams of the implementation of simple direct data-transfer under cover of the double handshake as described in section 30.



Figure 33: Direct data-transfer with Double Handshake: Block structure for FSMDs

State Machines: See figure 30 for the state machines for direct data-transfer via simple handshake.



Figure 34: Direct data-transfer with Double Handshake: Refinement to FSMs and datapaths

#### 5.3.2 Data-Transfer via Queue

Figure 35 shows how the queue is inserted in the transmission path. Between FSMD A and the queue, the familiar double handshake is used unchanged<sup>5</sup>. The handshake between queue and receiver (B) is modified however. It also is a double handshake, but it does not start with a "data ready" signal from the queue. As the queue is passive, the receiver has to start with a read request. The state machines in figure 36 show more details: The read request signal B\_Rd\_Req\_QC='1' is set in receiver state S\_v. When the data is ready to be read, the queue gives the signal QC\_Ready\_B='1'. The receiver reads the data in state S\_w and acknowledges with B\_Rd\_Req\_QC='0', which causes the queue to release QC\_Ready\_B again.

It may appear to be dangerous, that the receiver gives the acknowledge signal (B\_Rd\_Req\_QC='0') at the same time it reads the data, which causes the queue to go to its idle state 0 again. Doing so in an asynchronous design looks dangerous, because the provider of the data may catch the acknowledge a delta time too early. This queue however, has an output buffer and will not discard the data. It keeps the data until the receiver reads another data word. It even keeps the data when the sender sends data to the queue in the meantime.

But: The contexts of the previous paragraph are not the reason for the introduction of an output buffer: Common FIFOs for queuing data, usually discard the read set of data with the next clock cycle. This does not meet the requirements in this asynchronous data exchange, because the receiver may be slower than the queue. Therefore the queue output has to be buffered (see also figure 38).

There needs to be a control, which avoids data writing when the queue is full and data reading when the queue is empty. An initial control is build into the queue control: When the queue is empty, the read cycle is not started and QC\_Ready\_B='1' will not be set. This avoids the receiver from reading data. It waits in state S\_v. The queue control does not start a write cycle when the queue is full and A\_Ready\_QC='1' will not be signaled to the sender. This simple management, provided by the queue controller, allows a safe functionality, even if sender and receiver do not recognize "full" and "empty" signals. This makes it possible to insert a queue between two state machines in a late state of development by doing just minimal changes.

However, the state machines may be large and able

to do other tasks instead of waiting in a state for data exchange. Then, they should ask for the signals "full" and "ready", which tell them the reason for their waiting. The signal "Q\_Full" in figures 35 and 37 would be connected to the sender A and the signal "Q\_Empty" would be connected to the receiver B.

Figure 37 shows the refinement into FSMs and datapaths.

In figure 39, the state machine for the queue is changed into a simple FSM for controlling a datapath. Figure 38 is the datapath for this refined model. It shows the buffering built around an original FIFO-queue. The original queue used here is taken from the book "Gajski: Principles of Digital Design" (see references)<sup>6</sup>.

Figure 38 does not specify the data width. The depth is assumed to be four. The associated VHDL files of the queue datapath are fully generic, which means, depth and width can be specified by parameters. Thus, these VHDL files also provide an interesting example on how to write generic designs at low structural level. When the datapath is instantiated for connection with the FSM however, the depth is chosen to be four and the data width is 32.

<sup>&</sup>lt;sup>5</sup>Except from renaming the signal names: The letter "B", which stood for the receiver B, is replaced by "Q" for "queue" or "QC" for "queue control", respectively.

<sup>&</sup>lt;sup>6</sup>The author of this book says, this design is meant for small queues. He recommends an implementation of large queues by usage of a memory and address pointers.



Figure 35: Data-transfer via queue with Double Handshake: Block structure for FSMDs



Figure 36: Data-transfer via queue with Double Handshake: FSMDs



Figure 37: Data-transfer via queue with Double Handshake: Refinement to FSMs and Datapaths



The inner queue is taken from the book "Principles of Digital Design" by D. Gajski. The picture is slightly modified without changes in functionality or general structure. The following explanation of the queue is taken from that book:

[Write operation is selected by Q\_Wr='1' and Q\_En='1', Read operation is selected by Q\_Wr='0' and Q\_En='1']

[The signals] Full and Empty ... [are] indicating that the queue is empty whenever the counter content is all 1's, and that the queue is full whenever the counter content is equal to 011. ..."

Figure 38: Datapath of buffered queue

<sup>&</sup>quot;... Whenever data is queued, the shift register shifts data to the right and the counter is incremented. On the other hand, whenever data is read out, the data at the bottom of the queue is selected by the selector and the counter is decremented. Notice that data is not really discarded at that time, but rather is invalidated by decrementing the counter..."

<sup>&</sup>quot;... During a read operation, the content of the shift register will not change and the counter will count down by 1. During the write operation, however, the shift register will shift one position to the right and the counter will count up by 1. The counter also controls selection of the proper data during the read operation. As a rule it would be set to 1111 during the initialization, so that its content will be zero when the first data is in the queue. This negative bias of 1 in counting is necessary to accommodate the selector control, which requires a value of 00, 01, 10, or 11 in order to select one of the shifter outputs....

[The signals Full and Funty. [are] indicating that the queue is empty whenever the counter content is all 1's and that the queue is full whenever the counter.



Figure 39: Data-transfer via queue with Double Handshake: FSMs

### 5.3.3 Data-Transfer via Memory

Two design possibilities come to mind:

One way of modeling a data-transfer via memory is to use a similar mechanism like for the queue design: Sender and receiver request write and read of each data word after selecting a memory address. This solution means using double handshake for each data word. As a result a data-transfer will have a lot of protocol overhead, when the sender sends several words of data continuously or when the receiver reads several words continuously.

Therefore a different solution is presented here: An arbiter grants either the sender or the receiver the permission to access the memory (figure 40). With the permission, the sender or receiver can access the memory directly without any additional protocol overhead. Sender and receiver are connected to the memory by busses (figure 40):

• Command bus: nWE, nOE

• Address bus: Addr

• Data bus: Data\_and\_Stop7

Outputs of the FSMs and datapaths, which are connected to a bus, are tristate outputs. When a state machine does not access the memory, the tristate driver sets the port to "high-impedance" (which is represented by 'Z''s in VHDL). As a result, a non-active state machine behaves indifferently on the bus and an active one can set the bus values. The chip select signal nCS (negative logic) is handled differently. It is driven by an AND-gate instead of being connected to a bus. As soon as A\_nCS or B\_nCS becomes '0', chip select nCS will become '0' (= active), too. If A\_nCS and B\_nCS both are '1', then nCS will become '1', which switches off all memory ports. This could not be done by nCS being high-impedant ('Z').

The implementation of data-transfer via memory as provided here also has a disadvantage: Sender and receiver are responsible for correct timing when accessing the memory. When increasing the sender's or the receiver's clock frequency, at some point the data exchange will fail. This can only be avoided with a lot of additional effort and protocol overhead.

Figure 45 in the appendix shows a graphical template, which is sufficient for simple memory types. In that figure, the memory access control state machine

assures the correct timing. Many additional handshakes are needed to handle this. The signals printed bold had to be added. The grayed circles are the states where the data is transmitted to or from the memory, finally. For example, the sender starts writing the data by A\_WE<='0'. This causes the access control machine to make the transition to the graved dotted circle. The dotted circle stands for several states which are passed through for counting the necessary setup time for the memory. When the setup time is counted, the access control gives an OK to the receiver (AC nWE OK='1'). There are two more (ungrayed) dotted circles in the receive branch of the access control machine. They represent states which count for address and chip enable times (setup time and recovery time). The receiver side works respectively.

Using such an extended memory access management, the clock cycles of sender and receiver can be increased without trouble. No problems for the data-transfer timing will occur.

<sup>&</sup>lt;sup>7</sup>this is called Data\_and\_Stop, because a stop signal is attached to the data, which signals whether this is the last data word to process or not (see datapath A in figure 40).



Figure 40: Data-transfer via memory with Double Handshake: Structure



Figure 41: Data-transfer via memory with Double Handshake: FSMs

# 6 Describing the Models in VHDL

# Summary of Problems and Conclusions

This section collects the advantages and disadvantages of using VHDL for describing the models for High Level Hardware Synthesis: SFSMD, FSMD and "FSM Controlling Datapath".

Some problems are not caused by the language VHDL but actually reside within the models themselves. However a VHDL description has to cope with these problems, too. For a *general* overview of the models, their usage and suitability in the High Level Synthesis design flow, refer to section 1.

### 6.1 Single Models in VHDL

### 6.1.1 SFSMD and FSMD Without Time

These models do not handle time. They are described in procedures instead of a processes in this report, because a process has its focus on designing with time.

The abstract model SFSMD usually is described by a structured executable high level program. The only relation to hardware is a declaration of input and output ports.

FSMD without time is the original FSMD-model. Scheduling is done, but the description is still executed in zero time.

Advantages of VHDL description. VHDL meets the above mentioned requirements for structured procedural programming. It provides high level programming language-constructs and data types. Compared to other programming languages, VHDL provides a nicer way to declare the inputs and outputs of the algorithm by using an entity.

Disadvantages of VHDL description. Using VHDL as a general purpose programming language, things may become a little complicated, because VHDL is quite unflexible: The programmer is forced to use many hardware-related constructs, even if he does not describe hardware. For example, he has to start with an "architecture" and at least one process in it.

For type conversions, often rigid library functions with varying syntax are needed.

Low level programming for FSMD is easier than high level for SFSMD, however.

In these single state machines, there are just minor difficulties. But when exchanging data between several of them, problems become severe (see below).

### 6.1.2 FSMD With Clock

The usage of "FSMD With Clock" is advantageous for VHDL descriptions and it helps handling the models in general, too. A model using time is described in a process. (A description in a procedure is possible but not recommended.)

Advantages of this model This clocked model can be used *instead* of the model without time, or, it can be used as a refinement after the model without time. Using FSMD with clock, some timing problems may become obvious in advance and the step towards the model "FSM Controlling Datapath" becomes easier (also see below).

**Disadvantages of this model.** This model is not the original FSMD model. Usually, time is not regarded at this point.

Advantages of VHDL description. In many VHDL systems, using a clock with a specific cycle time enables additional possibilities for debugging. A full simulation can be done, displaying signals over time and cycles. Also, using the right subset of VHDL, this model may be synthesizable already. Then, a preliminary implementation on a FPGA is possible for testing purposes.

Intelligent high level synthesis tools can turn this model into an optimized model "FSM Controlling Datapath", which would not be possible if the model is described without timing and in a procedure. (An example is the "Behavioral Compiler" by "Synopsys".)

On top of this, a data exchange between state machines causes less trouble, when they are described in a process and use a clock.

**Disadvantages of VHDL description.** There are no significant VHDL related disadvantages or problems.

#### 6.1.3 FSM Controlling Datapath

In this model the state machine does not do the computation, but gives the appropriate command to a datapath.

<sup>&</sup>lt;sup>8</sup>which can also do rescheduling

Problems of the model. After a command for the datapath is set, it needs one more cycle for the datapath to execute it. This delay is not present in the model FSMD. In order to retain behavioral equivalence between the models, the delay has to be compensated by an early reaction of the FSM to the datapath's results. A resulting problem is the decrease of the allowed clock time.

Once this model-related problem is solved, there is no problem describing it in VHDL. It is done by a separate next state process.

Advantages of VHDL description. VHDL provides a good possibility to block structure the design (FSM, datapath with subblocks) by use of separate descriptions with entities. Many helpful constructs are provided by the language to describe hardware at this refinement level.

**Disadvantages of VHDL description.** There are no problems.

### 6.2 Communicating Models in VHDL

In this report, communication means data exchange with hand shake.

### 6.2.1 SFSMD and FSMD Without Time

Problems of the model Here, handshaking is tried without using timing, because theoretically, handshaking just is based on causality, not timing.

Disadvantages of VHDL description. Only signals can handle ports and communication. Therefore, many of them have to be used in addition to variables. Variables are still needed for sequential computation. For communication, a procedure must be able to catch the change of a signal while it is running. There is also the problem, that a signal can not be passed to a procedure, if the parameters are declared as variables. Due to these facts, the procedures need to access several signals directly, like accessing a "global variable". Coexistence and mixing of signals and variables as well as their different meanings and properties is confusing at these levels of abstraction.

Additionally, VHDL fails to provide a description which really does not use timing. Wait statements become necessary because of the signal's delta-delay property. This confuses even more.

Conclusion: Describing handshake in VHDL without using timing is not recommended. For the model SFSMD, sequentialize the procedures of the state machines instead. For FSMD do the same, or switch over to "FSMD with clock".

#### 6.2.2 FSMD With Clock

Introducing handshake into the model "FSMD with clock" is not difficult, because the model is simple: Each model consists of just one process and all actions of a state are executed by this one process. So, communication can be inserted easily. In the following model, there are problems, however.

### 6.3 FSM Controlling Datapath

When introducing a data exchange with handshaking between two of these models, the handshake signals are connected between the FSMs and the data is exchanged between the datapaths.

Problems of the model In this model, the FSM is used to compensate the delay of the datapath. However, firstly, the data which is exchanged, is not read by a FSM but by another datapath. So, a data exchange via the data ports is not compensated. Secondly, the FSMs compensate a non-existent delay of the handshake signals.

As of these facts, timing of double handshake is diffrent between "FSM Controlling Datapath" and FSMD. This makes the step from the first model to the second complicated and confusing. But these general problems are solved and the solutions can be reused.

**Disadvantages of VHDL description.** There are no additional VHDL related problems.

### 6.4 Final Conclusion

In general, for each model, a corresponding VHDL description can be written. However, VHDL is suited better for low level clocked models than for abstract ones. A communication between state machines, which do not use a clock, should be avoided in VHDL.

The delay of data processing, which comes along with the introduction of a datapath, causes many problems. These are solved by this report in a generic way. Templates and examples are provided.

# References

- [1] Daniel D. Gajski. *Principles of Digital Design*, Prentice Hall, August 1996.
- [2] Shuquing Zhao, Daniel D. Gajski. Communication Synthesis with Custom Hardware, Technical Report, Department of Information and Computer Science, University of California, Irvine, coming soon.
- [3] A. Gerstlauer, Shuquing Zhao, Daniel D. Gajski. Design of a GSM Vocoder using SpecC Methodology, Technical Report, Department of Information and Computer Science, University of California, Irvine, December 1998.
- [4] Shuquing Zhao, Daniel D. Gajski. Communication Synthesis Issues in SpecC Environment, Technical Report, Department of Information and Computer Science, University of California, Irvine, June 1999.

# A Queue with Single Handshake

# A.1 FSMD



Figure 42: Statemachines for Single Handshake via Queue (FSMD)

# A.2 FSM for "FSM Controlling Datapath"



Figure 43: Statemachines for Single Handshake via Queue (FSM Controlling Datapath)

# B Extended Ways of Buffered Datatransfer

# B.1 FSM for Queue with Simultaneous Read/Write



Figure 44: Statemachines for Doublehandshake via Queue with simultaneous Read/Write

# B.2 Extended Timing for Memory



Figure 45: Statemachines with Extended Timing Properties for Doublehandshake via Memory (FSM)

# C VHDL examples

This appendix lists only the main files.

Tesbench files and datapath components are not listed. A directory of computer files belongs to this report. There, all files can be found and the examples can be executed.

# C.1 Single Finite State Machines

- Single SFSMD
- $\bullet\,$  Single FSMD without time
- Single FSMD using clock
- Single FSM Controlling Datapath
  - FSM
  - Datapath

### C.1.1 Single SFSMD

```
VHDL/plain_SMs/SFSMD/SFSMD_ex.vhd
  -- example of a SFSMD inherent in a procedure
  Entity SFSMD_ex is
     port (Reset, Start:
                                 in bit;
            In1, In2, In3, In4: in integer;
            Out1:
                                  out integer);
  end SFSMD_ex;
10 Architecture SFSMD_ex_behavioral of SFSMD_ex is
  Procedure behavior1(In1, In2, In3, In4: in integer;
                        signal Out1: out integer) is
type State_Set is (S_1, S_2, S_3, S_END);
15 -- S_BEGIN is represented by "Procedure not running".
  variable next_STATE: State_Set;
  variable R, S: integer;
  begin
20 next_STATE:= S_1;
  while (next_STATE/=S_END) and (Reset/='1') loop
    case next_STATE is
                     -- S:= In1 ** In2; -- (power)
      when S_1 = >
                     S := 1;
25
                     for i in In2 downto 1 loop
                        S := S * In1;
                     end loop;
                     next_STATE := S_2;
30
      when S_2 =>
                     --R := In3 ** In4; -- (power)
                     R:=1;
                     for i in In4 downto 1 loop
                       R:= R*In3;
35
                     end loop;
                     next_STATE := S_3:
      when S_3 =>
                     Out1 \le S-R;
40
                     next\_STATE := S\_END;
      when S_END = > -- nothing (Procedure quits)
    end case;
  end loop;
  end behavior1;
  begin -- Architecture
  P1: Process
55 begin
    wait until (Reset = '1') or (Start = '1');
    if (Reset = ', 1') then
      Out1 \leq 0;
    elsif (Start = '1') then
     behavior1 (In1, In2, In3, In4, Out1);
    end if;
  end process:
```

end SFSMD\_ex\_behavioral;

### C.1.2 Single FSMD without time

```
VHDL/plain_SMs/FSMD_no_time/FSMD_no_time_ex.vhd
  -- example of a FSMD (Without time) inherent in a procedure
  Entity FSMD_no_time_ex is
     port (Reset, Start:
                                   in bit;
            In1. In2:
                                   in integer;
            O_Port:
                              out integer);
  end FSMD_no_time_ex;
10 Architecture FSMD_no_time_ex_behavioral of FSMD_no_time_ex is
  Procedure behavior1(In1, In2: in integer;
                        O: inout integer) is
type State_Set is (S_1, S_2, S_3, S_END);
15 -- S_BEGIN is represented by "Procedure not running".
  variable next_STATE: State_Set;
  variable A, B: integer;
  variable Mult_temp: integer;
20 begin
  next\_STATE := S_1;
  while (next_STATE/=S_END) and (Reset/='1') loop
    case next_STATE is
      when S_1 =>
                      A := In1;
                      B := In 2;
                      O := 1;
                      next\_STATE := S\_2;
30
      when S_2 =>
                      Mult\_temp := O * A;
                      B := B - 1;
                      next\_STATE := S\_3 :
35
      when S_3 =>
                      O \,:=\, \, Mult\_temp\,;
                      if (B > 0)
                        then next_STATE:= S_2;
                         else next_STATE:= S_END;
40
                      end if;
      when S_END=> -- nothing (Procedure quits)
   end case;
  end loop;
  end behavior1;
  begin -- Architecture
  P1: Process
  variable O: integer;
55 begin
    \overline{\text{wait}} until (Reset = '1') or (Start = '1');
     if (Reset = '1') then
      O_Port <= 0;
     elsif (Start = '1') then
      behavior1 (In1, In2, O); O_{\bullet}Port \le O;
    end if:
  end process;
```

end FSMD\_no\_time\_ex\_behavioral;

### C.1.3 Single FSMD using clock

```
VHDL/plain_SMs/FSMD_clock/FSMD_clock_ex.vhd
  -- example of a FSMD (using clock) inherent in a process
  library IEEE;
  use IEEE. std_logic_1164.all;
5 use IEEE.std_logic_arith.all;
  use IEEE. std_logic_unsigned . all;
  Entity FSMD_clock_ex is
     port (Clock:
                                  in std_logic;
            Reset\;,\;\;Start:
                                  in std_logic;
                                  out std_logic;
            Done:
            In\,1\;,\quad In\,2:
                                  in std_logic_vector(15 downto 0);
                              out std_logic_vector(31 downto 0));
            O_Port:
  end FSMD_clock_ex;
1.5
  Architecture FSMD_clock_ex_behavioral of FSMD_clock_ex is
  -- behavior outputs\\
20 signal O: std_logic_vector(31 downto 0);
  signal Mult_temp: std_logic_vector(31 downto 0);
  type State_Set is (S_BEGIN, S_1, S_2, S_3, S_END); signal next_STATE: State_Set;
  signal A, B: std_logic_vector (15 downto 0);
  begin -- Architecture
  behavior1: Process(Clock)
  begin
    if (Clock'event and Clock = '1') then
      if (Reset = '1') then
        O <= (others = > '0');
         next_STATE <= S_BEGIN;
       else
35
         case next_STATE is
           when S_BEGIN => Done <= '0';
                             if Start = '1'
                               then next\_STATE \le S_1;
40
                               else next_STATE <= S_BEGIN;</pre>
                             end if;
           when S_1 =>
                          A <= In 1:
                          B \le In 2;
45
                          O \le conv\_std\_logic\_vector(1, 32);
                           next\_STATE \le S_2;
           when S_2 =>
                           Mult\_temp \le O(15 \ downto \ 0) * A;
                           B \le B - conv_std_logic_vector(1, 16);
                           next_STATE \le S_3;
           when S_3 = >
55
                          O <= Mult_temp;
                           if (B > conv_std_logic_vector(0, 16))
                             then next_STATE \leq S_2;
                             else next_STATE <= S_END;
                           end if;
60
           when S_END=> Done <= '1';
```

### C.1.4 Single FSM Controlling Datapath

```
VHDL/plain_SMs/FSM_structural/FSM_and_D_ex.vhd
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE.std_logic_arith.all;
5 Entity FSM_plus_DP is
     Port (Clock:
                              in std_logic;
            Reset, Start:
                              in std_logic;
            Done:
                              out std_logic;
            In 1, In 2:
                              in std_logic_vector(15 downto 0);
            O_Port:
                              out std_logic_vector(31 downto 0));
  end FSM_plus_DP;
  Architecture FSM_plus_DP_behavioral of FSM_plus_DP is
15 -- connection signals
  signal ld_A , ld_B , Count_En , Count_M: std_logic ;
  signal CMP:
                         std_logic_vector (1 downto 0);
  signal MUX_sel:
signal ld_O:
                         std_logic;
                         std_logic;
20 signal DP_Reset:
                         std_logic;
  component FSM
     Port (Clock:
                                   in std_logic;
            Reset, Start:
                                   in std_logic;
            Done:
                                   out std_logic;
25
            \mathsf{DP} \mathsf{\_Reset}:
                                   out std_logic;
            ld_A , ld_B , Count_En , Count_M: out std_logic ;
            CMP:
                                   in std_logic_vector(1 downto 0);
            MUX_{sel}:
                                   out std_logic;
            ld_O:
                                   out std_logic );
  end component;
  component DP
     port (Clock:
                              in std_logic;
                              in std_logic;
35
            ld_A , ld_B , Count_En , Count_M: in std_logic ;
            CMP:
                              out std_logic_vector (1 downto 0);
            \mathrm{MUX\_sel}:
                              in std_logic;
            ld _O :
                              in std_logic;
            In1, In2:
                              in std_logic_vector (15 downto 0);
            O_Port:
                              out std_logic_vector(31 downto 0));
  end component;
  begin
  Control: FSM
     Port map (
                  Clock=>Clock,
                  Reset=>Reset,
                  Start=>Start,
                  Done=>Done,
                  DP_Reset=>DP_Reset,
                  ld_A = > ld_A,
                  ld_B = > ld_B
                  Count_En=>Count_En,
                  Count_M=>Count_M,
55
                  CMP=>CMP,
                  MUX_{sel} = > MUX_{sel},
                  ld_O = > ld_O);
60 Datapath: DP
     Port map (
                  Clock=>Clock.
                  DP_Reset=>DP_Reset,
                  ld_A = > ld_A,
```

#### C.1.5 FSM

```
VHDL/plain_SMs/FSM_structural/FSM_ex.vhd
  -- example of a FSM inherent in a process
  library IEEE:
  use IEEE. std_logic_1164.all;
5 use IEEE.std_logic_arith.all;
  Entity FSM is
     port (Clock:
                               in std_logic;
           Reset, Start:
                               in std_logic;
           Done:
                               out std_logic;
1.0
           \mathsf{DP} \mathsf{\_Reset}:
                               out std_logic;
           ld_A , ld_B , Count_En , Count_M: out std_logic ;
                               in std_logic_vector(1 downto 0);
           MUX_{-}sel:
                               out std_logic;
           ld_O :
                               out std_logic );
1.5
  end FSM:
  Architecture FSM_behavioral of FSM is
  -- behavior outputs
  signal STATE, next_STATE: std_logic_vector(2 downto 0);
  signal Enable: std_logic;
25 begin -- Architecture
  Enable <= '1':
  State_Register: Process(Clock, Enable)
30 begin
    if (Clock'event and Clock = '1') then
           (Reset = '1') then STATE <= "000";
(Enable = '1') then STATE <= next_STATE;
      i f
      end if:
  end if;
  end Process;
  -- Datapath-Control and Status Output Logic
  -- Datapath Control Output:
  STATE(0);
  ld_B
           <= ( not STATE(2) and not STATE(1) and
45 Count_M \leq ( not STATE(2) and
                                     STATE(1) and not STATE(0);
  Count_En <= ( not STATE(2) and
                                      STATE(1) and not STATE(0));
  MUX_sel <= ( not STATE(2) and
                                      STATE(1) and not STATE(0)
           OR ( not STATE(2) and
                                      STATE(1) and
                                                       STATE(0);
           <= ( not STATE(2) and not STATE(1) and
  ld_O
                                                       STATE(0))
           OR ( not STATE(2) and
                                     STATE(1) and
                                                       STATE(0);
  -- Controller Status Output
                                      STATE(1) and
  Done
          \langle = ( STATE(2)  and
                                                       STATE(0);
  -- Next State Logic
  next_STATE(2)<=(not STATE(2) and
                                        STATE(1) and
                                                         STATE(0) and not CMP(0);
  next_STATE(1)<=(not STATE(2) and not STATE(1) and
                                                         STATE(0))
              OR (not STATE(2) and
                                        STATE(1) and not STATE(0))
              OR (not STATE(2) and
                                        STATE(1) and
                                                         STATE(0) and
                                                                          CMP(0)
              OR (not STATE(2) and
                                        STATE(1) and
                                                         STATE(0) and not CMP(0);
  next_STATE(0)<=(not STATE(2) and not STATE(1) and not STATE(0) and START)
```

```
OR (not STATE(2) and STATE(1) and not STATE(0))
OR (not STATE(2) and STATE(1) and STATE(0) and not CMP(0));
end FSM_behavioral;
```

#### C.1.6 Datapath

```
VHDL/plain_SMs/Datapath/DP.vhd
  -- Datapath (DP) for FSM
  library IEEE;
  use IEEE. std_logic_1164.all;
5 use IEEE.std_logic_arith.all;
  Entity DP is
     port (Clock:
                            in std_logic;
            DP_Reset:
                            in std_logic;
           ld_A , ld_B , Count_En , Count_M: in std_logic ;
10
                             out std_logic_vector (1 downto 0);
           MUX_{sel}:
                            in std_logic;
            ld _O :
                            in std_logic;
                            in std_logic_vector(15 downto 0);
            In\,1\;,\quad In\,2\;:
1.5
            O_Port:
                             out std_logic_vector(31 downto 0));
  end DP:
  Architecture DP_schematic of DP is
     -- connection signals
     signal A, B: std_logic_vector (15 downto 0);
     signal O: std_logic_vector(31 downto 0);
     signal MULT_Out, MUX_Out: std_logic_vector(31 downto 0);
     signal zero16: std_logic_vector(15 downto 0);
25
     signal one32: std_logic_vector(31 downto 0);
     component Reg_16bit
        Port ( Clock:
                          in std_logic;
                Reset:
                          in std_logic;
30
                Load:
                          in std_logic;
                Data_In :
                          in std_logic_vector (15 downto 0);
                Data_Out: out std_logic_vector (15 downto 0));
     end component;
35
     component Reg_32bit
        Port ( Clock:
                          in std_logic;
                          in std_logic;
                Reset:
                          in std_logic;
                Load:
                          in std_logic_vector (31 downto 0);
40
                Data_In :
                Data_Out: out std_logic_vector (31 downto 0));
     end component;
     component Counter
        Port ( Clock:
                          in std_logic;
45
                Reset:
                          in std_logic;
                          in std_logic;
                Load:
                Enable:
                          in std_logic;
                Mode:
                          in std_logic;
                Data_In :
                          in std_logic_vector (15 downto 0);
50
                Data_Out: out std_logic_vector (15 downto 0));
     end component;
     component Comp_16bit
                            in std_logic_vector(15 downto 0);
55
        Port ( Data_In1:
                Data_In2:
                            in std_logic_vector (15 downto 0);
                Result:
                             out std_logic_vector (1 downto 0));
     end component;
     component Multiplier
        Port ( Clock:
                             in std_logic;
                Data_In1:
                             in std_logic_vector (15 downto 0);
                            in std_logic_vector (15 downto 0);
                Data_In2:
```

```
Data_Out:
                                out std_logic_vector (31 downto 0));
      end component;
65
      component MUX_2x32bit
          Port (Sel:
                                in std_logic;
                  Data_In1:
                                in std_logic_vector (31 downto 0);
                  \mathrm{D}\,\mathtt{at}\,\mathtt{a}_{\blacksquare}\mathrm{In}\,2 :
                                in std_logic_vector (31 downto 0);
70
                  Data_Out:
                                out std_logic_vector (31 downto 0));
      end component;
   begin
75
   zero16 \le (others=>'0');
   one32 <= (0=>'1', others=>'0');
   Register_A: Reg_16bit
     Port Map ( Clock=>Clock, Reset=>DP_Reset, Load=>ld_A, Data_In=>In1, Data_Out=>A);
   Counter_B: Counter
      Port Map ( Clock=>Clock, Reset=>DP_Reset, Load=>ld_B,
                   Enable=>Count_En, Mode=>Count_M, Data_In=>In2, Data_Out=>B);
   COMP: Comp_16bit
      Port Map ( Data_In1=>zero16, Data_In2=>B, Result=>CMP);
   MULT: Multiplier
      Port Map ( Clock=>Clock, Data_In1=>A, Data_In2=>O(15 downto 0),
                   Data_Out=>MULT_Out);
95 MUX: MUX_2x32bit
      Port Map ( Sel=>MUX_sel, Data_In1=>one32,
                   Data_In2=>MULT_Out,
                   Data_Out=>MUX_Out);
100 Register_O: Reg_32bit
      Port Map ( Clock=>Clock, Reset=>DP_Reset, Load=>ld_O,
                   Data_In=>MUX_Out, Data_Out=>O);
   O_Port \le O;
105
   end DP_schematic;
```

# C.2 Double Handshake

- A (FSMD without time)  $\rightarrow$  B (FSMD without time)
  - FSMD A
  - FSMD B
- A (FSMD using clock) → B (FSMD using clock)
  - FSMD A
  - FSMD B
- A (FSM Controlling Datapath)  $\rightarrow$  B (FSM Controlling Datapath)
  - FSM Controlling Datapath A
    - \* FSM of A
    - \* Datapath of A (identical B)
  - FSM Controlling Datapath B
    - \* FSM of B
    - \* Datapath of B (see B, identical B)

### C.2.1 A (FSMD without time) $\rightarrow$ B (FSMD without time)

```
VHDL/doubleHS_SMs/FSMD_dHS_no_time/FSMD_dHS_no_time_ex_top.vhd
                   FSMD\_dHS\_no\_time\_ex\_top\ .\ vh\ d
  -- bounding the two parts of the example
  -- for Example: FSMD(A) --> FSMD(B)
                   using double handshake protocol
  library IEEE;
  use IEEE.std_logic_1164.all;
  use IEEE.std_logic_arith.all;
  use STD. textio.all;
  entity FSMD_dHS_no_time_ex_top is
     port (Reset, Start:
                             in bit;
            TB_Ready_A, TB_Ackn_B: in bit;
A_Ackn_TB, B_Ready_TB: out bit;
            In1, In2:
                                 in integer;
15
            In_Stop:
                                 in bit;
            Out1_Port:
                                out integer;
            Out1_Stop_Port:
                                out bit);
  end FSMD_dHS_no_time_ex_top;
  architecture FSMD_dHS_no_time_ex_top_arch of FSMD_dHS_no_time_ex_top is
  Component FSMD_A_dHS_no_time_ex
     port (Reset, Start:
                                 in bit;
25
            TB_Ready_A, B_Ackn_A: in bit;
            A_Ackn_TB, A_Ready_B: out bit;
            In 1, In 2:
                                 in integer;
            In\_Stop:
                                 in bit;
            Data_AB_Port:
                                   out integer;
            Data_AB_Stop_Port:
                                   out bit);
  end Component;
  Component FSMD_B_dHS_no_time_ex
     port (Reset, Start:
                                 in bit;
            A_Ready_B, TB_Ackn_B: in bit;
35
            B_Ackn_A, B_Ready_TB: out bit;
            Data_AB:
                               in integer;
            Data_AB_Stop:
                                    in bit:
            Out1_Port:
                                out integer;
            Out1_Stop_Port:
                                out bit);
  end Component;
  signal B_Ackn_A, A_Ready_B: bit;
  signal Data_AB:
                                 integer;
45 signal Data_AB_Stop:
                                bit;
  begin
  FSMD_A_dHS_no_time_I: FSMD_A_dHS_no_time_ex
       port map ( Reset=>Reset, Start=>Start,
            TB_Ready_A=>TB_Ready_A, B_Ackn_A=>B_Ackn_A,
            A_Ackn_TB=>A_Ackn_TB, A_Ready_B=>A_Ready_B,
            In 1 = > In 1, In 2 = > In 2,
            In_Stop => In_Stop,
            Data_AB_Port=>Data_AB,
55
            Data_AB_Stop_Port=>Data_AB_Stop );
  FSMD_B_dHS_no_time_I: FSMD_B_dHS_no_time_ex
       port map ( Reset=>Reset, Start=>Start,
            A_Ready_B=>A_Ready_B, TB_Ackn_B=>TB_Ackn_B,
60
            B_Ackn_A=>B_Ackn_A, B_Ready_TB=>B_Ready_TB,
            Data_AB=>Data_AB,
```

Data\_AB\_Stop=>Data\_AB\_Stop,

```
Out1_Port=>Out1_Port,
Out1_Stop_Port=>Out1_Stop_Port );
end FSMD_dHS_no_time_ex_top_arch;
```

### C.2.2 FSMD A

```
VHDL/doubleHS_SMs/FSMD_dHS_no_time/FSMD_dHS_no_time_ex_A.vhd
  -- File:
                    FSMD\_dHS\_no\_time\_ex\_A . vhd
  -- implements:
                    FSMD A
  -- of Model:
                    FSMD (without using time)
  -- for Example: FSMD(A) --> FSMD(B)
                    using double handshake protocol
  {\tt Entity \ FSMD\_A\_dHS\_no\_time\_ex \ is}
     port (Reset, Start:
                                  in bit;
            TB_Ready_A, B_Ackn_A: in bit;
            A_Ackn_TB, A_Ready_B: out bit;
1.0
            In1, In2:
                                  in integer;
            In_Stop:
                                  in bit;
            Data_AB_Port:
                                    out integer;
            {\tt Data\_AB\_Stop\_Port}:
                                          out bit);
15 end FSMD_A_dHS_no_time_ex;
  Architecture FSMD_A_dHS_no_time_ex_behavioral of FSMD_A_dHS_no_time_ex is
20 -- behavior outputs
  signal Data_AB: integer;
  signal Data_AB_Stop: bit;
  Procedure behavior1(signal TB_Ready_A, B_Ackn_A: in bit;
                         signal A_Ackn_TB, A_Ready_B: out bit;
                         signal In1, In2: in integer;
                         signal In_Stop: in bit;
                         signal Data_AB: inout integer;
                         signal Data_AB_Stop: out bit) is
30 type State_Set is (S_1, S_2, S_3, S_4, S_5, S_6, S_END); -- S_BEGIN is represented by "Procedure not running".
  variable next_STATE: State_Set;
  variable A, B: integer;
  variable Stop: bit;
  begin
  next\_STATE := S\_1 :
  Data\_AB \le 0;
  Data\_AB\_Stop <= '0';
40 A_Ackn_TB<='0';
  A_Ready_B <= '0';
  while (next_STATE/=S_END) and (Reset/='1') loop
    case next_STATE is
      when S_1 =>
                      if (TB_Ready_A='1')
                        then next_STATE:= S_2;
                         else next_STATE:= S_1;
                      end if;
      when S_2 =>
                      A := In1;
                      B := In 2;
                      Stop := In_Stop;
                      A_Ackn_TB \le 1';
55
                      if (TB_Ready_A='0')
                        then next_STATE:= S_3;
                         else next_STATE := S_2;
                      end if;
60
      when S_3 =>
                      Data\_AB \le A - B;
                      Data_AB_Stop <= Stop;
```

```
A_Ackn_TB \le 0;
65
                      next\_STATE := S\_4;
       when S_4 =>
                      next_STATE := S_5;
70
       when S_5 =>
                      A_Ready_B \le 1';
                      if (B_Ackn_A='1')
                        then next_STATÉ:=S_6;
75
                        else next\_STATE := S\_5;
                      end if;
       when S_6 =>
                      A_Ready_B \le 0;
80
                      if (B_Ackn_A='0')
                        then if (Stop='1')
                                then next_STATE:=S_END;
                                else next\_STATE := S\_1;
                             end if:
85
                        else next_STATE := S_6;
                      end if;
       when S_END=> -- nothing (Procedure quits)
90
     end case;
     wait for 1 ns; -- Synopsys needs wait > 0 ns
   end loop;
95 end behavior1;
   begin -- Architecture
100 P1: Process
   begin
     wait until (Reset = '1') or (Start = '1');
     if (Reset = ', 1') then
       Data\_AB \le 0;
       A_Ackn_TB \le 0';
       A \_Ready \_B <= '0';
     elsif (Start = '1') then
       behavior1 (TB_Ready_A, B_Ackn_A, A_Ackn_TB, A_Ready_B,
                  In1, In2, In_Stop, Data_AB, Data_AB_Stop);
110 end if;
   end process;
   -- Entity Outputs
   Data_AB_Port <= Data_AB;
115 Data_AB_Stop_Port <= Data_AB_Stop;
   end FSMD_A_dHS_no_time_ex_behavioral;
```

#### C.2.3 FSMD B

```
VHDL/doubleHS_SMs/FSMD_dHS_no_time/FSMD_dHS_no_time_ex_B.vhd
  -- File:
                   FSMD\_dHS\_no\_time\_ex\_B. vhd
  -- implements:
                   FSMD B
  -- of Model:
                   FSMD (without using time)
  -- for Example: FSMD(A) --> FSMD(B)
                   using double handshake protocol
  Entity FSMD_B_dHS_no_time_ex is
     port (Reset, Start:
                                in bit;
            1.0
            Data_AB:
                               in integer;
            Data_AB_Stop:
                                    in bit:
            Out1_Port:
                                out integer;
            Out1_Stop_Port:
                                out bit);
15 end FSMD_B_dHS_no_time_ex:
  Architecture FSMD_B_dHS_no_time_ex_behavioral of FSMD_B_dHS_no_time_ex is
20 -- behavior outputs
  signal Out1: integer;
  signal Out1_Stop: bit;
  Procedure behavior1(signal A_Ready_B, TB_Ackn_B: in bit;
                        signal B_Ackn_A, B_Ready_TB: out bit;
                        signal Data_AB: in integer;
                        signal Data_AB_Stop: in bit;
                        signal Out1: inout integer;
                        signal Out1_Stop: out bit) is
30 type State_Set is (S_1, S_2, S_3, S_4, S_5, S_6, S_END); -- S_BEGIN is represented by "Procedure not running".
  variable next_STATE: State_Set;
  variable C: integer;
  variable Stop: bit;
  begin
  next\_STATE := S\_1 :
  Out1 \leq 0;
  Out1\_Stop <= \ '0';
40 B_Ackn_A<='0';
  B_Ready_TB \le 0;
  while (next_STATE/=S_END) and (Reset/='1') loop
    case next_STATE is
      when S_1 =>
                      if (A_Ready_B='1')
                       then next_STATE := S_2;
                        else next_STATE:= S_1;
                     end if;
      when S_2 =>
                     C := Data\_AB;
                     Stop := Data_AB_Stop;
                     B_Ackn_A <= '1';
55
                      if (A_Ready_B='0')
                        then next\_STATE := S\_3;
                        else next_STATE := S_2;
                     end if:
60
      when S_3 = >
                     Out1 <= 2 * C:
                      Out1\_Stop \le Stop;
                     B_Ackn_A <= '0';
```

```
next_STATE := S_4;
65
       when S_4 =>
                      next\_STATE := S\_5;
70
       when S_5 =>
                      B_Ready_TB \le 1';
                      if (TB_Ackn_B='1')
                        then next\_STATE := S\_6;
                        else next_STATE := S_5;
75
                      end if;
       when S_6 =>
                      B_Ready_TB \le 0';
                      if (TB_Ackn_B='0')
80
                        then if (Stop='1')
                                then next_STATE:=S_END;
                                else next\_STATE := S\_1;
                              end if;
                        else next_STATE := S_6;
85
                      end if;
       when S_END=> -- nothing (Procedure quits)
     end case;
     wait for 1 ns; -- Synopsys needs wait > 0 ns
   end loop;
   end behavior1;
   begin -- Architecture
   P1: Process
100 begin
     wait until (Reset = '1') or (Start = '1');
     if (Reset = '1') then
       Out1 \leq 0;
       B_Ackn_A \le 0;
       B_Ready_TB \le 0;
     elsif (Start = '1') then
       behavior1 (A_Ready_B, TB_Ackn_B, B_Ackn_A, B_Ready_TB,
                  Data_AB , Data_AB_Stop , Out1 , Out1_Stop );
     end if;
110 end process;
   -- Entity Outputs
   Out1\_Port <= Out1;
   Out1_Stop_Port <= Out1_Stop;
115
   end FSMD_B_dHS_no_time_ex_behavioral;
```

# C.2.4 A (FSMD using clock) → B (FSMD using clock)

```
VHDL/doubleHS_SMs/FSMD_dHS_clock/FSMD_dHS_clock_ex_top.vhd
  -- File:
                   FSMD\_dHS\_clock\_ex\_top. vhd
  -- bounding the two parts of the example
  -- for Example: FSMD(A) --> FSMD(B)
                   using double handshake protocol
  library IEEE;
  use IEEE.std_logic_1164.all;
  use IEEE. std_logic_arith . all;
  use STD. textio. all;
  entity FSMD_dHS_clock_ex_top is
     port (Reset, Start:
                                in std_logic;
                                in std_logic;
            Clock_A , Clock_B :
            TB_Ready_A, TB_Ackn_B: in std_logic;
            A_Ackn_TB, B_Ready_TB: out std_logic;
1.5
           In1, In2:
                                in integer;
            In_Stop:
                                in std_logic;
            Out1_Port:
                                out integer;
            Out1_Stop_Port:
                                out std_logic);
20 end FSMD_dHS_clock_ex_top;
  architecture FSMD_dHS_clock_ex_top_arch of FSMD_dHS_clock_ex_top is
  Component FSMD_A_dHS_clock_ex
     port (Reset, Start:
                                in std_logic;
            Clock_A:
                                in std_logic;
            TB_Ready_A, B_Ackn_A: in std_logic;
            A_Ackn_TB, A_Ready_B: out std_logic;
            In1, In2:
                                in integer;
            In_Stop:
                                 in std_logic;
30
            Data_AB_Port:
                                  out integer;
            Data_AB_Stop_Port:
                                   out std_logic);
  end Component;
35 Component FSMD_B_dHS_clock_ex
     port (Reset, Start:
                                in std_logic;
            Clock_B:
                                in std_logic;
            A_Ready_B, TB_Ackn_B: in std_logic;
            B_Ackn_A, B_Ready_TB: out std_logic;
            Data_AB:
                              in integer;
40
            Data_AB_Stop:
                                    in std_logic;
            Out1_Port:
                                out integer;
            Out1_Stop_Port:
                                out std_logic);
  end Component:
  signal B_Ackn_A, A_Ready_B: std_logic;
  signal Data_AB:
                                integer;
  signal Data_AB_Stop:
                                std_logic;
50 begin
  FSMD_A_dHS_clock_I: FSMD_A_dHS_clock_ex
       port map ( Reset=>Reset, Start=>Start, Clock_A=>Clock_A,
            TB_Ready_A=>TB_Ready_A, B_Ackn_A=>B_Ackn_A,
55
            A\_Ackn\_TB = > A\_Ackn\_TB, A\_Ready\_B = > A\_Ready\_B,
            In 1=>In 1, In 2=>In 2,
            In_Stop => In_Stop,
            Data_AB_Port=>Data_AB,
            Data_AB_Stop_Port=>Data_AB_Stop );
  FSMD_B_dHS_clock_I: FSMD_B_dHS_clock_ex
       port map ( Reset=>Reset, Start=>Start, Clock_B=>Clock_B,
            A_Ready_B=>A_Ready_B, TB_Ackn_B=>TB_Ackn_B,
```

```
B_Ackn_A=>B_Ackn_A, B_Ready_TB=>B_Ready_TB,

Data_AB=>Data_AB,

Data_AB_Stop=>Data_AB_Stop,

Out1_Port=>Out1_Port,

Out1_Stop_Port=>Out1_Stop_Port

70 end FSMD_dHS_clock_ex_top_arch;
```

#### C.2.5 FSMD A

```
VHDL/doubleHS_SMs/FSMD_dHS_clock/FSMD_dHS_clock_ex_A.vhd
  -- File:
                   FSMD\_dHS\_clock\_ex\_A . vhd
  -- implements:
                   FSMD A
  -- of Model:
                   FSMD (using clock)
  -- for Example: FSMD(A) --> FSMD(B)
                   using double handshake protocol
5 - -
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE. std_logic_arith . all;
  -- FSMD A of example of two FSMD (Without time) exchanging data
  Entity FSMD_A_dHS_clock_ex is
     port (Reset, Start:
                                 in std_logic;
1.5
            Clock_A:
                                in std_logic;
            TB_Ready_A, B_Ackn_A: in std_logic;
            A_Ackn_TB, A_Ready_B: out std_logic;
            In1, In2:
                                 in integer;
            In_Stop:
                                 in std_logic;
                                  out integer;
            Data_AB_Port:
20
            Data_AB_Stop_Port:
                                        out std_logic);
  end FSMD_A_dHS_clock_ex;
25 Architecture FSMD_A_dHS_clock_ex_behavioral of FSMD_A_dHS_clock_ex is
  -- outputs
  signal Data_AB: integer;
  signal Data_AB_Stop: std_logic;
30 -- internal signals
  type State_Set is (S_BEGIN, S_1, S_2, S_3, S_4, S_5, S_6, S_END);
  signal next_STATE: State_Set;
  signal A, B: integer;
  signal Stop: std_logic;
  begin -- Architecture
  behavior1: process (Clock_A)
  begin
    if (Clock_A' event and Clock_A='1') then
      if (Reset = '1') then
        Data\_AB \le 0;
        Data\_AB\_Stop <= '0';
        A_Ackn_TB \le 0';
         A_Ready_B \le 0';
45
        next_STATE <= S_BEGIN;
      else
        case next_STATE is
          when S\_BEGIN => Data\_AB <= 0;
                            Data\_AB\_Stop <= '0';
                            A_Ackn_TB \le 0;
                            A_Ready_B \le 0;
                            if Start = '1'
55
                              then next_STATE \leq S_1;
                              else next_STATE <= S_BEGIN;</pre>
                            end if;
          when S_1 =>
60
                           if (TB_Ready_A='1')
                             then next_STATE\leq S_2;
```

```
else next_STATE<= S_1;
                             end if;
65
             when S_2 =>
                             A\,<=\,\operatorname{In}1\,;
                             B \le In 2;
                             Stop <= In_Stop;
                              A_A ckn_T B \le 1';
70
                              if (TB\_Ready\_A='0')
                                then next_STATE \leq S_3;
                                else next_STATE \le S_2;
                              end if;
75
             when S_3 =>
                              Data\_AB \le A - B;
                              Data_AB_Stop <= Stop;
                              A_A ckn_TB < = '0';
80
                              next\_STATE \le S\_4;
             when S_4 =>
                             next\_STATE \le S\_5;
85
             when S_5 =>
                             A_Ready_B<='1';
                              if (B_Ackn_A='1')
                                then next_STATE<=S_6;
90
                                else next_STATE<=S_5;
                              end if:
             when S_6 =>
                             A_Ready_B<='0';
95
                              if (B_Ackn_A='0')
                                then if (Stop='1')
                                        then next_STATE<=S_END;
                                        else next_STATE \le 1;
100
                                      end if;
                                else next_STATE \le 5_6;
                             end if;
             when S_END=> -- nothing (Procedure quits)
105
           end case;
         end if;
      end if;
110 end process;
   -- Entity Outputs
Data_AB_Port <= Data_AB;
115 Data_AB_Stop_Port <= Data_AB_Stop;</pre>
   end FSMD_A_dHS_clock_ex_behavioral;
```

#### C.2.6 FSMD B

```
VHDL/doubleHS_SMs/FSMD_dHS_clock/FSMD_dHS_clock_ex_B.vhd
                   FSMD\_dHS\_clock\_ex\_B. vhd
  -- File:
  -- implements:
                   FSMD B
  -- of Model:
                   FSMD (using clock)
  -- for Example: FSMD(A) --> FSMD(B)
                   using double handshake protocol
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE. std_logic_arith . all;
  -- FSMD B of example of two FSMD (Without time) exchanging data
  Entity FSMD_B_dHS_clock_ex is
     port (Reset, Start:
                                 in std_logic;
            Clock_B:
                                in std_logic;
1.5
            A_Ready_B, TB_Ackn_B: in std_logic;
            B_Ackn_A, B_Ready_TB: out std_logic;
            Data_AB:
                               in integer;
            Data_AB_Stop:
                                   in std_logic;
                                out integer;
            Out1_Port:
20
            Out1_Stop_Port:
                                out std_logic);
  end FSMD_B_dHS_clock_ex;
25 Architecture FSMD_B_dHS_clock_ex_behavioral of FSMD_B_dHS_clock_ex is
  -- outputs
  signal Out1: integer;
  signal Out1_Stop: std_logic;
30 -- internal signals
  type State_Set is (S_BEGIN, S_1, S_2, S_3, S_4, S_5, S_6, S_END);
  -- S_BEGIN is represented by "Procedure not running,
  signal next_STATE: State_Set;
  signal C: integer;
35 signal Stop: std_logic;
  begin -- Architecture
  behavior1: process(Clock_B)
40 begin
    if (Clock_B'event and Clock_B='1') then
      i\hat{f} (Reset = '1') then
        Out1 \leq 0;
        Out1.Stop <= '0';
        B_Ackn_A <= '0';
45
        B_Ready_TB \le 0;
        next_STATE <= S_BEGIN;
      else
        case next_STATE is
          when S\_BEGIN => Out1 <= 0;
                            Out1\_Stop <= '0';
                            B_Ackn_A <= '0';
                            B_Ready_TB \le 0;
55
                            if Start = '1'
                              then next\_STATE \le S_1;
                              else next_STATE <= S_BEGIN;</pre>
                            end if:
60
          when S_1 =>
                         if (A_Ready_B='1')
```

```
then next_STATE \leq S_2;
                             else next_STATE<= S_1;
65
                           end if;
           when S_2 =>
                          C \leq Data\_AB;
                           Stop <= Data_AB_Stop;
                           B_Ackn_A <= '1';
70
                           if (A_Ready_B='0')
                             then next_STATE \leq S_3;
                             else next_STATE<= S_2;
                           end if;
75
                           Out1 \le 2 * C;
           when S_3 =>
                           Out1\_Stop \le Stop;
                           B_Ackn_A <= '0';
80
                           next\_STATE \le S\_4;
           when S_4 =>
                           next\_STATE \le S\_5;
85
           when S_5 =>
                           B_Ready_TB \le '1';
                           if (TB_Ackn_B='1')
                             then next_STATE<=S_6;
90
                             else next_STATE<=S_5;
                           end if;
           when S_6 =>
                          B_Ready_TB \le 0';
95
                           if (TB\_Ackn\_B='0')
                             then if (Stop = '1')
                                     then next_STATE<=S_END;
                                     else next_STATE \le 1;
100
                                  end if;
                             else nextSTATE<=S_6;
                           end if;
           when S_END=> -- nothing (Procedure quits)
105
         end case;
        end if;
      end if;
110 end process;
   -- Entity Outputs
   Out1_Port <= Out1;
115 Out1_Stop_Port <= Out1_Stop;
   end FSMD_B_dHS_clock_ex_behavioral;
```

# C.2.7 A (FSM Controlling Datapath) $\rightarrow$ B (FSM Controlling Datapath)

```
\label{thm:clock} \mbox{WHDL/doubleHS\_SMs/FSM\_dHS\_clock/FSM\_and\_D\_dHS\_clock\_ex\_top.vhd}
  -- File:
                    FSM\_and\_D\_dHS\_clock\_ex\_top, vhd
  -- bounding the two parts of the example
  -- for Example: FSM, DP(A) --> FSM, DP(B)
                    using double handshake protocol
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE. std_logic_arith . all;
  use STD. textio. all;
  entity FSM_and_D_dHS_clock_ex_top is
                                  in std_logic;
     port (Reset, Start:
            Clock_A , Clock_B :
                                  in std_logic;
            TB_Ready_A, TB_Ackn_B: in std_logic;
            A_Ackn_TB, B_Ready_TB: out std_logic;
1.5
                                  in std_logic_vector (31 downto 0);
            In1, In2:
            In_Stop:
                                  in std_logic;
            Out1_Port:
                                 out std_logic_vector (31 downto 0);
            Out1_Stop_Port:
                                 out std_logic);
20 end FSM_and_D_dHS_clock_ex_top;
  {\bf architecture} \ \ FSM\_{and\_D\_dHS\_clock\_ex\_top\_arch} \ \ {\bf of} \ \ FSM\_{and\_D\_dHS\_clock\_ex\_top} \ \ {\bf is}
  Component FSM_and_D_A_dHS_clock_ex
     port (Reset, Start:
                                  in std_logic;
            Clock_A:
                                 in std_logic;
            TB_Ready_A, B_Ackn_A: in std_logic;
            A_Ackn_TB, A_Ready_B: out std_logic;
            In 1. In 2:
                                  in std_logic_vector(31 downto 0);
            In_Stop:
                                  in std_logic;
30
            Data_AB_Port:
                                    out std_logic_vector(31 downto 0);
            Data_AB_Stop_Port:
                                    out std_logic);
  end Component;
35 Component FSM_and_D_B_dHS_clock_ex
     port (Reset, Start:
                                  in std_logic;
            Clock_B:
                                 in std_logic;
            A_Ready_B, TB_Ackn_B: in std_logic;
            B_Ackn_A, B_Ready_TB: out std_logic;
                                in std_logic_vector(31 downto 0);
40
            Data_AB:
            Data_AB_Stop:
                                      in std_logic;
                                 out std_logic_vector (31 downto 0);
            Out1_Port:
            Out1_Stop_Port:
                                 out std_logic);
  end Component:
  signal B_Ackn_A, A_Ready_B: std_logic;
                                 std_logic_vector (31 downto 0);
  signal Data_AB:
  signal Data_AB_Stop:
                                 std_logic;
50 begin
  FSM_and_D_A_dHS_clock_I: FSM_and_D_A_dHS_clock_ex
        port map ( Reset=>Reset, Start=>Start, Clock_A=>Clock_A,
            TB_Ready_A=>TB_Ready_A, B_Ackn_A=>B_Ackn_A,
55
            A\_Ackn\_TB = > A\_Ackn\_TB, A\_Ready\_B = > A\_Ready\_B,
            In 1=>In 1, In 2=>In 2,
            In_Stop => In_Stop,
            Data_AB_Port=>Data_AB,
            Data_AB_Stop_Port=>Data_AB_Stop );
  FSM\_and\_D\_B\_dHS\_clock\_I: FSM\_and\_D\_B\_dHS\_clock\_ex
        port map ( Reset=>Reset, Start=>Start, Clock_B=>Clock_B,
            A_Ready_B=>A_Ready_B, TB_Ackn_B=>TB_Ackn_B,
```

```
B_Ackn_A=>B_Ackn_A, B_Ready_TB=>B_Ready_TB,

Data_AB=>Data_AB,

Data_AB_Stop=>Data_AB_Stop,

Out1_Port=>Out1_Port,

Out1_Stop_Port=>Out1_Stop_Port);

70 end FSM_and_D_dHS_clock_ex_top_arch;
```

### C.2.8 FSM Controlling Datapath A

```
VHDL/doubleHS_SMs/FSM_dHS_clock/FSM_and_D_dHS_clock_ex_A.vhd
  -- File:
                   FSM\_and\_D\_dHS\_clock\_ex\_A , vhd
                   bounding of FSM A and D at a path A
  --implements:
                   FSM and separate Datapath (FSM+D)
  -- of Model:
  -- for Example: FSM, DP(A) --> FSM, DP(B)
                   using double handshake protocol
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE. std_logic_arith . all;
  -- FSM_and_D A of example of two FSM_and_D (Without time) exchanging data
  Entity FSM_and_D_A_dHS_clock_ex is
     port (Reset, Start:
                                 in std_logic;
                                in std_logic;
            Clock_A:
1.5
            TB_Ready_A, B_Ackn_A: in std_logic;
            A_Ackn_TB, A_Ready_B: out std_logic;
            In1, In2:
                                 in std_logic_vector (31 downto 0);
            In_Stop:
                                 in std_logic;
            Data_AB_Port:
                                   out std_logic_vector (31 downto 0);
20
            Data_AB_Stop_Port:
                                         out std_logic);
  \mathbf{end} \ \mathrm{FSM\_and\_D\_A\_dHS\_clock\_ex} \ ;
25 Architecture FSM_and_D_A_dHS_clock_ex_structural of FSM_and_D_A_dHS_clock_ex is
  -- outputs
  signal Data_AB: std_logic_vector (31 downto 0);
  signal Data_AB_Stop: std_logic;
                       std_logic;
30 signal DP_Reset:
  signal ld_A , ld_B , ld_Stop: std_logic ;
  signal StopMsg: std_logic;
  signal ALU_M: std_logic_vector (1 downto 0);
  signal ld_O: std_logic;
35 signal CMP: std_logic_vector(1 downto 0);
  signal MUX_sel: std_logic;
  Component DP
     port (Clock:
                             in std_logic;
            DP_{\blacksquare}Reset:
                             in std_logic;
            ld_A , ld_B , ld_Stop : in std_logic ;
            StopMsg:
                             out std_logic;
            ALU_M:
                             in std_logic_vector (1 downto 0);
                             in std_logic;
            ld_O :
            In 1, In 2:
                             in std_logic_vector(31 downto 0);
45
            In_Stop:
                             in std_logic;
            O_Port:
                             out std_logic_vector(31 downto 0);
            Out_Stop:
                             out std_logic);
  end Component;
  Component FSM_A_dHS_clock_ex
     port (Reset, Start:
                                   in std_logic;
            Clock\_A:
                                   in std_logic;
            TB_Ready_A, B_Ackn_A: in std_logic;
            A_Ackn_TB, A_Ready_B: out std_logic;
55
            DP\_Reset:
                                   out std_logic;
            ld_A , ld_B , ld_Stop :
                                   out std_logic;
           StopMsg:
                                   in std_logic;
           ALU_M:
                                   out std_logic_vector (1 downto 0);
            ld_O:
                                   out std_logic);
  end Component;
  begin -- Architecture
```

```
DP_AI: DP
         \label{eq:clock_A} \mbox{Port Map (Clock=>Clock_A, DP_Reset=>DP_Reset,}
                        ld_A => ld_A, ld_B => ld_B, ld_S top => ld_S top,
                        StopMsg => StopMsg, ALU_M => ALU_M, Id_O => Id_O,
                        In1 = > In1, In2 = > In2,
                        In_Stop=>In_Stop,
O_Port=>Data_AB, Out_Stop=>Data_AB_Stop);
70
   FSM\_A\_dHS\_clock\_I: \ FSM\_A\_dHS\_clock\_ex
          Port Map (Reset=>Reset, Start=>Start,
                        C lock_A = > C lock_A,
                        \label{eq:thm:continuous} \begin{split} TB\_Ready\_A => TB\_Ready\_A \;, & B\_Ackn\_A => B\_Ackn\_A \;, \\ A\_Ackn\_TB => A\_Ackn\_TB \;, & A\_Ready\_B => A\_Ready\_B \;, \end{split}
75
                        DP_Reset=>DP_Reset,
                        ld_A => ld_A, ld_B => ld_B, ld_S top => ld_S top,
                        StopMsg=>StopMsg, ALU_M=>ALU_M, ld_O=>ld_O);
   -- Entity Outputs
  Data_AB_Port <= Data_AB;
   Data_AB_Stop_Port <= Data_AB_Stop;
85 end FSM_and_D_A_dHS_clock_ex_structural;
```

#### C.2.9 FSM of A

```
VHDL/doubleHS_SMs/FSM_dHS_clock/FSM_dHS_clock_ex_A.vhd
  -- File:
                    FSM\_dHS\_clock\_ex\_A , vhd
                    FSM A
  -- implements:
  -- of Model:
                    FSM and separate Datapath
  -- for Example: FSM, DP(A) --> FSM, DP(B)
                    using double handshake protocol
5 - -
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE. std_logic_arith . all;
  Entity FSM_A_dHS_clock_ex is
     port (Reset, Start:
                                   in std_logic;
            Clock_A:
                                     in std_logic;
            TB_Ready_A, B_Ackn_A: in std_logic;
            A_Ackn_TB, A_Ready_B: out std_logic;
1.5
            DP_Reset:
                                    out std_logic;
            ld_A , ld_B , ld_Stop :
                                    out std_logic;
            StopMsg:
                                     in std_logic;
            ALU_M:
                                     out std_logic_vector (1 downto 0);
            ld_O:
                                     out std_logic);
  end FSM_A_dHS_clock_ex;
  Architecture FSM_A_dHS_clock_ex_behavioral of FSM_A_dHS_clock_ex is
  type State_Set is (S_BEGIN, S_1, S_2, S_3, S_4, S_5, S_6, S_END);
  signal next_STATE, STATE: State_Set;
  begin -- Architecture
  transition: Process
  begin
    wait until (Clock_A'event and Clock_A = '1');
    STATE \le next\_STATE;
35 end process;
  behavior1: process(STATE, Start, TB_Ready_A, B_Ackn_A, StopMsg)
  begin
      if (Reset = '1') then
            DP_Reset <= '1';
            A\_Ackn\_TB <= '0';
            A_Ready_B <= '0';
                       <= '0';
            ld_A
                       <= '0':
            ld_B
                       <= '0';
            ld_Stop
45
                       <= "--";
            ALU_M
            ld_O
                       <= '0';
         next_STATE <= S_BEGIN;
       else
         case STATE is
           \begin{array}{lll} \mbox{when $S$\_BEGIN =>$ DP$\_Reset & <= '1'; \\ & A\_Ackn\_TB <= '0'; \end{array}
                             A_Ready_B <= 0.0
                                        <= '0';
55
                             ld_A
                             ld_B
                                        <= '0';
                             ld_Stop
                                        <= 0,0,0
                             ALU_M
                                        <= "--";
                             ld_O
                                        <= '0':
60
                            if Start = '1'
                              then next_STATE \leq S_1;
                              else next_STATE <= S_BEGIN;
```

```
end if;
65
                                 \begin{array}{lll} {\rm DP\_Reset} & <= \ '0\ '; \\ {\rm A\_Ackn\_TB} & <= \ '0\ '; \end{array}
             when S_1 =>
                                  A_Ready_B <= '0';
                                             <= '0';
                                  ld_A
                                  ld_B
                                              <= ', 0';
70
                                             <= '0';
<= "--";
                                  ld_Stop
                                  ALU_M
                                  ld_O
                                              <= ', 0';
75
                                if (TB_Ready_A='1')
                                   then next_STATE<= S_2;
                                   else next_STATE<= S_1;
                                end if;
                                  DP_Reset <= '0';
A_Ackn_TB <= '1';</pre>
              when S_2 =>
80
                                  A_Ready_B <= '0';
                                  ld_A
                                              <= '1';
                                  ld\_B
                                              <= '1';
                                             <= '1';
<= "11";
                                  ld_Stop
85
                                 ALU_M
                                              <= '0';
                                  ld_O
                                if (TB_Ready_A='0')
                                   then next_STATE\leq= S_3;
90
                                   else next_STATE<= S_2;
                                end if;
              when S_3 =>
                                  \mathrm{DP} \_\mathrm{Reset} \quad <= \ ^{,}0\ ^{,};
                                  A_Ackn_TB <= '0';
95
                                  A_Ready_B <= '0';
                                  ld_A
                                             <= '0';
                                              <= '0';
                                  ld_B
                                  ld_Stop
                                             <= '0';
100
                                  ALU_M
                                              <= "11";
                                              <= '1';
                                  ld_O
                                next\_STATE \le S\_4;
              when S_4 =>
                                  DP_Reset <= '0';
105
                                  A\_Ackn\_TB <= '0';
                                  A_Ready_B <= '0';
                                             <= '0';
                                  ld_A
                                  ld_B
                                              <= '0';
                                             <= '0';
<= "--";
110
                                 ld_Stop
                                  ALU_M
                                              <= '0';
                                  ld_O
                                next\_STATE \le S\_5;
115
              when S_5 =>
                                  DP_Reset <= '0';
                                  A\_Ackn\_TB <= '0';
                                  A_Ready_B <= '1';
                                             <= '0';
                                  ld_A
                                              <= '0';
                                  ld_B
120
                                  ld_Stop
                                             <= '0';
                                              <= "--";
                                  ALU_M
                                  ld_O
                                              <= '0';
                                if (B_Ackn_A='1')
125
                                   then next_STATE<=S_6:
                                   else next\_STATE \le 5;
                                end if;
```

```
DP_Reset <= '0';
A_Ackn_TB <= '0';
A_Ready_B <= '0';
Id_A <= '0';
130
               when S_6 =>
                                               <= '0';
                                  ld_B
                                              <= '0';
<= "--";
                                  ld_Stop
135
                                  ALU_M
                                  ld_O
                                               <= '0';
                                 if (B_Ackn_A='0')
                                   then if (StopMsg='1')
then next_STATE<=S_END;
140
                                            else next_STATE<=S_1;</pre>
                                          end if;
                                    else next_STATE<=S_6;
                                 end if;
145
               when S_END=> -- nothing (Procedure quits)
            end case;
          end if;
150
   end process;
   end FSM_A_dHS_clock_ex_behavioral;
```

### C.2.10 Datapath of A (identical B)

```
VHDL/doubleHS_SMs/Datapath/DP.vhd
  -- File:
                   DP. vhd
  -- implements: Datapath (instanced twice: for A and for B)
  -- for Example: FSM, DP (A) --> FSM, DP (B)
                   using double handshake protocol
  library IEEE;
  use IEEE.std_logic_1164.all;
  use IEEE.std_logic_arith.all;
10 Entity DP is
                            in std_logic;
     port (Clock:
           DP_Reset:
                            in std_logic;
            ld_A , ld_B , ld_Stop: in std_logic;
           StopMsg:
                             out std_logic;
           ALU_M:
                            in std_logic_vector (1 downto 0);
1.5
           ld_O :
                            in std_logic;
            In 1, In 2:
                            in std_logic_vector(31 downto 0);
                            in std_logic;
            In_Stop:
            O_Port:
                            out std_logic_vector (31 downto 0);
            Out_Stop:
                            out std_logic);
  end DP;
  Architecture DP_schematic of DP is
     -- connection signals
     signal A, B, O: std_logic_vector(31 downto 0);
     signal ALU_Out: std_logic_vector(31 downto 0);
     signal Stop:
                      std_logic;
     component Reg_32bit
30
                          in std_logic;
        Port (Clock:
                Reset:
                          in std_logic;
                Load:
                          in std_logic;
                Data_In: in std_logic_vector (31 downto 0);
                Data_Out: out std_logic_vector (31 downto 0));
35
     end component;
     component Latch_impl
        Port ( Clock:
                           in std_logic;
                Reset:
                           in std_logic;
40
                Load:
                           in std_logic;
                Data_In :
                           in std_logic;
                Data_Out: out std_logic);
     end component:
45
     component ALU_32
                            in std_logic_vector (1 downto 0);
        Port ( Mode:
                            in std_logic_vector (31 downto 0);
                Data In1:
                Data_In2:
                            in std_logic_vector (31 downto 0);
                Data_Out:
                            out std_logic_vector (31 downto 0));
     end component;
  begin
55 Register_A: Reg_32bit
     Port Map ( Clock=>Clock, Reset=>DP_Reset, Load=>ld_A, Data_In=>In1, Data_Out=>A);
  Register_B: Reg_32bit
     Port Map ( Clock=>Clock, Reset=>DP_Reset, Load=>ld_B,
                 Data_In=>In2, Data_Out=>B);
  Latch_Stop: Latch_impl
```

```
Port Map ( Clock=>Clock, Reset=>DP_Reset, Load=>ld_Stop, Data_In=>In_Stop, Data_Out=>Stop);

ALU: ALU_32
    Port Map ( Mode=>ALU_M, Data_In1=>A, Data_In2=>B, Data_Out=>ALU_Out);

Register_O: Reg_32bit
    Port Map ( Clock=>Clock, Reset=>DP_Reset, Load=>ld_O, Data_In=>ALU_Out, Data_Out=>O);

To O_Port <= O;
Out_Stop <= Stop;
StopMsg <= Stop;
end DP_schematic;
```

### C.2.11 FSM Controlling Datapath B

```
VHDL/doubleHS_SMs/FSM_dHS_clock/FSM_and_D_dHS_clock_ex_B.vhd
  -- File:
                   FSM\_and\_D\_dHS\_clock\_ex\_B , vhd
                   bounding of FSM B and D at a path B
  -- implements:
                   FSM and separate Datapath (FSM+D)
  -- of Model:
  -- for Example: FSM, DP(A) --> FSM, DP(B)
                   using double handshake protocol
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE. std_logic_arith . all;
  Entity FSM_and_D_B_dHS_clock_ex is
     port (Reset, Start:
                                in std_logic;
            Clock_B:
                               in std_logic;
            A_Ready_B, TB_Ackn_B: in std_logic;
            B_Ackn_A, B_Ready_TB: out std_logic;
1.5
                              in std_logic_vector (31 downto 0);
            Data\_AB\_Stop:
                                    in std_logic;
            Out1_Port:
                               out std_logic_vector (31 downto 0);
            Out1_Stop_Port:
                               out std_logic);
20 end FSM_and_D_B_dHS_clock_ex;
  Architecture FSM_and_D_B_dHS_clock_ex_behavioral of FSM_and_D_B_dHS_clock_ex is
  -- outputs
25 signal Out1: std_logic_vector (31 downto 0);
  signal Out1_Stop: std_logic;
  signal DP_Reset:
                       std_logic;
  signal ld_A , ld_B , ld_Stop : std_logic ;
30 signal StopMsg: std_logic;
  signal ALU_M: std_logic_vector (1 downto 0);
  signal ld_O: std_logic;
  signal CMP: std_logic_vector(1 downto 0);
  signal MUX_sel: std_logic;
  Component DP
     port (Clock:
                            in std_logic;
            DP_Reset:
                            in std_logic;
            ld_A , ld_B , ld_Stop: in std_logic;
                            out std_logic;
40
            StopMsg:
           ALU_M:
                            in std_logic_vector (1 downto 0);
            ld_O :
                            in std_logic;
            In 1, In 2:
                            in std_logic_vector(31 downto 0);
           In_Stop:
                            in std_logic;
            O_Port:
                            out std_logic_vector (31 downto 0);
            Out_Stop:
                            out std_logic);
  end Component;
  Component FSM_B_dHS_clock_ex
     port (Reset, Start:
                                   in std_logic;
            Clock_B:
                                   in std_logic;
            A_Ready_B, TB_Ackn_B: in std_logic;
            B_Ackn_A, B_Ready_TB: out std_logic;
            DP_Reset:
                                   out std_logic;
55
            ld_A , ld_B , ld_Stop :
                                  out std_logic;
           StopMsg:
                                   in std_logic;
           ALU_M:
                                   out std_logic_vector (1 downto 0);
           ld_O:
                                   out std_logic);
  end Component;
  begin -- Architecture
  DP_B_I: DP
       Port Map (Clock=>Clock_B, DP_Reset=>DP_Reset,
```

```
\label{eq:ld_A} $$ $ \operatorname{Id_A} = \operatorname{Id_A} , $ \operatorname{Id_B} = \operatorname{Id_B} , $ \operatorname{Id_Stop} = \operatorname{Id_Stop} , $$
                       StopMsg => StopMsg \,, \quad ALU\_M => ALU\_M \,, \quad ld\_O => ld\_O \,,
65
                       In1 = > Data\_AB, In2 = > Data\_AB,
                       In_Stop => Data_AB_Stop,
                       O_Port=>Out1, Out_Stop=>Out1_Stop);
70 FSM_B_dHS_clock_I: FSM_B_dHS_clock_ex
         Port Map (Reset=>Reset, Start=>Start,
                       Clock_B=>Clock_B,
                       A_Ready_B=>A_Ready_B, TB_Ackn_B=>TB_Ackn_B,
                       B_Ackn_A=>B_Ackn_A, B_Ready_TB=>B_Ready_TB,
                       DP_Reset=>DP_Reset,
75
                       ld_A => ld_A, ld_B => ld_B, ld_S top => ld_S top,
                       StopMsg=>StopMsg, ALU_M=>ALU_M, Id_O=>Id_O);
  -- Entity Outputs
80 Out1_Port <= Out1;</pre>
  Out1_Stop_Port <= Out1_Stop;
  end FSM_and_D_B_dHS_clock_ex_behavioral;
```

#### C.2.12 FSM of B

```
VHDL/doubleHS_SMs/FSM_dHS_clock/FSM_dHS_clock_ex_B.vhd
                    FSM\_dHS\_clock\_ex\_B . vhd
  -- File:
  -- implements:
                    FSM B
  -- of Model:
                    FSM and separate Datapath
  -- for Example: FSM, DP(A) --> FSM, DP(B)
                    using double handshake protocol
5 - -
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE. std_logic_arith . all;
  Entity FSM_B_dHS_clock_ex is
                                  in std_logic;
     port (Reset, Start:
                                 in std_logic;
            Clock_B:
            A_Ready_B, TB_Ackn_B: in std_logic;
            B_Ackn_A, B_Ready_TB: out std_logic;
1.5
            DP_Reset:
                                    out std_logic;
            ld_A , ld_B , ld_Stop: out std_logic;
            StopMsg:
                                    in std_logic;
            ALU_M:
                                    out std_logic_vector (1 downto 0);
            ld_O:
                                    out std_logic);
  end FSM_B_dHS_clock_ex;
  Architecture FSM_B_dHS_clock_ex_behavioral of FSM_B_dHS_clock_ex is
  type State_Set is (S_BEGIN, S_1, S_2, S_3, S_4, S_5, S_6, S_END);
  signal next_STATE, STATE: State_Set;
  begin -- Architecture
  transition: Process
  begin
    wait until (Clock_B'event and Clock_B = '1');
    STATE <= next_STATE;
35 end process;
  behavior1: process(STATE, Start, A_Ready_B, TB_Ackn_B, StopMsg)
  begin
      if (Reset = '1') then
            B_Ready_TB \le 0;
                       <= '0';
            ld_A
                       <= '0':
            ld_B
                       <= '0';
            ld_Stop
45
                       <= "--";
            ALU_M
            ld_O
                       <= '0';
         next_STATE <= S_BEGIN;
       else
         case STATE is
           \begin{array}{lll} \mbox{when $S$\_BEGIN =>$ DP$\_Reset} & <= '1'; \\ & B\_Ackn\_A & <= '0'; \\ \end{array}
                             B_Ready_TB \le 0
                                       <= '0';
55
                             ld A
                             ld_B
                                        <= '0';
                             ld_Stop
                                       <= 0,0,0
                             ALU_M
                                       <= "--";
                             ld_O
                                        <= '0':
60
                             if Start = '1'
                               then next_STATE \leq S_1;
                               else next_STATE <= S_BEGIN;</pre>
```

```
end if;
65
                                 \begin{array}{ll} {\rm D\,P} \text{\_Reset} & <=~,\,0~,\,;\\ {\rm B} \text{\_A\,ckn} \text{\_A} & <=~,\,0~,\,; \end{array}
              when S_1 =>
                                  B_Ready_TB <= '0';
                                              <= '1';
                                  ld_A
                                 ld_B
                                              <= '1';
70
                                              <= '1'; <= "--";
                                  ld_Stop
                                 ALU\_M
                                              <= '0';
                                 ld_O
75
                                if (A_Ready_B='1')
                                   then next_STATE \le S_2;
                                   else next_STATE <= S_1;
                                end if;
                                DP_Reset <= '0';
B_Ackn_A <= '1';</pre>
              when S_2 =>
80
                                B_Ready_TB <= '0';
                                ld_A
                                            <= '0';
                                ld_B
                                             <= '0';
                                ld_Stop
                                             <= 0,0,0
85
                                             <= "10";
                                ALU_M
                                ld_O
                                             <= '0';
                                if (A_Ready_B='0')
                                  then next STATE \le S_3;
90
                                   else next_STATE <= S_2;
                                end if;
                                DP_Reset <= '0';
B_Ackn_A <= '0';</pre>
              when S_3 =>
95
                                B_Ready_TB \le '0';
                                ld_A
                                            <= '0';
                                             <= '0';
                                ld_B
                                ld_Stop
                                            <= '0';
100
                                ALU_M
                                             <= "10";
                                             <= '1';
                                ld_O
                                next\_STATE \le S\_4;
                                DP_Reset <= '0';
B_Ackn_A <= '0';</pre>
105
              when S_4 =>
                                B_Ready_TB <= '0';
                                             <= '0';
                                ld_A
                                ld_B
                                             <= '0';
                                             <= '0';
110
                                ld_Stop
                                             <= "--";
                                ALU_M
                                             <= '0';
                                ld_O
                                next\_STATE \le S\_5;
115
              when S_5 =>
                                DP_Reset <= '0';
                                B_Ackn_A <= '0';
                                B_Ready_TB <= '1';
                                ld_A
                                             <= '0';
                                             <= '0';
                                ld_B
120
                                ld_Stop
                                             <= '0';
                                             <= "--";
                                ALU_M
                                             <= '0';
                                ld_O
                                if (TB_Ackn_B='1')
125
                                  then next_STATE<=S_6;
                                   else next\_STATE \le 5;
                                end if;
```

```
DP_Reset <= '0';
B_Ackn_A <= '0';
130
              when S_6 =>
                                B_Ready_TB<= '0';
ld_A <= '0';
                                              <= '0';
                                 ld_B
                                             <= '0';
<= "--";
                                 \operatorname{ld} \_ \operatorname{Stop}
135
                                 ALU_M
                                              <= '0';
                                 ld_O
                                 if (TB_Ackn_B='0')
                                   then if (StopMsg='1')
then next_STATE<=S_END;
140
                                             else next\_STATE \le S_1;
                                          end if;
                                   else next_STATE<=S_6;</pre>
                                end if;
145
              when S_END=> -- nothing (Procedure quits)
           end case;
          end if;
150
   end process;
   end FSM_B_dHS_clock_ex_behavioral;
```

# C.2.13 Datapath of B

The datapath for the receiver B is identical to the datapath of the sender  $A. \rightarrow See$  datapath for sender A.

# C.3 Transfer via Queue with Double Handshake

- A (FSMD without time)  $\rightarrow$  Queue (FSMD without time)  $\rightarrow$  B (FSMD without time)
  - FSMD A
  - FSMD Queue
  - FSMD B
- A (FSMD using clock)  $\rightarrow$  Queue (FSMD using clock)  $\rightarrow$  B (FSMD using clock)
  - FSMD A
  - FSMD Queue
  - FSMD B
- A (FSM Controlling Datapath)  $\rightarrow$  Queue (FSM Controlling Datapath)
  - → B (FSM Controlling Datapath)
    - FSM Controlling Datapath A
      - \* FSM of A
      - \* Datapath of A (identical B)
    - FSM Controlling Datapath Queue
      - \* FSM of Queue
      - \* Datapath of Queue (Buffered)
        - · Datapath of Unbuffered Queue
    - FSM Controlling Datapath B
      - \* FSM of B
      - \* Datapath of B identical to A (see A)

### C.3.1 A (FSMD without time) $\rightarrow$ Queue (FSMD without time) $\rightarrow$ B (FSMD without time)

```
VHDL/doubleHS_queued_SMs/FSMD_dHS_q_no_time/FSMD_dHS_q_no_time_ex_top.vhd
                    FSMD\_dHS\_q\_no\_time\_ex\_top.vhd
  -- bounding the three parts of the example
  -- for Example: FSMD(A) --> FSMD(Queue) --> FSMD(B)
                   using double handshake protocol
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE.std_logic_arith.all;
  use IEEE. std_logic_signed.ALL;
  entity FSMD_dHS_q_no_time_ex_top is
     port (Reset, Start: in bit;
            TB_Ready_A, TB_Ackn_B: in bit;
A_Ackn_TB, B_Ready_TB: out bit;
            In1, In2:
                                  in integer;
            In_Stop:
                                  in bit;
            Out1_Port:
                                 out integer;
            Out1_Stop_Port:
                                 out bit);
  end FSMD_dHS_q_no_time_ex_top;
  architecture FSMD_dHS_q_no_time_ex_top_arch of FSMD_dHS_q_no_time_ex_top is
  {\tt Component FSMD\_A\_dHS\_q\_no\_time\_ex}
     port (Reset, Start:
                                in bit;
            TB_Ready_A, QC_Ackn_A: in bit;
            A_Ackn_TB, A_Ready_QC: out bit;
            In1, In2:
                                  in integer;
            In\_Stop:
                                  in bit;
            Data_AQ_Port:
                                   out integer;
            Data_AQ_Stop_Port:
                                    out bit);
  end Component;
  {\tt Component FSMD\_Queue\_dHS\_no\_time\_ex}
     generic (W, D: integer);
     port (Reset, Start:
                                  in bit;
            A_Ready_QC, B_Rd_Req_QC: in bit;
            QC_Ackn_A, QC_Ready_B: out bit;
            Q_Empty_Port, Q_Full_Port: out bit;
            Data_AQ:
                               in bit_vector(W-1 downto 0);
                                      out bit_vector(W-1 downto 0));
            Buffer\_QB\_Port:
  end Component;
  Component FSMD_B_dHS_q_no_time_ex
     port (Reset, Start:
                                 in bit;
            QC_Ready_B, TB_Ackn_B: in bit;
45
            B_Rd_Req_QC, B_Ready_TB: out bit;
            Buffer_QB:
                                   in integer;
            Buffer_QB_Stop:
                                   in bit;
            Out1_Port:
                                 out integer;
            Out1\_Stop\_Port:
                                 out bit);
  end Component;
  \begin{tabular}{ll} \bf signal & A\_Ready\_QC \, , & B\_Rd\_Req\_QC \, ; & bit \, ; \\ \end{tabular}
  signal QC_Ackn_A, QC_Ready_B: bit;
55 signal Data_AQ:
                                 integer;
  signal Data_AQ_bit_v:
                                 bit_vector (31 downto 0);
  signal Data_AQ_Stop:
                                 bit;
  signal Buffer_QB:
                                   integer;
  signal Buffer_QB_bit_v:
                                   bit_vector (31 downto 0);
60 signal Buffer_QB_Stop:
                                   bit;
  begin
```

```
FSMD_A_dHS_q_no_time_I: FSMD_A_dHS_q_no_time_ex
       port map ( Reset=>Reset, Start=>Start,
65
            TB_Ready_A => TB_Ready_A, QC_Ackn_A => QC_Ackn_A,
            A_Ackn_TB=>A_Ackn_TB, A_Ready_QC=>A_Ready_QC,
            In 1 = > In 1, In 2 = > In 2,
           In_Stop => In_Stop,
            Data_AQ_Port=>Data_AQ,
70
            Data_AQ_Stop_Port=>Data_AQ_Stop );
  FSMD_Queue_dHS_no_time_I: FSMD_Queue_dHS_no_time_ex
       generic map ( W=>33, D=>4)
       port map ( Reset=>Reset, Start=>Start,
75
            A_Ready_QC=>A_Ready_QC, B_Rd_Req_QC=>B_Rd_Req_QC,
            QC_Ackn_A=>QC_Ackn_A, QC_Ready_B=>QC_Ready_B,
            Q_Empty_Port=>open, Q_Full_Port=>open,
            Data_AQ(31 downto 0)=>Data_AQ_bit_v,
              Data_AQ(32) = > Data_AQ_Stop,
80
            Buffer_QB_Port (31 downto 0)=>Buffer_QB_bit_v,
              Buffer_QB_Port (32)=>Buffer_QB_Stop);
  FSMD\_B\_dHS\_q\_no\_time\_I: \ FSMD\_B\_dHS\_q\_no\_time\_ex
       port map ( Reset=>Reset, Start=>Start,
85
           QC_Ready_B = QC_Ready_B, TB_Ackn_B = TB_Ackn_B,
            B_Rd_Req_QC=>B_Rd_Req_QC, B_Ready_TB=>B_Ready_TB,
            Buffer_QB=>Buffer_QB,
            Buffer_QB_Stop=>Buffer_QB_Stop,
            Out1_Port=>Out1_Port,
90
            Out1_Stop_Port=>Out1_Stop_Port );
  Data_AQ_bit_v <= To_BitVector(conv_std_logic_vector(Data_AQ, 32));
  Buffer_QB
                  <= conv_integer(To_StdLogicVector(</pre>
                                                             Buffer_QB_bit_v ));
  end FSMD_dHS_q_no_time_ex_top_arch;
```

#### C.3.2 FSMD A

```
VHDL/doubleHS_queued_SMs/FSMD_dHS_q_no_time/FSMD_dHS_q_no_time_ex_A.vhd
  -- File:
                    FSMD\_dHS\_q\_no\_time\_ex\_A . vhd
  -- implements:
                    FSMD A
  -- of Model:
                    FSMD (without using time)
  -- for Example: FSMD(A) --> FSMD(Queue) --> FSMD(B)
                    using double handshake protocol
  {\tt Entity} \ {\tt FSMD\_A\_dHS\_q\_no\_time\_ex} \ {\tt is}
      port (Reset, Start:
                                 in bit;
            TB_Ready_A, QC_Ackn_A: in bit;
            A_Ackn_TB, A_Ready_QC: out bit;
1.0
            In1, In2:
                                  in integer;
            In_Stop:
                                   in bit;
            Data_AQ_Port:
                                    out integer;
            Data_AQ_Stop_Port:
                                           out bit);
15 end FSMD_A_dHS_q_no_time_ex;
  Architecture FSMD_A_dHS_q_no_time_ex_behavioral of FSMD_A_dHS_q_no_time_ex is
20 -- behavior outputs
  signal Data_AQ: integer;
  signal Data_AQ_Stop: bit;
  Procedure behavior1(signal TB_Ready_A, QC_Ackn_A: in bit;
                         signal A_Ackn_TB, A_Ready_QC: out bit;
                         signal In1, In2: in integer;
                         signal In_Stop: in bit;
                         signal Data_AQ: inout integer;
                         signal Data_AQ_Stop: out bit) is
30 type State_Set is (S_1, S_2, S_3, S_4, S_5, S_6, S_END); -- S_BEGIN is represented by "Procedure not running".
  variable next_STATE: State_Set;
  variable A, B: integer;
  variable Stop: bit;
  begin
  next\_STATE := S\_1 :
  Data_AQ \le 0;
  Data_AQ_Stop <= '0';
40 A_Ackn_TB<='0';
  A_Ready_QC \le 0';
  while (next_STATE/=S_END) and (Reset/='1') loop
    case next_STATE is
      when S_1 =>
                       if (TB_Ready_A='1')
                         then next_STATE:= S_2;
                         else next_STATE:= S_1;
                      end if;
       when S_2 =>
                      A := In1;
                      B := \operatorname{In} 2;
                      Stop := In_Stop;
                      A_Ackn_TB \le 1';
55
                       if (TB_Ready_A='0')
                         then next_STATE:= S_3;
                         else next_STATE := S_2;
                      end if;
60
       when S_3 =>
                      Data\_AQ \le A - B;
                      Data_AQ_Stop <= Stop;
```

```
A_Ackn_TB \le 0';
65
                       next\_STATE := S\_4;
       when S_4 =>
                       next_STATE := S_5;
70
       when S_5 =>
                       A_Ready_QC \le 1';
                       if (QC_Ackn_A='1')
                         then next STATE := S_6;
75
                         else next\_STATE := S\_5;
                       end if;
       when S_6 =>
                       A_Ready_QC \le 0;
80
                       if (QC_Ackn_A='0')
                         then if (Stop = '1')
                                 then next_STATE:=S_END;
                                 else next\_STATE := S\_1;
                               end if:
85
                         else next\_STATE := S\_6;
                       end if;
       when S_END=> -- nothing (Procedure quits)
90
     end case;
     wait for 1 ns; -- Synopsys needs wait > 0 ns
   end loop;
95 end behavior1;
   begin -- Architecture
100 P1: Process
   begin
     wait until (Reset = '1') or (Start = '1');
     if (Reset = ', 1') then
       Data_AQ <= 0;
Data_AQ_Stop <= '0';
       A\_Ackn\_TB \le 0;
       A_Ready_QC \le 0';
     elsif (Start = '1') then
       behavior1 (TB\_Ready\_A \,, \ QC\_Ackn\_A \,, \ A\_Ackn\_TB \,, \ A\_Ready\_QC \,,
                   In1, In2, In_Stop, Data_AQ, Data_AQ_Stop);
     end if;
   end process;
   -- Entity Outputs
115 Data_AQ_Port <= Data_AQ;
   Data_AQ_Stop_Port <= Data_AQ_Stop;
   end FSMD_A_dHS_q_no_time_ex_behavioral;
```

#### C.3.3 FSMD Queue

```
VHDL/doubleHS_queued_SMs/FSMD_dHS_q_no_time/FSMD_dHS_q_no_time_ex_Queue.vhd
  -- File:
                     FSMD\_dHS\_g\_no\_time\_ex\_Queue. vhd
  --implements:
                     Queue
  -- of Model:
                    FSMD (without using time)
  -- for Example: FSMD(A) --> FSMD(Queue) --> FSMD(B)
                     using double handshake protocol
  Entity FSMD_Queue_dHS_no_time_ex is
      generic (W, D: integer);
      port (Reset, Start:
             A_Ready_QC, B_Rd_Req_QC: in bit;
             QC_Ackn_A, QC_Ready_B: out bit;
             Q_Empty_Port, Q_Full_Port: out bit;
                                  in bit_vector(W-1 downto 0);
             Data_AQ:
             Buffer_QB_Port:
                                        out bit_vector(W-1 downto 0));
15 end FSMD_Queue_dHS_no_time_ex;
  Architecture FSMD_Queue_dHS_no_time_ex_behavioral of FSMD_Queue_dHS_no_time_ex
  is
20
  -- queue outputs
  signal Q_Empty, Q_Full: bit;
  signal Buffer_QB: bit_vector (W-1 downto 0);
25 Procedure Queue(signal Reset, Start: in bit;
                    signal A_Ready_QC, B_Rd_Req_QC: in bit;
signal QC_Ackn_A, QC_Ready_B: out bit;
signal Q_Empty, Q_Full: inout bit;
signal Data_AQ: in bit_vector(W-1 downto 0);
                     signal Buffer_QB: out bit_vector(W-1 downto 0)) is
  type State_Set is (S_0, W_1, W_2, R_1, R_2, R_3, S_END); -- S_BEGIN is represented by "Procedure not running".
  variable next_STATE: State_Set;
  type fifo_array is ARRAY(D-1 downto 0) of bit_vector(W-1 downto 0);
35 variable Q: fifo_array;
  variable C: integer range -1 to D-1;
  begin
  next\_STATE := S\_0;
40 QC_Ackn_A <= '0';
  QC_Ready_B <= '0';
  C:=-1; --empty;
  Q_Empty <= '1';
  Q_Full <= '0';
  while (next_STATE/=S_END) and (Reset/='1') loop
    case next_STATE is
       when S_0 => QC_A ckn_A <= '0';
                     QC_Ready_B \le 0;
                     if (Q_Empty='0' and B_Rd_Req_QC='1')
                       then next_STATE:= R_1;
                     elsif (Q_Full='0' and A_Ready_QC='1')
                       then next_STATE:= W_1;
                     end if;
55
       when W_1 => C := C+1;
                     next\_STATE := W_2;
60
       when W_2 = 5 for i in D-1 downto 1 loop
                       Q(i) := Q(i-1);
                     end loop;
```

```
Q(0) := Data_AQ;
                     if ( C=−1 )
65
                       then Q_Empty <= '1';
                      else
                       \mathrm{Q\_Empty} \ <= \ \ ^{,} \ 0 \ ^{,};
                     end if;
                     if (C=D-1)
70
                       then Q_Full <= '1';
                     else
                       Q_Full <= '0';
                     end if;
75
                     QC_Ackn_A <= '1';
                     if (A_Ready_QC='0')
                       then next_STATE:= S_0;
                     else
                       next\_STATE := W_2;
80
                     end if;
       when R_1 => -- needed for conversion into "FSM plus Datapath"
                     next\_STATE := R_2;
85
       when R_2 => Buffer_QB <= Q(C);
                     C := C-1;
                     next_STATE := R_3;
90
       \label{eq:when R_3} \textbf{when } R\_3 => QC\_Ready\_B <= \mbox{'1'};
                     if ( C=−1 )
                       then Q_Empty <= '1';
                     _{
m else}
95
                       Q_Empty <= '0';
                     end if;
                     if (C=D-1)
                       then Q_Full <= '1';
100
                       Q_Full <= '0';
                     end if;
                     if (B_Rd_Req_QC = '0')
105
                       then next\_STATE := S\_0;
                     else
                       next\_STATE := R\_3;
                     end if;
       110
     end case;
     wait for 1 ns; -- Synopsys needs wait > 0 ns
115 end loop;
   end Queue;
120 begin -- Architecture
   Q1: Process
   begin
     wait until (Reset = '1') or (Start = '1');
     if (Reset = ',1') then
       QC_Ackn_A <= '0';
        QC_Ready_B <= '0';
       \operatorname{Q-Empty}^{2} <= \ \ ^{2}0^{2} \ ;
        Q_Full <= '0';
```

```
elsif (Start = '1') then
Queue(Reset, Start,
A_Ready_QC, B_Rd_Req_QC,
QC_Ackn_A, QC_Ready_B,
Q_Empty, Q_Full,

135 Data_AQ, Buffer_QB);
end if;
end process;

-- Entity Outputs

140 Q_Empty_Port <= Q_Empty;
Q_Full_Port <= Q_Full;
Buffer_QB_Port <= Buffer_QB;
end FSMD_Queue_dHS_no_time_ex_behavioral;
```

#### C.3.4 FSMD B

```
VHDL/doubleHS_queued_SMs/FSMD_dHS_q_no_time/FSMD_dHS_q_no_time_ex_B.vhd
  -- File:
                    FSMD\_dHS\_q\_no\_time\_ex\_B. vh d
  -- implements:
                    FSMD B
  -- of Model:
                    FSMD (without using time)
  -- for Example: FSMD(A) --> FSMD(Queue) --> FSMD(B)
                    using double handshake protocol
  {\tt Entity} \ {\tt FSMD\_B\_dHS\_q\_no\_time\_ex} \ {\tt is}
      port (Reset, Start:
                                 in bit;
            QC_Ready_B, TB_Ackn_B: in bit;
B_Rd_Req_QC, B_Ready_TB: out bit;
1.0
             Buffer_QB:
                                     in integer;
             Buffer_QB_Stop:
                                     in bit;
             Out1_Port:
                                  out integer;
             Out1_Stop_Port:
                                  out bit);
15 end FSMD_B_dHS_q_no_time_ex;
  Architecture FSMD_B_dHS_q_no_time_ex_behavioral of FSMD_B_dHS_q_no_time_ex is
20 -- behavior outputs
  signal Out1: integer;
  signal Out1_Stop: bit;
  Procedure behavior1(signal QC_Ready_B, TB_Ackn_B: in bit;
                         signal B_Rd_Req_QC, B_Ready_TB: out bit;
                         signal Buffer_QB: in integer;
                         signal Buffer_QB_Stop: in bit;
                         signal Out1: inout integer;
                         signal Out1_Stop: out bit) is
30 type State_Set is (S_1, S_2, S_3, S_4, S_5, S_6, S_END); -- S_BEGIN is represented by "Procedure not running".
  variable next_STATE: State_Set;
  variable C: integer;
  variable Stop: bit;
  begin
  next\_STATE := S_1 :
  Out1 \leq 0;
  Out1\_Stop <= \ '0';
40 B_Rd_Req_QC <= '0';
  B_Ready_TB \le 0;
  while (next_STATE/=S_END) and (Reset/='1') loop
    case next_STATE is
      when S_1 =>
                       B_Rd_Req_QC <= '1';
                       if (QC_Ready_B='1')
                         then next_STATE := S_2;
                         else next_STATE:= S_1;
                       end if;
       when S_2 =>
                       C := Buffer_QB;
                       Stop := Buffer_QB_Stop;
                       B_Rd_Req_QC <= '0';
55
                       if (QC_Ready_B='0')
                         then next\_STATE := S\_3;
                         else next_STATE:= S_2;
                       end if:
60
       when S_3 = >
                       Out1 <= 2 * C:
                       Out1\_Stop \le Stop;
```

```
next\_STATE := S\_4;
65
        when S_4 =>
                        next\_STATE := S\_5;
       when S_5 =>
                        B_Ready_TB \le '1';
70
                        if (TB_Ackn_B='1')
                          then next_STATE:=S_-6;
                          else next_STATE:=S_5;
                        end if;
75
                        B\_Ready\_TB <= '0';
       when S_6 =>
                        if (TB_Ackn_B='0')
                          then if (Stop='1')
80
                                  then next_STATE:=S_END;
                                  else next\_STATE := S\_1;
                                end if;
                          else next_STATE:=S_6;
                        end if;
85
       when S_END=> -- nothing (Procedure quits)
     end case;
     wait for 1 ns; -- Synopsys needs wait > 0 ns
   end loop;
   end behavior1;
   begin -- Architecture
   P1: Process
   begin
     wait until (Reset = '1') or (Start = '1');
     if (Reset = ', 1') then
        Out1 <= 0;
        Out1\_Stop <= '0';
        B_Rd_Req_QC <= '0';
     B_Ready_TB<='0';
elsif (Start = '1') then
        behavior1(QC_Ready_B, TB_Ackn_B, B_Rd_Req_QC, B_Ready_TB, Buffer_QB, Buffer_QB_Stop, Out1, Out1_Stop);
     end if;
110 end process;
   -- Entity Outputs
   Out1_Port <= Out1;
   Out1_Stop_Port <= Out1_Stop;
115
   end FSMD_B_dHS_q_no_time_ex_behavioral;
```

# C.3.5 A (FSMD using clock) $\rightarrow$ Queue (FSMD using clock) $\rightarrow$ B (FSMD using clock) VHDL/doubleHS\_queued\_SMs/FSMD\_dHS\_q\_clock/FSMD\_dHS\_q\_clock\_ex\_top.vhd -- File: FSMD\_dHS\_q\_clock\_ex\_top.vhd

```
-- bounding the three parts of the example
  -- for Example: FSMD(A) --> FSMD(Queue) --> FSMD(B)
                   using double handshake protocol
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE. std_logic_arith . all;
  use STD. textio. all;
  entity FSMD_dHS_q_clock_ex_top is
     port (Reset, Start:
                                in std_logic;
           Clock_A, Clock_B, Clock_Q: in std_logic;
           TB_Ready_A, TB_Ackn_B: in std_logic;
           A_Ackn_TB, B_Ready_TB: out std_logic;
1.5
                                in std_logic_vector(31 downto 0);
           In 1, In 2:
           In_Stop:
                                in std_logic;
           Out1_Port:
                               out std_logic_vector (31 downto 0);
           Out1_Stop_Port:
                               out std_logic);
20 end FSMD_dHS_q_clock_ex_top;
  architecture FSMD_dHS_q_clock_ex_top_arch of FSMD_dHS_q_clock_ex_top is
  {\tt Component FSMD\_A\_dHS\_q\_clock\_ex}
     port (Reset, Start:
                                in std_logic;
           Clock_A:
                               in std_logic;
           TB_Ready_A, QC_Ackn_A: in std_logic;
           A_Ackn_TB, A_Ready_QC: out std_logic;
           In1, In2:
                                in std_logic_vector(31 downto 0);
           In_Stop:
                                in std_logic;
30
           Data_AQ_Port:
                                 out std_logic_vector(31 downto 0);
           Data_AQ_Stop_Port:
                                  out std_logic);
  end Component;
35 Component FSMD_Queue_dHS_clock_ex
     generic (W, D: integer);
     port (Reset, Start: in std_logic;
           Clock_Q:
                               in std_logic;
           A_Ready_QC, B_Rd_Req_QC: in std_logic;
           QC_Ackn_A, QC_Ready_B: out std_logic;
40
           Q_Empty_Port, Q_Full_Port: out std_logic;
           Data_AQ:
                           in std_logic_vector(W-1 downto 0);
           Buffer_QB_Port: out std_logic_vector(W-1 downto 0));
  end Component:
  Component \ FSMD\_B\_dHS\_q\_clock\_ex
     port (Reset, Start:
                                in std_logic;
           Clock B:
                               in std_logic;
           QC_Ready_B, TB_Ackn_B: in std_logic;
           B_Rd_Req_QC , B_Ready_TB: out std_logic ;
           Buffer_QB:
                                in std_logic_vector(31 downto 0);
           Buffer_QB_Stop:
                                     in std_logic;
           Out1_Port:
                               out std_logic_vector (31 downto 0);
           Out1_Stop_Port:
                               out std_logic);
55 end Component;
  signal A_Ready_QC, B_Rd_Req_QC: std_logic;
  signal QC_Ackn_A, QC_Ready_B: std_logic;
  signal Data_AQ: std_logic_vector(31 downto 0);
60 signal Data_AQ_Stop: std_logic;
  signal Buffer_QB: std_logic_vector(31 downto 0);
  signal Buffer_QB_Stop: std_logic;
```

```
begin
65
  FSMD_A_dHS_q\_clock_I: FSMD_A_dHS_q\_clock\_ex
        port map ( Reset=>Reset, Start=>Start, Clock_A=>Clock_A,
             TB_Ready_A = TB_Ready_A, QC_Ackn_A = QC_Ackn_A,
             A_Ackn_TB=>A_Ackn_TB, A_Ready_QC=>A_Ready_QC,
             In 1 = > In 1, In 2 = > In 2,
70
             In_Stop=>In_Stop,
             Data_AQ_Port=>Data_AQ,
             Data_AQ_Stop_Port=>Data_AQ_Stop );
75 FSMD_Queue_dHS_clock_I: FSMD_Queue_dHS_clock_ex generic map (W=>33, D=>4)
        port map ( Reset=>Reset, Start=>Start, Clock_Q=>Clock_Q,
             A\_Ready\_QC => A\_Ready\_QC \;, \; B\_Rd\_Req\_QC => B\_Rd\_Req\_QC \;,
             QC_Ackn_A=>QC_Ackn_A, QC_Ready_B=>QC_Ready_B,
            Q_Empty_Port=>open, Q_Full_Port=>open, Data_AQ(31 downto 0)=>Data_AQ,
80
               Data_AQ(32) = > Data_AQ_Stop,
             Buffer_QB_Port (31 downto 0)=>Buffer_QB,
               Buffer_QB_Port (32)=>Buffer_QB_Stop);
  FSMD\_B\_dHS\_q\_clock\_I: FSMD\_B\_dHS\_q\_clock\_ex
        port map ( Reset=>Reset, Start=>Start, Clock_B=>Clock_B,
             QC_Ready_B=>QC_Ready_B, TB_Ackn_B=>TB_Ackn_B,
             B_Rd_Req_QC => B_Rd_Req_QC \; , \; \; B_Ready_TB => B_Ready_TB \; ,
             Buffer_QB=>Buffer_QB,
90
             Buffer_QB_Stop=>Buffer_QB_Stop,
             Out1_Port=>Out1_Port,
             Out1_Stop_Port=>Out1_Stop_Port );
95 end FSMD_dHS_q_clock_ex_top_arch;
```

## C.3.6 FSMD A

```
VHDL/doubleHS_queued_SMs/FSMD_dHS_q_clock/FSMD_dHS_q_clock_ex_A.vhd
  -- File:
                     FSMD\_dHS\_q\_clock\_ex\_A. vhd
  -- implements:
                    FSMD A
  -- of Model:
                    FSMD (using time)
  -- for Example: FSMD(A) --> FSMD(Queue) --> FSMD(B)
                    using double handshake protocol
 5 - -
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE. std_logic_arith . all;
10 use IEEE. std_logic_signed . all;
  Entity FSMD_A_dHS_q_clock_ex is
      port (Reset, Start:
                                   in std_logic;
            Clock_A:
                                  in std_logic;
            TB\_Ready\_A\;,\;\;QC\_Ackn\_A\;;\;\;\textbf{in}\;\;\;st\bar{d}\;\_logic\;;
1.5
            A_Ackn_TB, A_Ready_QC: out std_logic;
            In 1, In 2:
                                   in std_logic_vector(31 downto 0);
            In_Stop:
                                   in std_logic;
                                     out std_logic_vector(31 downto 0);
            Data_AQ_Port:
            Data_AQ_Stop_Port:
                                           out std_logic);
  end FSMD_A_dHS_q_clock_ex;
  Architecture FSMD_A_dHS_q_clock_ex_behavioral of FSMD_A_dHS_q_clock_ex is
25 -- outputs
  signal Data_AQ: std_logic_vector(31 downto 0);
  signal Data_AQ_Stop: std_logic;
  -- internal signals
30 type State_Set is (S_BEGIN, S_1, S_2, S_3, S_4, S_5, S_6, S_END);
  signal next_STATE: State_Set;
  signal A, B: std_logic_vector(31 downto 0);
  signal Stop: std_logic;
35 begin -- Architecture
  behavior1: process (Clock_A)
  begin
    if (Clock_A' event and Clock_A='1') then
       if (Reset = '1') then
         DataAQ \le (others = > '0');
         Data_AQ_Stop <= '0';
         A_Ackn_TB \le 0';
         A_Readv_QC <= '0 ':
         next_STATE <= S_BEGIN;
45
       else
         case next_STATE is
           when S_BEGIN => Data_AQ <= (others=>'0');
                             Data\_AQ\_Stop <= '0';
                             A\_Ackn\_TB \le 0';
                             A_Ready_QC \le 0';
                              if Start = '1'
                                \label{eq:then_next_STATE} \textbf{then} \ \ \text{next\_STATE} \ <= \ \text{S\_1} \ ;
55
                                else next_STATE <= S_BEGIN;</pre>
                             end if;
           when S_1 =>
60
                            if (TB_Ready_A='1')
                               then next_STATE<= S_2;
                               else next_STATE<= S_1;
```

```
end if;
65
             when S_2 =>
                              A\,<=\,\operatorname{In}1\,;
                              B <= \, \operatorname{In} 2 \; ;
                              S top \le In\_S top;
                              A_A ckn_T B \le 1';
70
                              if (TB_Ready_A='0')
                                then next_STATE<=S_3;
                                else next_STATE<= S_2;
                              end if;
75
             when S_3 =>
                              Data\_AQ \le A - B;
                              Data_AQ_Stop <= Stop;
                              A_A ckn_T B \le 0;
                              next\_STATE \le S\_4;
80
             when S_4 =>
                              next\_STATE \le S\_5;
85
             when S_5 =>
                              A_Ready_QC \le 1';
                              if (QC_Ackn_A='1')
                                then next_STATE\leq=S_6;
                                else next_STATE \le 5;
90
                              end if;
             when S_6 =>
                              A_Ready_QC <= '0';
                              if (QC_Ackn_A='0')
95
                                then if (Stop='1')
                                        then next_STATE<=S_END;
                                        else next\_STATE \le S_1;
                                      end if;
100
                                else next_STATE \le 6;
                              end if;
             when S_END=> -- nothing (Procedure quits)
105
           end case;
         end if;
      end if;
   end process;
   -- Entity Outputs
   Data\_AQ\_Port <= Data\_AQ;
   Data_AQ_Stop_Port <= Data_AQ_Stop;
115
   end FSMD_A_dHS_q_clock_ex_behavioral;
```

# C.3.7 FSMD Queue

```
VHDL/doubleHS_queued_SMs/FSMD_dHS_q_clock/FSMD_dHS_q_clock_ex_Queue.vhd
  -- File:
                       FSMD\_dHS\_q\_clock\_ex\_Queue. vh d
  --implements:
                       Queue
  -- of Model:
                      FSMD (using clock)
  -- for Example: FSMD(A) --> FSMD(Queue) --> FSMD(B)
                      using double handshake protocol
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE. std_logic_arith . all;
   Entity FSMD_Queue_dHS_clock_ex is
      -- pragma template
      generic (W, D: integer);
      port (Reset, Start:
                                       in std_logic;
              Clock_Q:
                                in std_logic;
1.5
              A_Ready_QC, B_Rd_Req_QC: in std_logic;
              \label{eq:continuous} \ensuremath{\operatorname{QC\_Ackn\_A}}\ , \ \ensuremath{\operatorname{QC\_Ready\_B}}\ : \ \ensuremath{\operatorname{\mathbf{out}}}\ \ \operatorname{std\_logic}\ ;
              Q_Empty_Port, Q_Full_Port: out std_logic;
                                      in std_logic_vector(W-1 downto 0);
              Data_AQ:
              Buffer_QB_Port:
                                            out std_logic_vector(W-1 downto 0));
  end FSMD_Queue_dHS_clock_ex;
   Architecture FSMD_Queue_dHS_clock_ex_behavioral of FSMD_Queue_dHS_clock_ex
25 is
  -- queue outputs
  signal Q_Empty, Q_Full: std_logic;
  signal Buffer_QB: std_logic_vector(W-1 downto 0);
30 type State_Set is (S_0, W_1, W_2, R_1, R_2, R_3, S_END); -- S_BEGIN is represented by "Procedure not running".
  signal next_STATE: State_Set;
  type fifo_array is ARRAY(D-1 downto 0) of std_logic_vector(W-1 downto 0);
  signal Q: fifo_array;
35 signal C: integer range -1 to D-1;
  begin
  Queue: Process (Clock_Q)
40 begin
     if (Clock_Q'event and Clock_Q='1') then
       if (Reset = '1') then
next_STATE<= S_0;
          QC_Ackn_A <= '0';
          QC_Ready_B \le 0;
45
          C <= -1 ; -- empty;
          QEmpty \langle = '1';
          \mathrm{Q} \, \underline{\ } \, \mathrm{F} \, \mathrm{u} \, \mathrm{l} \, \mathrm{l} \ < = \ ' \, 0 \, ' \, ;
        else
          case next_STATE is
            when S_0 => QC_Ackn_A <= '0';
                            QC_Ready_B \le 0;
                            if (Q_Empty='0' and B_Rd_Req_QC='1')
                              then next_STATE \leq R_1;
55
                            elsif (Q_Full='0' and A_Ready_QC='1')
                              then next_STATE<= W_1;
                            end if;
            when W_1 = C = C+1;
60
                            for i in D-1 downto 1 loop
                              Q(i) \le Q(i-1);
                            end loop;
```

```
next_STATE \le W_2;
65
            when W_2 => Q(0) <= Data_AQ;
                          if ( C=−1 )
                           then QEmpty <= '1';
                          else
                            \mathrm{Q}\_\mathrm{Empty} \ <= \ \ '0 \ ';
70
                          end if;
                          if (C=D-1)
                            then Q_Full <= '1';
                          else
                            Q_Full <= '0';
75
                          end if;
                          QC\_Ackn\_A <= '1';
                          if (A_Ready_QC='0')
                            then next_STATE <= S_0;
80
                            next_STATE \le W_2;
                         end if;
            when R_{-1} => -- needed for conversion into "FSM plus Datapath"
85
                          next\_STATE \le R_2;
            when R_2 => Buffer_QB <= Q(C);
                         C <= C-1;
90
                          next\_STATE \le R\_3;
            when R_3 => QC_Ready_B <= '1';
                          if (C=−1)
95
                            then Q_Empty <= '1';
                          else
                            Q_Empty \langle = '0';
                          end if;
100
                          if (C=D-1)
                            then Q_Full <= '1';
                          else
                            Q_Full <= '0';
                          end if;
105
                          if (B_Rd_Req_QC='0')
                            then next_STATE \leq S_0;
                          else
                            next_STATE \le R_3;
110
                         end if;
            when S_END=> -- nothing (Procedure quits)
                            -- (will never happen though)
115
          end case;
       end if:
     end if;
   end process;
120 Buffer_QB_Port <= Buffer_QB;</pre>
   end FSMD_Queue_dHS_clock_ex_behavioral;
```

## C.3.8 FSMD B

```
VHDL/doubleHS_queued_SMs/FSMD_dHS_q_clock/FSMD_dHS_q_clock_ex_B.vhd
  -- File:
                    FSMD\_dHS\_q\_clock\_ex\_B. vhd
  -- implements:
                    FSMD B
  -- of Model:
                    FSMD (using clock)
  -- for Example: FSMD(A) --> FSMD(Queue) --> FSMD(B)
                    using double handshake protocol
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE. std_logic_arith . all;
10 use IEEE. std_logic_signed.all;
  -- FSMD B of example of two FSMD (with clock) exchanging data over a queue
  Entity FSMD_B_dHS_q_clock_ex is
     port (Reset, Start:
                                  in std_logic;
            Clock_B:
                                 in std_logic;
            QC_Ready_B, TB_Ackn_B: in std_logic;
B_Rd_Req_QC, B_Ready_TB: out std_logic;
            Buffer_QB:
                                  in std_logic_vector(31 downto 0);
            Buffer_QB_Stop:
                                       in std_logic;
20
            Out1_Port:
                                 out std_logic_vector (31 downto 0);
            Out1\_Stop\_Port:
                                 out std_logic);
  end FSMD_B_dHS_q_clock_ex;
  Architecture FSMD_B_dHS_q_clock_ex_behavioral of FSMD_B_dHS_q_clock_ex is
  -- outputs
  signal Out1: std_logic_vector(31 downto 0);
  signal Out1_Stop: std_logic;
  -- internal signals
  type State_Set is (S_BEGIN, S_1, S_2, S_3, S_4, S_5, S_6, S_END); -- S_BEGIN is represented by "Procedure not running".
  signal next_STATE: State_Set;
35 signal C: std_logic_vector(31 downto 0);
  signal Stop: std_logic;
  begin -- Architecture
40 behavior1: process (Clock_B)
  begin
    if (Clock_B'event and Clock_B='1') then
       if (Reset = '1') then
         Out1 <= (others=>'0');
         Out1\_Stop <= '0';
         B \_Rd \_Req \_QC <= `0';
         B_Ready_TB \le 0;
         next_STATE <= S_BEGIN;
       else
         case next_STATE is
           when S_BEGIN => Out1 <= (others=>'0');
                             Out1_Stop <='0';
                             B_Rd_Req_QC \le 0;
                             B\_Ready\_TB <= '0';
55
                             if Start = '1'
                               then next_STATE \leq S_1;
                               else next_STATE <= S_BEGIN:
                             end if;
60
           when S_1 = >
                           B_Rd_Req_QC <= '1';
```

```
if (QC_Ready_B='1')
                             then next STATE \le S_2;
65
                             else next_STATE<= S_1;</pre>
                           end if;
                           C <= Buffer_QB;
            when S_2 =>
                           Stop <= Buffer_QB_Stop;
70
                           B_Rd_Req_QC <= '0';
                           if (QC_Ready_B='0')
                             then next_STATE \leq S_3;
                             else next_STATE<= S_2;
75
                           end if;
            when S_3 =>
                           Out1 <= C(28 \text{ downto } 0) * "010";
                           Out1\_Stop \le Stop;
80
                           next\_STATE \le S\_4;
            when S_4 =>
                           next\_STATE \le S\_5;
85
            when S_5 =>
                           B_Ready_TB \le '1';
                           if (TB_Ackn_B='1')
                             then next_STATE<=S_6;
90
                             else next_STATE<=S_5;
                           end if;
            when S_6 =>
                           B_Ready_TB \le 0';
95
                           if (TB\_Ackn\_B='0')
                             then if (Stop = '1')
                                     then next_STATE<=S_END;
                                     else next_STATE \le 1;
100
                                   end if;
                              else next\_STATE \le S\_6;
                           end if;
            when S_END=> -- nothing (Procedure quits)
105
         end case;
        end if;
      end if;
110 end process;
   -- Entity Outputs
   Out1\_Port <= Out1;
115 Out1_Stop_Port <= Out1_Stop;
   end FSMD_B_dHS_q_clock_ex_behavioral;
```

# C.3.9 A (FSM Controlling Datapath) $\rightarrow$ Queue (FSM Controlling Datapath) $\rightarrow$ B (FSM Controlling Datapath)

```
VHDL/doubleHS_queued_SMs/FSM_dHS_q_clock/FSM_and_D_dHS_q_clock_ex_top.vhd
                  FSM\_and\_D\_dHS\_q\_clock\_ex\_top , vhd
  -- bounding the three parts of the example
  -- for Example: FSM, DP(A) --> FSM, DP(Queue) --> FSM, DP(B)
                  using double handshake protocol
  library IEEE;
  use IEEE.std_logic_1164.all;
  use IEEE.std_logic_arith.all;
10 entity FSM_and_D_dHS_q_clock_ex_top is
     port (Reset, Start:
                              in std_logic;
           Clock_A, Clock_B, Clock_Q: in std_logic;
           TB_Ready_A, TB_Ackn_B: in std_logic;
           A_Ackn_TB, B_Ready_TB: out std_logic;
           In1, In2:
                                in std_logic_vector(31 downto 0);
           In_Stop:
                                in std_logic;
           Out1_Port:
                               out std_logic_vector (31 downto 0);
           Out1_Stop_Port:
                               out std_logic);
  end FSM_and_D_dHS_q_clock_ex_top;
  architecture FSM_and_D_dHS_q_clock_ex_top_arch of FSM_and_D_dHS_q_clock_ex_top is
  Component FSM_and_D_A_dHS_q_clock_ex
     port (Reset, Start:
                                in std_logic;
           Clock_A:
                               in std_logic;
           TB_Ready_A, QC_Ackn_A: in std_logic;
           A_Ackn_TB , A_Ready_QC: out std_logic ;
                                in std_logic_vector(31 downto 0);
           In 1, In 2:
           In_Stop:
                                in std_logic;
                                  out std_logic_vector(31 downto 0);
30
           Data_AQ_Port:
           Data_AQ_Stop_Port:
                                  out std_logic);
  end Component;
  Component FSM_and_D_Queue_dHS_clock_ex
     generic (Q_W, Q_D, S_W: integer);
     port (Reset, Start: in std_logic;
           Clock_Q:
                               in std_logic;
           A_Ready_QC, B_Rd_Req_QC: in std_logic;
           QC_Ackn_A, QC_Ready_B: out std_logic;
           Q_Empty_Port, Q_Full_Port: out std_logic;
40
           Data_AQ:
                          in std_logic_vector(Q_W-1 downto 0);
           Buffer_QB_Port: out std_logic_vector(Q_W-1 downto 0));
  end Component;
45 Component FSM_and_D_B_dHS_q_clock_ex
     port (Reset, Start:
                                in std_logic;
           Clock_B:
                               in std_logic;
           QC_Ready_B, TB_Ackn_B: in std_logic;
           B_Rd_Req_QC , B_Ready_TB: out std_logic ;
                                in std_logic_vector (31 downto 0);
           Buffer_QB:
50
                                     in std_logic;
           Buffer_QB_Stop:
           Out1_Port:
                               out std_logic_vector (31 downto 0);
           Out1_Stop_Port:
                               out std_logic);
  end Component;
  signal A_Ready_QC, B_Rd_Req_QC: std_logic;
  signal QC_Ackn_A, QC_Ready_B: std_logic;
  signal Data_AQ: std_logic_vector(31 downto 0);
  signal Data_AQ_Stop: std_logic;
60 signal Data_AQ_w_Stop: std_logic_vector(32 downto 0);
  signal Buffer_QB: std_logic_vector(31 downto 0);
  signal Buffer_QB_Stop: std_logic;
```

```
signal Buffer_QB_w_Stop: std_logic_vector(32 downto 0);
65 begin
  FSM\_and\_D\_A\_dHS\_q\_clock\_I: FSM\_and\_D\_A\_dHS\_q\_clock\_ex
       port map ( Reset=>Reset, Start=>Start, Clock_A=>Clock_A,
            TB_Ready_A => TB_Ready_A , \quad QC_Ackn_A => QC_Ackn_A ,
            A_Ackn_TB=>A_Ackn_TB, A_Ready_QC=>A_Ready_QC,
70
            In 1 = > In 1, In 2 = > In 2,
            In_Stop => In_Stop,
            Data_AQ_Port=>Data_AQ,
            Data_AQ_Stop_Port=>Data_AQ_Stop );
  FSM\_and\_D\_Queue\_dHS\_clock\_I: FSM\_and\_D\_Queue\_dHS\_clock\_ex
       {\tt generic \ map \ (Q_W=>33, \ Q_D=>4, \ S_W=>2)}
       port map ( Reset=>Reset, Start=>Start, Clock_Q=>Clock_Q,
            A_Ready_QC=>A_Ready_QC, B_Rd_Req_QC=>B_Rd_Req_QC,
            \label{eq:qc_Ackn_A=>QC_Ackn_A} QC\_Ready\_B=>QC\_Ready\_B\,,
            Q_Empty_Port=>open, Q_Full_Port=>open,
            Data_AQ=>Data_AQ_w_Stop,
            Buffer_QB_Port=>Buffer_QB_w_Stop);
85 Data_AQ_w_Stop <= Data_AQ_Stop & Data_AQ;
  Buffer_QB <= Buffer_QB_w_Stop(31 downto 0);
  Buffer_QB_Stop <= Buffer_QB_w_Stop(32);
  FSM_and_D_B_dHS_q\_clock_I: FSM_and_D_B_dHS_q\_clock\_ex
       port map ( Reset=>Reset, Start=>Start, Clock_B=>Clock_B,
            QC_Ready_B=>QC_Ready_B, TB_Ackn_B=>TB_Ackn_B,
            B_Rd_Req_QC => B_Rd_Req_QC, B_Ready_TB => B_Ready_TB,
            Buffer_QB=>Buffer_QB,
            Buffer_QB_Stop=>Buffer_QB_Stop,
            Out1_Port=>Out1_Port,
95
            Out1_Stop_Port=>Out1_Stop_Port );
  end FSM_and_D_dHS_q_clock_ex_top_arch;
```

# C.3.10 FSM Controlling Datapath A

```
\label{thm:condition} WHDL/double HS\_queued\_SMs/FSM\_dHS\_q\_clock/FSM\_and\_D\_dHS\_q\_clock\_ex\_A \ . \ vhd
                    FSM\_and\_D\_dHS\_q\_clock\_ex\_A . vhd
  -- File:
                   bounding of FSM A and D at a path A
  -- implements:
  -- of Model:
                   FSM and separate Datapath
  -- for Example: FSM, DP(A) --> FSM, DP(Queue) --> FSM, DP(B)
                   using double handshake protocol
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE. std_logic_arith . all;
  Entity FSM_and_D_A_dHS_q_clock_ex is
     port (Reset, Start:
                                 in std_logic;
            Clock_A:
                                in std_logic;
            TB_Ready_A, QC_Ackn_A: in std_logic;
            A_Ackn_TB, A_Ready_QC: out std_logic;
1.5
                                 in std_logic_vector (31 downto 0);
            In 1. In 2:
            In\_Stop:
                                 in std_logic;
            Data_AQ_Port:
                                   out std_logic_vector (31 downto 0);
            Data_AQ_Stop_Port:
                                         out std_logic);
20 end FSM_and_D_A_dHS_q_clock_ex;
  Architecture FSM_and_D_A_dHS_q_clock_ex_structural of FSM_and_D_A_dHS_q_clock_ex is
  -- outputs
25 signal Data_AQ: std_logic_vector (31 downto 0);
  signal Data_AQ_Stop: std_logic;
  signal DP_Reset:
                       std_logic;
  signal ld_A , ld_B , ld_Stop : std_logic ;
30 signal StopMsg: std_logic;
  signal ALU_M: std_logic_vector (1 downto 0);
  signal ld_O: std_logic;
  signal CMP: std_logic_vector(1 downto 0);
  signal MUX_sel: std_logic;
  Component DP
     port (Clock:
                             in std_logic;
            DP_Reset:
                             in std_logic;
            ld_A , ld_B , ld_Stop: in std_logic;
                             out std_logic;
40
            StopMsg:
            ALU_M:
                             in std_logic_vector (1 downto 0);
            ld_O :
                             in std_logic;
            In 1, In 2:
                             in std_logic_vector(31 downto 0);
            In_Stop:
                             in std_logic;
            O_Port:
                             out std_logic_vector (31 downto 0);
            Out_Stop:
                             out std_logic);
  end Component;
  Component FSM_A_dHS_q_clock_ex
     port (Reset, Start:
                                    in std_logic;
                                   in std_logic;
            Clock_A:
            TB_Ready_A, QC_Ackn_A: in std_logic;
            A_Ackn_TB, A_Ready_QC: out std_logic;
            DP_Reset:
                                   out std_logic;
55
            ld_A , ld_B , ld_Stop: out std_logic;
            StopMsg:
                                    in std_logic;
            ALU_M:
                                    out std_logic_vector (1 downto 0);
            ld_O:
                                   out std_logic);
  end Component;
  begin -- Architecture
  DP_A_I: DP
       Port Map (Clock=>Clock_A, DP_Reset=>DP_Reset,
```

```
\label{eq:ld_A} $$ $ \operatorname{Id_A} = \operatorname{Id_A} , $ \operatorname{Id_B} = \operatorname{Id_B} , $ \operatorname{Id_Stop} = \operatorname{Id_Stop} , $$
                       StopMsg => StopMsg, ALU_M => ALU_M, Id_O => Id_O,
65
                        In1 => In1, In2 => In2,
                        In_Stop => In_Stop,
                       O_Port=>Data_AQ, Out_Stop=>Data_AQ_Stop);
70 FSM_A_dHS_q\_clock_I: FSM_A_dHS_q\_clock\_ex
         Port Map (Reset=>Reset, Start=>Start, Clock_A=>Clock_A,
                        TB_Ready_A = TB_Ready_A, QC_Ackn_A = QC_Ackn_A,
                        A\_A ckn\_TB => A\_A ckn\_TB , \quad A\_Ready\_QC => A\_Ready\_QC ,
                        {\rm DP\_Reset}{=}{>}{\rm DP\_Reset}\;,
75
                        ld_A => ld_A, ld_B => ld_B, ld_S top => ld_S top,
                       StopMsg=>StopMsg, ALU_M=>ALU_M, ld_O=>ld_O);
   -- Entity Outputs
80 Data_AQ_Port <= Data_AQ;
   Data_AQ_Stop_Port <= Data_AQ_Stop;
  end FSM_and_D_A_dHS_q_clock_ex_structural;
```

#### C.3.11 FSM of A

```
\label{thm:local_vhole} \begin{tabular}{ll} WHDL/double HS\_queued\_SMs/FSM\_dHS\_q\_clock/FSM\_dHS\_q\_clock\_ex\_A.vhd \\ \end{tabular}
  -- File:
                     FSM\_dHS\_q\_clock\_ex\_A . vhd
                     FSM A
  -- implements:
  -- of Model:
                     FSM and separate Datapath
  -- for Example: FSM, DP (A) --> FSM, DP (Queue) --> FSM, DP (B)
                     using double handshake protocol
 5 - -
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE. std_logic_arith . all;
  Entity FSM_A_dHS_q_clock_ex is
      port (Reset, Start:
                                    in std_logic;
                                      in std_logic;
             Clock_A:
             TB_Ready_A, QC_Ackn_A: in std_logic;
             A_Ackn_TB, A_Ready_QC: out std_logic;
1.5
             DP_Reset:
                                      out std_logic;
             ld_A , ld_B , ld_Stop :
                                      out std_logic;
             StopMsg:
                                       in std_logic;
             ALU_M:
                                      out std_logic_vector (1 downto 0);
             ld_O:
                                      out std_logic);
20
  end FSM_A_dHS_q_clock_ex;
  Architecture FSM_A_dHS_q_clock_ex_behavioral of FSM_A_dHS_q_clock_ex is
  type State_Set is (S_BEGIN, S_1, S_2, S_3, S_4, S_5, S_6, S_END);
  signal next_STATE, STATE: State_Set;
  begin -- Architecture
  transition: Process
  begin
     wait until (Clock_A'event and Clock_A = '1');
    STATE \le next\_STATE;
35 end process;
  behavior1: process(STATE, Start, TB_Ready_A, QC_Ackn_A, StopMsg)
  begin
       if (Reset = '1') then
             DP_Reset <= '1';
             A_Ackn_TB <= '0';
             A_Ready_QC <= '0';
                        <= '0';
             ld_A
                        <= '0':
             ld_B
                        <= '0';
             ld_Stop
45
                        <= "--";
             ALU_M
             ld_O
                        <= '0';
         next_STATE <= S_BEGIN;
       else
         case STATE is
           \begin{array}{lll} \mbox{when $S$\_BEGIN =>$ DP$\_Reset & <= '1'; \\ & A\_Ackn\_TB <= '0'; \end{array}
                              A_Ready_QC \le '0';
                                          <= '0';
55
                              ld A
                              ld_B
                                          <= '0';
                              ld_Stop
                                          <= '0';
                                          <= "--";
                              ALU_M
                              ld_O
                                          <= '0':
60
                             if Start = '1'
                                then next_STATE \leq S_1;
                                else next_STATE <= S_BEGIN;
```

```
end if;
65
                                  \begin{array}{lll} {\rm DP\_Reset} & <= \ '0\ '; \\ {\rm A\_Ackn\_TB} & <= \ '0\ '; \end{array}
              when S_1 =>
                                   A_Ready_QC <= '0';
                                               <= '0';
                                   ld_{-}A
                                   ld_B
                                               <= ', 0';
70
                                               <= ',0';
<= "--";
                                   ld_Stop
                                  ALU_M
                                   ld_O
                                               <= 0.75
75
                                 if (TB_Ready_A='1')
                                    then next_STATE<= S_2;
                                    else next_STATE<= S_1;
                                 end if;
                                   DP_Reset <= '0';
A_Ackn_TB <= '1';</pre>
               when S_2 =>
80
                                   A_Ready_QC <= '0';
                                   ld_A
                                               <= '1';
                                   ld\_B
                                               <= '1';
                                               <= '1';
<= "11";
                                   ld_Stop
85
                                  ALU_M
                                               <= '0';
                                   ld_O
                                 if (TB_Ready_A='0')
                                    then next_STATE<=S_3:
90
                                    else next_STATE<= S_2;
                                 end if;
                                  DP_Reset <= '0';
A_Ackn_TB <= '0';</pre>
               when S_3 =>
95
                                   A_Ready_QC \le '0';
                                              <= '0';
                                   ld_A
                                               <= '0';
                                   ld_B
                                   ld_Stop
                                               <= '0';
100
                                   ALU_M
                                               <= "11";
                                               <= '1';
                                   ld_O
                                 next\_STATE \le S\_4;
               when S_4 =>
                                   DP_Reset <= '0';
105
                                   A\_Ackn\_TB <= '0';
                                   A_Ready_QC \le '0';
                                               <= '0';
                                   ld_A
                                   ld_B
                                               <= '0';
                                               <= '0';
<= "--";
110
                                   ld_Stop
                                   ALU_M
                                               <= '0';
                                   ld_O
                                 next\_STATE \le S\_5;
115
               when S_5 =>
                                   DP_Reset <= '0';
                                   A\_Ackn\_TB <= '0';
                                   A_Ready_QC <= '1';
                                               <= '0';
                                   ld_A
                                               <= '0';
                                   ld_B
120
                                   \operatorname{ld} \_ \operatorname{Stop}
                                               <= '0';
                                               <= "--";
                                   ALU_M
                                   ld_O
                                               <= 0.75
                                 if (QC_Ackn_A='1')
125
                                    then next_STATE<=S_6;
                                    else next\_STATE \le 5;
                                 end if;
```

```
DP_Reset <= '0';
A_Ackn_TB <= '0';
A_Ready_QC <= '0';
Id_A <= '0';
Id_B <= '0';
130
                when S_6 =>
                                                    <= '0';
<= "--";
                                      ld_Stop
135
                                      ALU_M
                                      ld_O
                                                    <= '0';
                                     if (QC_Ackn_A='0')
                                       then if (StopMsg='1')
then next_STATE<=S_END;
140
                                                  else next_STATE<=S_1;</pre>
                                               end if;
                                        else next_STATE<=S_6;
                                     end if;
145
                 when S_END=> -- nothing (Procedure quits)
              end case;
           end if;
150
    end process;
    \mathbf{end} \ \mathrm{FSM\_A\_dHS\_q\_clock\_ex\_behavioral};
```

# C.3.12 Datapath of A (identical B)

```
VHDL/doubleHS_queued_SMs/Datapath/DP.vhd
  library IEEE;
  use IEEE.std_logic_1164.all;
  use IEEE.std_logic_arith.all;
5 -- File:
                   DP, vhd
  -- implements: Datapath (instanced twice: for A and for B)
  -- for Example: FSMD(A) --> Memory --> FSMD(B)
                         <--> FSM <-->
                   using double handshake protocol
10
  Entity DP is
     port (Clock:
                            in std_logic;
                            in std_logic;
            DP_Reset:
            ld_A , ld_B , ld_Stop : in std_logic ;
            StopMsg:
                             out std_logic;
1.5
            ALU_M:
                             in std_logic_vector (1 downto 0);
            ld_O :
                            in std_logic;
            In\,1\;,\quad In\,2:
                            in std_logic_vector(31 downto 0);
            In_Stop:
                            in std_logic;
            O_Port:
                            out std_logic_vector (31 downto 0);
20
            Out_Stop:
                            out std_logic);
  end DP:
  Architecture DP_schematic of DP is
25
     -- connection signals
     signal A, B, O: std_logic_vector(31 downto 0);
     signal ALU_Out: std_logic_vector(31 downto 0);
     signal Stop:
                      std_logic;
30
     component Reg_32bit
        Port ( Clock:
                          in std_logic;
                Reset:
                          in std_logic;
                Load:
                          in std_logic;
                Data_In: in std_logic_vector (31 downto 0);
35
                Data_Out: out std_logic_vector (31 downto 0));
     end component;
     component Latch_impl
        Port ( Clock:
                            in std_logic;
40
                Reset:
                            in std_logic;
                Load:
                            in std_logic;
                Data_In :
                           in std_logic;
                Data_Out: out std_logic);
     end component;
     component ALU_32
                            in std_logic_vector (1 downto 0);
        Port ( Mode:
                Data_In1:
                            in std_logic_vector (31 downto 0);
                Data_In2:
                            in std_logic_vector (31 downto 0);
                Data_Out:
                            out std_logic_vector (31 downto 0));
     end component;
  begin
  Register_A: Reg_32bit
     Port Map ( Clock=>Clock, Reset=>DP_Reset, Load=>ld_A, Data_In=>In1, Data_Out=>A);
60 Register_B: Reg_32bit
     Port Map ( Clock=>Clock, Reset=>DP_Reset, Load=>ld_B,
                 Data_In=>In2, Data_Out=>B);
```

```
Latch_Stop: Latch_impl
Port Map ( Clock=>Clock, Reset=>DP_Reset, Load=>ld_Stop, Data_In=>In_Stop, Data_Out=>Stop);

ALU: ALU_32
Port Map ( Mode=>ALU_M, Data_In1=>A, Data_In2=>B, Data_Out=>ALU_Out);

Register_O: Reg_32bit
Port Map ( Clock=>Clock, Reset=>DP_Reset, Load=>ld_O, Data_In=>ALU_Out, Data_Out=>O);

O_Port <= O;
Out_Stop <= Stop;
StopMsg <= Stop;

80 end DP_schematic;
```

# C.3.13 FSM Controlling Datapath Queue

```
VHDL/doubleHS_queued_SMs/FSM_dHS_q_clock/FSM_and_D_dHS_q_clock_ex_Queue.vhd
                     FSM\_and\_D\_dHS\_q\_clock\_ex\_Queue. vhd
  -- File:
                     bounding of FSM and Datapath of Queue
  -- implements:
  -- of Model:
                     FSM and separate Datapath
  -- for Example: FSM, DP(A) --> FSM, DP(Queue) --> FSM, DP(B)
                     using double handshake protocol
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE. std_logic_arith . all;
  Entity FSM_and_D_Queue_dHS_clock_ex is
      -- pragma template
      generic (Q_W, Q_D, S_W: integer);
port (Reset, Start: in std_
                                     in std_logic;
                              in std_logic;
             Clock_Q:
1.5
             A_Ready_QC, B_Rd_Req_QC: in std_logic;
             QC_Ackn_A, QC_Ready_B: out std_logic;
             Q_Empty_Port, Q_Full_Port: out std_logic;
                                    \  \, \textbf{in} \  \  \, \textbf{std\_logic\_vector} \, \, \big( \textbf{Q\_W-1} \  \, \, \textbf{downto} \  \  \, 0 \, \big) \, ; \\
             Data_AQ:
             Buffer_QB_Port:
                                         out std_logic_vector(Q_W-1 downto 0));
  end FSM_and_D_Queue_dHS_clock_ex;
  Architecture FSM_and_D_Queue_dHS_clock_ex_behavioral of FSM_and_D_Queue_dHS_clock_ex is
25
  Component Queue_Buffered
      generic ( Q_W, Q_D, S_W: integer);
      port ( Clock: in std_logic;
              Q_Reset: in std_logic;
              Q_En, Q_Wr: in std_logic;
30
              Buffer_ld: in std_logic;
              Q_In: in std_logic_vector(Q_W-1 downto 0);
              Buffer_Out: out std_logic_vector(Q_W-1 downto 0);
              Q_Full, Q_Empty: out std_logic);
      end Component;
35
  Component Queue_Buffered_Control
      port ( Clock_Q: in std_logic;
              Q_Reset: in std_logic;
              Q_Start: in std_logic;
40
              Q_Wr, Q_En, Buffer_ld: out std_logic;
              Q_Full, Q_Empty: in std_logic;
              A_Ready_QC, B_Rd_Req_QC: in std_logic;
              QC_Ackn_A, QC_Ready_B: out std_logic);
      end Component;
  -- queue outputs
  signal Buffer_QB: std_logic_vector(Q_W-1 downto 0);
  signal Buffer_ld: std_logic;
50 signal Q_Start: std_logic;
  signal Q_En, Q_Wr: std_logic;
  signal Q_Full, Q_Empty: std_logic;
  begin
  Queue_Buffered_I: Queue_Buffered
       \label{eq:continuous_problem} \texttt{generic} \quad \texttt{Map} \; \left( \texttt{Q\_W=>Q\_W}, \; \; \texttt{Q\_D=>Q\_D} \;, \; \; \texttt{S\_W=>S\_W} \right)
       Port Map ( Clock=>Clock_Q,
                    Q_Reset=>Reset .
                    Q_En=>Q_En, Q_Wr=>Q_Wr,
60
                    Buffer_ld => Buffer_ld,
                    Q_In = Data_AQ,
                    Buffer_Out=>Buffer_QB,
```

# C.3.14 FSM of Queue

```
\label{thm:local_vhd} WHDL/double HS\_queued\_SMs/FSM\_dHS\_q\_clock/FSM\_dHS\_q\_clock\_ex\_Queue \ . vhd \\
                     FSM\_dHS\_q\_clock\_ex\_Queue . vhd
  -- File:
  -- implements:
                     FSM \ of \ Queue
  -- of Model:
                     FSM \ and \ separate \ Datapath
  -- for Example: FSM, DP (A) --> FSM, DP (Queue) --> FSM, DP (B)
                     using double handshake protocol
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE. std_logic_arith . all;
  Entity Queue_Buffered_Control is
      Port ( Clock_Q: in std_logic;
              Q_Reset: in std_logic;
              Q_Start: in std_logic;
              Q_Wr, Q_En, Buffer_ld: out std_logic;
1.5
              Q_Full, Q_Empty: in std_logic;
              A_Ready_QC, B_Rd_Req_QC: in std_logic;
              QC_Ackn_A, QC_Ready_B: out std_logic);
  end Queue_Buffered_Control;
  Architecture Queue_Buffered_Control_beh of Queue_Buffered_Control is
  type State_Set is (S_0, W_1, W_2, R_1, R_2, R_3, S_END);
25 signal next_STATE, STATE: State_Set;
  begin
  transition: Process
30 begin
     wait until (Clock_Q'event and Clock_Q = '1');
    STATE <= next_STATE;
  end process;
35 Queue: Process(STATE, Q_Start, Q_Full, Q_Empty, A_Ready_QC, B_Rd_Req_QC)
       if (Q_Reset = '1') then
         Q_En <= '0';
         Q_Wr <= '0';
         QC_Ackn_A <= '0';
40
         QC\_Ready\_B \le '0';
         Buffer_ld \langle = '0';
         next\_STATE \le S\_0;
       else
         case STATE is
45
           when S_0 => Q_E n <= 0;
                         Q_{-}Wr <= '-';

QC_{-}Ackn_{-}A <= '0';
                         QC_Ready_B \le 0;
                          Buffer_ld <= '0';
                          if (Q_Empty='0' and B_Rd_Req_QC='1')
                            then next_STATE \le R_1;
                          elsif (Q_Full='0' and A_Ready_QC='1')
55
                            then next_STATE <= W_1;
                         end if;
           when W_1 => Q_E n <= '1';
                         Q_W r <= '1';
                         \mathrm{QC}_{-}\mathrm{Ackn}_{-}\mathrm{A} \quad <= \ ^{\prime}0\ ^{\prime};
60
                         QC_Ready_B \le 0;
                          Buffer_ld \langle = '0';
```

```
next_STATE \le W_2;
65
              when W_2 => Q_E n <= '0';
                              Q_En <= 0;
Q_Wr <= '-';
QC_Ackn_A <= '1';
QC_Ready_B <= '0';
                              Buffer_ld <= '0';
70
                              if (A_Ready_QC='0')
                                 then next_STATE \leq S_0;
                               _{
m else}
                                 next_STATE \le W_2;
75
                              end if;
              when R_1 => Q_E n <= '1';
                              Q_Wr <= '0';
                              \mathrm{QC\_Ackn\_A} \quad <= \ ^{,}0\ ^{,};
80
                              QC_Ready_B <= '0';
Buffer_ld <= '1';
                              next_STATE <= R_2;
85
              when R_2 => Q_E n <= '0';
                              QWr <= 0.7
                              QC_Ackn_A <= '0';
                              QC_Ready_B \le 0;
                              Buffer_ld <= '0';
90
                              next\_STATE \le R\_3;
              when R_3 => Q_En <= '0';  
Q_Wr <= '-';  
QC_Ackn_A <= '0';
95
                              QC_Ready_B <= '1';
Buffer_ld <= '0';
100
                              if (B_Rd_Req_QC='0')
                                 then next_STATE \leq S_0;
                              _{
m else}
                                 next\_STATE \le R\_3;
                              end if;
105
              when S\_END=>
           end case;
         end if;
110 end process;
   end Queue_Buffered_Control_beh;
```

#### C.3.15 Datapath of Queue (Buffered)

```
VHDL/doubleHS_queued_SMs/Datapath_Queue/DP_all.vhd
  -- File:
                   DP\_all.vhd
  -- implements: Datapath of Buffered Queue
  -- for Example: FSM, DP (A) --> FSM, DP (Queue) --> FSM, DP (B)
                   using double handshake protocol
  library IEEE;
  use IEEE.std_logic_1164.all;
  use IEEE. std_logic_arith . all;
10 Entity Queue_Buffered is
     \tt generic ( Q\_W, Q\_D, S\_W: integer);
     port ( Clock: in std_logic;
             Q_Reset: in std_logic;
Q_En, Q_Wr: in std_logic;
1.5
             Buffer_ld: in std_logic;
             Q_In: in std_logic_vector(Q_W-1 downto 0);
             Buffer_Out: out std_logic_vector(Q_W-1 downto 0);
             Q_Full, Q_Empty: out std_logic);
     end Queue_Buffered;
  Architecture Queue_Buffered_Arch of Queue_Buffered is
  signal Q_Out: std_logic_vector (Q_W-1 downto 0);
  signal Buffer_In: std_logic_vector(Q_W-1 downto 0);
  signal Q_N_Reset: std_logic;
  Component Queue
     \tt generic ( Q\_W, Q\_D, S\_W: integer);
     port ( Clock: in std_logic;
             Q_Reset: in std_logic;
             Q_En, Q_Wr: in std_logic;
30
             Q_In: in std_logic_vector(Q_W-1 downto 0);
             Q_Out: out std_logic_vector(Q_W-1 downto 0);
             Q_Full, Q_Empty: out std_logic);
     end Component;
35
  Component D_FF
      port ( Clk: in std_logic;
Rst, En: in std_logic;
              D: in std_logic;
              Q, Q_T: out std_logic );
      end Component;
  begin
45 Queue_I: Queue
      generic map(Q_W=>Q_W, Q_D=>Q_D, S_W=>S_W)
      port map (Clock=>Clock,
             Q_Reset = Q_Reset,
             Q_En=>Q_En, Q_Wr=>Q_Wr,
             Q_In = Q_In
             Q_Out = Q_Out,
             Q_Full=>Q_Full, Q_Empty=>Q_Empty);
  D_FFs: for i in Q_W-1 downto 0 generate
            FF: D_FF
                port map ( Clk=>Clock,
                            Rst = >Q_N_Reset, En = >Buffer_ld,
                            D=>Buffer_In(i),
                            Q=>Buffer_Out(i), Q_T=>open);
60
         end generate;
  Buffer_In <= Q_Out;
  Q_N_Reset <= not Q_Reset;
```

65 end Queue\_Buffered\_Arch;

# C.3.16 Datapath of Unbuffered Queue

```
VHDL/doubleHS_queued_SMs/Datapath_Queue/DP_Queue.vhd
  -- File:
                   DP\_all.vhd
  -- implements: Datapath of Queue (which will be wrapped by buffers)
  -- for Example: FSM, DP (A) --> FSM, DP (Queue) --> FSM, DP (B)
                   using double handshake protocol
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE. std_logic_arith . all;
  use IEEE. Std_Logic_unsigned . ALL;
  -- Queue
  Entity Queue is
     generic (Q_W, Q_D, S_W: integer);
     port ( Clock: in std_logic;
            Q_Reset: in std_logic;
            Q_En, Q_Wr: in std_logic;
            Q_In: in std_logic_vector(Q_W-1 downto 0);
            Q_Out: out std_logic_vector(Q_W-1 downto 0);
            Q_Full, Q_Empty: out std_logic);
20
     end Queue; -- pragma template
  Architecture Queue_arch of Queue is
25 type fifo_sh_transp is ARRAY(Q_W-1 downto 0) of std_logic_vector(Q_D-1 downto 0);
  signal Q_Sh, Q_Sel: fifo_sh_transp;
  type fifo_sh is ARRAY(Q_D-1 downto 0) of std_logic_vector(Q_W-1 downto 0);
  signal Q: fifo_sh;
30 signal Sh_S: std_logic;
  signal Sel_S: std_logic_vector(S_W-1 downto 0);
  signal Coun_O: std_logic_vector(S_W downto 0);
  signal Coun_D: std_logic;
  signal Q_N_Reset: std_logic;
35 signal HL: std_logic;
  signal LL: std_logic;
  Component Shifter_Dx1
     generic ( D: integer);
     port ( Reset: in std_logic;
            Clk: in std_logic;
            S_1 , S_0: in std_logic;
I_L , I_R: in std_logic;
            Q: out std_logic_vector(D-1 downto 0));
     end Component;
  Component Sel_W
     generic (W, S_W: integer);
     -- W: Number of 1 Bit inputs joined in a std_logic_vector;
     -- S_W: Number of Select bits ( must be >= lb (W) );
     port ( I: in std_logic_vector(W-1 downto 0);
            S: in std_logic_vector(S_W-1 downto 0);
            O: out std_logic);
     end Component;
  Component Counter_W
     generic (W: integer);
     port ( Reset, Set: in std_logic;
            Clk: in std_logic;
            D: in std_logic;
            E: in std_logic;
            C: out std_logic_vector(W-1 downto 0));
     end Component;
```

```
65 begin
   Shifters: for i in Q-W-1 downto 0 generate
                 Sh: Shifter Dx1
                      \texttt{generic} \quad \texttt{map} \ (\texttt{D}\!\!=\!\!>\!\! \texttt{Q}\_\!\texttt{D})
                      port map ( Reset=>Q_N_Reset,
70
                                   Clk=>Clock,
S_1=>Sh_S, S_0=>Sh_S,
I_L=>Q_In (i), I_R=>LL,
                                   Q=>Q_Sh(i)(Q_D-1 \text{ downto } 0);
                 Twist: for j in Q_D-1 downto 0 generate
75
                                   Q(j)(i) \le Q_Sh(i)(Q_D-1-j);
                          end generate;
               end generate;
80
   Selectors: for i in Q_W-1 downto 0 generate
                   Transp: for j in Q_D-1 downto 0 generate
                                   Q_Sel(i)(j) <= Q(j)(i);
                            end generate;
                   Sel: Sel_W
85
                       generic map (W=>Q_D, S_W=>S_W)
                      port map ( I=>Q_Sel(i)(Q_D-1 downto 0),
                                   S=>Sel_S, O=>Q_Out(i);
                end generate;
90
   Counter: Counter_W
                generic map (W=>S_W+1)
                port map ( Reset=>HL, Set=>Q_N_Reset,
                             Clk=>Clock,
                             D=>Coun_D , E=>Q_En , C=>Coun_O );
95
   Sh\_S \le (Q\_Wr \text{ and } Q\_En);
   Sel_S \le Coun_O(S_W-1 \text{ downto } 0);
100 Coun_D \leq ((not Q_Wr) and Q_En);
   \mathrm{HL} <= '1';
   LL <= '0';
105 Q_N_Reset <= not Q_Reset;
   Q_Empty <= '1' when (Coun_O = conv_std_logic_vector(-1, Coun_O'length))
                     else '0';
   Q_Full <= '1' when (Coun_O = conv_std_logic_vector(Q_D-1, Coun_O'length))
110
                     else '0';
```

end Queue\_arch;

# C.3.17 FSM Controlling Datapath B

```
\label{thm:condition} \begin{tabular}{ll} WHDL/double HS\_queued\_SMs/FSM\_dHS\_q\_clock/FSM\_and\_D\_dHS\_q\_clock\_ex\_B.vhd \\ \end{tabular}
                    FSM\_and\_D\_dHS\_q\_clock\_ex\_B. vhd
  -- File:
                    bounding of FSM B and D at a path B
  -- implements:
  -- of Model:
                    FSM and separate Datapath
  -- for Example: FSM, DP(A) --> FSM, DP(Queue) --> FSM, DP(B)
                    using double handshake protocol
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE. std_logic_arith . all;
  Entity FSM_and_D_B_dHS_q_clock_ex is
     port (Reset, Start:
                                 in std_logic;
            Clock_B:
                                 in std_logic;
            QC_Ready_B, TB_Ackn_B: in std_logic;
            B_Rd_Req_QC, B_Ready_TB: out std_logic;
1.5
                                 in std_logic_vector (31 downto 0);
            Buffer\_QB\_Stop:
                                       in std_logic;
            Out1_Port:
                                 out std_logic_vector (31 downto 0);
            Out1_Stop_Port:
                                 out std_logic);
20 end FSM_and_D_B_dHS_q_clock_ex;
  Architecture FSM_and_D_B_dHS_q_clock_ex_behavioral of FSM_and_D_B_dHS_q_clock_ex is
  -- outputs
25 signal Out1: std_logic_vector (31 downto 0);
  signal Out1_Stop: std_logic;
  signal DP_Reset:
                        std_logic;
  signal ld_A , ld_B , ld_Stop : std_logic ;
30 signal StopMsg: std_logic;
  signal ALU_M: std_logic_vector (1 downto 0);
  signal ld_O: std_logic;
  signal CMP: std_logic_vector(1 downto 0);
  signal MUX_sel: std_logic;
  Component DP
     port (Clock:
                             in std_logic;
            DP_Reset:
                             in std_logic;
            ld_A , ld_B , ld_Stop: in std_logic;
                              out std_logic;
40
            StopMsg:
            ALU_M:
                             in std_logic_vector (1 downto 0);
            ld_O :
                             in std_logic;
            In 1, In 2:
                             in std_logic_vector(31 downto 0);
            In_Stop:
                             in std_logic;
            O_Port:
                             out std_logic_vector (31 downto 0);
            Out_Stop:
                             out std_logic);
  end Component;
  Component FSM_B_dHS_q_clock_ex
     port (Reset, Start:
                                    in std_logic;
                                    in std_logic;
            Clock_B:
            QC_Ready_B, TB_Ackn_B: in std_logic;
            B_Rd_Req_QC, B_Ready_TB: out std_logic;
            DP_Reset:
                                    out std_logic;
55
            ld_A , ld_B , ld_Stop: out std_logic;
            StopMsg:
                                    in std_logic;
            ALU_M:
                                    out std_logic_vector (1 downto 0);
            ld_O:
                                    out std_logic);
  end Component;
  begin -- Architecture
  DP_B_I: DP
       Port Map (Clock=>Clock_B, DP_Reset=>DP_Reset,
```

```
\label{eq:ld_A} $$ $ \operatorname{Id_A} = \operatorname{Id_A} , $ \operatorname{Id_B} = \operatorname{Id_B} , $ \operatorname{Id_Stop} = \operatorname{Id_Stop} , $$
                        StopMsg => StopMsg \,, \quad ALU\_M => ALU\_M \,, \quad ld\_O => ld\_O \,,
65
                        In1=>Buffer_QB, In2=>Buffer_QB, In_Stop=>Buffer_QB_Stop,
                        O_Port=>Out1, Out_Stop=>Out1_Stop);
70 FSM_B_dHS_q\_clock_I: FSM_B_dHS_q\_clock\_ex
          Port Map (Reset=>Reset, Start=>Start, Clock_B=>Clock_B,
                         QC_Ready_B = QC_Ready_B, TB_Ackn_B = TB_Ackn_B,
                         B_Rd_Req_QC => B_Rd_Req_QC , \quad B_Ready_TB => B_Ready_TB ,
                         DP\_Reset => DP\_Reset ,
75
                         ld_A => ld_A, ld_B => ld_B, ld_S top => ld_S top,
                        StopMsg=>StopMsg, ALU_M=>ALU_M, ld_O=>ld_O);
   -- Entity Outputs
80 Out1_Port <= Out1;</pre>
   Out1_Stop_Port <= Out1_Stop;
   end FSM_and_D_B_dHS_q_clock_ex_behavioral;
```

## C.3.18 FSM of B

```
\label{thm:local_vhole} \begin{tabular}{ll} WHDL/double HS\_queued\_SMs/FSM\_dHS\_q\_clock/FSM\_dHS\_q\_clock\_ex\_B. vhd \\ \end{tabular}
  -- File:
                    FSM\_dHS\_q\_clock\_ex\_B . vhd
  -- implements:
                    FSM B
  -- of Model:
                    FSM and separate Datapath
  -- for Example: FSM, DP (A) --> FSM, DP (Queue) --> FSM, DP (B)
                    using double handshake protocol
5 - -
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE. std_logic_arith . all;
  Entity FSM_B_dHS_q_clock_ex is
     port (Reset, Start:
                                  in std_logic;
                                 in std_logic;
            Clock_B:
            QC_Ready_B, TB_Ackn_B: in std_logic;
            B_Rd_Req_QC, B_Ready_TB: out std_logic;
1.5
                                     out std_logic;
            ld_A , ld_B , ld_Stop: out std_logic;
            StopMsg:
                                     in std_logic;
            ALU_M:
                                     out std_logic_vector (1 downto 0);
            ld_O:
                                     out std_logic);
20
  end FSM_B_dHS_q_clock_ex;
  Architecture FSM_B_dHS_q_clock_ex_behavioral of FSM_B_dHS_q_clock_ex is
  type State_Set is (S_BEGIN, S_1, S_2, S_3, S_4, S_5, S_6, S_END);
  signal next_STATE, STATE: State_Set;
  begin -- Architecture
  transition: Process
  begin
    wait until (Clock_B'event and Clock_B = '1');
    STATE \le next\_STATE;
35 end process;
  behavior1: process(STATE, Start, QC_Ready_B, TB_Ackn_B, StopMsg)
  begin
      if (Reset = '1') then
            DP_Reset <= '1';
            B_Rd_Req_QC <= '0';
            B_Ready_TB <= '0';
                        <= '0';
            ld_A
                        <= , 0 ;
            ld_B
                        <= '0';
            ld_Stop
45
                        <= "--";
            ALU_M
            ld_O
                        <= '0';
         next_STATE <= S_BEGIN;
       else
         case next_STATE is
           when S_BEGIN => DP_Reset
                                        <= '1';
                             B_Rd_Req_QC <= '0';
                             B_Ready_TB <= '0';
                                         <= '0';
55
                             ld_A
                             ld_B
                                         <= '0';
                             ld_Stop
                                         <= '0';
                                         <= "--":
                             ALU_M
                             ld_O
                                         <= '0':
60
                             if Start = '1'
                               then next_STATE \leq S_1;
                               else next_STATE <= S_BEGIN;</pre>
```

```
end if;
65
            when S_1 =>
                             DP_Reset
                                        <= '0';
                             B_Rd_Req_QC \le '1';
                             B_Ready_TB <= '0';
                                         <= '1';
                             ld_A
                             ld_B
                                         <= '1';
70
                                         <= '1'; <= "--";
                             ld_Stop
                             ALU_M
                             ld_O
                                         <= 0.75
75
                            if (QC_Ready_B='1')
                              then next_STATE \leq S_2;
                              else next_STATE <= S_1;
                            end if;
            when S_2 =>
                            DP_Reset <= '0';
80
                            B_Rd_Req_QC \le '0';
                            B_Ready_TB <= '0';
                            ld_A
                                        <= '0';
                            ld\_B
                                        <=\ ,0\ ,;
                            ld_Stop
                                        <= '0';
85
                                        <= "10";
                            ALU_M
                                        <= '0';
                            ld_O
                            if (QC_Ready_B='0')
                              then next STATE \le S_3;
90
                              else next_STATE <= S_2;
                            end if;
            when S_3 =>
                            DP_Reset <= '0';
                            B_Rd_Req_QC <= '0';
B_Ready_TB <= '0';
95
                                        <= '0';
                            ld_A
                                        <= '0';
                            ld_B
                            ld\_Stop
                                        <= '0';
100
                            ALU_M
                                        <= "10";
                                        <= '1';
                            ld_O
                            next\_STATE <= S\_4;
                            DP_Reset <= '0';
105
            when S_4 =>
                            B_Rd_Req_QC <= '0';
                            B_Ready_TB \le 0;
                                       <= '0';
                            ld_A
                                        <= ', 0';
                            ld\_B
                                        <= ',0';
<= "--";
110
                            ld_Stop
                            ALU_M
                                        <= '0';
                            ld_O
                            next\_STATE \le S\_5;
115
                            DP_Reset <= '0';
            when S_5 =>
                            B_Rd_Req_QC \le '0';
                            B_Ready_TB <= '1';
                                       <= '0';
                            ld_A
                                        <= '0';
                            ld_B
120
                            ld_Stop
                                        <= '0';
                                        <= "--";
                            ALU_M
                                        <= '0';
                            ld_O
                            if (TB_Ackn_B='1')
125
                              then next_STATE<=S_6;
                              else next\_STATE \le 5;
                            end if;
```

```
DP_Reset <= '0';
B_Rd_Req_QC <= '0';
B_Ready_TB <= '0';
Id_A <= '0';
Id_B <= '0';
130
                 when S_6 =>
                                                       <= '0';
<= "--";
                                      \operatorname{ld} \_ \operatorname{Stop}
135
                                      ALU_M
                                      ld_O
                                                       <= '0';
                                       if (TB_Ackn_B='0')
                                         then if (StopMsg='1')
then next_STATE<=S_END;
140
                                                     else next\_STATE \le S_1;
                                                 end if;
                                          else next_STATE<=S_6;</pre>
                                      end if;
145
                 when S_END=> -- nothing (Procedure quits)
             end case;
            end if;
150
    end process;
    \mathbf{end} \ \mathrm{FSM\_B\_dHS\_q\_clock\_ex\_behavioral};
```

# C.3.19 Datapath of B identical to A (see A)

The datapath for the receiver B is identical to the datapath of the sender  $A. \rightarrow See$  datapath for sender A.

# C.4 Transfer via Memory with Double Handshake

- A (FSM Controlling Datapath)  $\rightarrow$  Memory || Arbiter (FSM)  $\rightarrow$  B (FSM Controlling Datapath)
  - FSM Controlling Datapath A
    - \* FSM of A
    - \* Datapath of A (identical B)
    - Memory
    - Arbiter (FSM)
    - FSM Controlling Datapath B
      - \* FSM of B
      - \* Datapath of B identical to A (see A)

# C.4.1 A (FSM Controlling Datapath) $\rightarrow$ Memory || Arbiter (FSM) $\rightarrow$ B (FSM Controlling Datapath)

```
\label{lem:clock} \mbox{WHDL/doubleHS\_mem\_SMs/FSM\_dHS\_mem\_clock/FSM\_and\_D\_dHS\_mem\_clock\_ex\_top.vhd}
  -- File:
                   FSM\_and\_D\_dHS\_mem\_clock\_ex\_top. vhd
  -- bounding the three parts of the example
  -- for Example: FSM, DP(A) --> Memory --> FSM, <math>DP(B)
                           <--> FSM <-->
                   using double handshake protocol
5 - -
  library IEEE;
  use IEEE.std_logic_1164.all;
  use IEEE.std_logic_arith.all;
  entity FSM_and_D_dHS_mem_clock_ex_top is
     port (Reset, A_Start, B_Start:
                                            in std_logic;
            Clock_A , Clock_B , Clock_M :
                                          in std_logic;
            TB_Ready_A, TB_Ackn_B: in std_logic;
            A_Ackn_TB, B_Ready_TB: out std_logic;
15
            A_Addr_In , B_Addr_In : in std_logic_vector (3 downto 0);
            In 1 In 2:
                                 in std_logic_vector(31 downto 0);
            In_Stop:
                                 in std_logic;
            Out1_Port:
                                out std_logic_vector (31 downto 0);
            Out1_Stop_Port:
                                out std_logic);
20
  end FSM_and_D_dHS_mem_clock_ex_top;
  architecture FSM_and_D_dHS_mem_clock_ex_top_arch of FSM_and_D_dHS_mem_clock_ex_top is
25 Component FSM_and_D_A_dHS_mem_clock_ex
     port (Reset, A_Start:
                                   in std_logic;
                                in std_logic;
            Clock_A:
            TB_Ready_A, AC_Grant_A: in std_logic;
            A_Ackn_TB, A_Req_AC: out std_logic;
            A_nCS, A_nWE, A_nOE: out std_logic;
30
            Addr_In:
                                 in std_logic_vector(3 downto 0);
            Addr_Out:
                                 out std_logic_vector (3 downto 0);
            In1, In2:
                                 in std_logic_vector(31 downto 0);
                                 in std_logic;
            In_Stop:
            Data_AM_and_Stop_Port: out std_logic_vector(32 downto 0));
  end Component;
  Component Mem_Grant_Control
     Port ( Clock_M: in std_logic;
             AC_Reset: in std_logic;
40
             AC_Start: in std_logic;
             A_Req_AC, B_Req_AC: in std_logic;
             AC_Grant_A , AC_Grant_B: out std_logic );
  end Component;
  Component Memory_DxW
     generic (D, W, A_W: integer);
     Port (nCS, nWE, nOE: in std_logic;
            Addr: in std_logic_vector (A_W-1 downto 0);
            Data: inout std_logic_vector(W-1 downto 0));
  end Component;
  Component FSM_and_D_B_dHS_mem_clock_ex
     port (Reset, B_Start:
                                   in std_logic;
            Clock_B:
                                in std_logic;
            AC_Grant_B, TB_Ackn_B: in std_logic;
            B_Req_AC, B_Ready_TB: out std_logic;
            B_nCS, B_nWE, B_nOE: out std_logic;
            Addr_In:
                                 in std_logic_vector(3 downto 0);
60
            Addr_Out:
                                 out std_logic_vector (3 downto 0);
            Data_MB_and_Stop_Port: in std_logic_vector (32 downto 0);
            Out1_Port:
                                out std_logic_vector (31 downto 0);
```

```
Out1_Stop_Port:
                                  out std_logic);
   end Component;
65
   signal A_Req_AC, B_Req_AC: std_logic;
   signal AC_Grant_A, AC_Grant_B: std_logic;
   signal Data_and_Stop: std_logic_vector(32 downto 0);
   signal Addr: std_logic_vector(3 downto 0);
70 signal nCS, A_nCS, B_nCS, nWE, nOE: std_logic;
   begin
   FSM\_and\_D\_A\_dHS\_mem\_clock\_I: FSM\_and\_D\_A\_dHS\_mem\_clock\_ex
        port map ( Reset=>Reset, A_Start=>A_Start, Clock_A=>Clock_A,
             TB_Ready_A=>TB_Ready_A, AC_Grant_A=>AC_Grant_A,
             A_Ackn_TB = > A_Ackn_TB, A_Req_AC = > A_Req_AC,
             A\_nCS=>A\_nCS\;,\;\;A\_nWE=>nWE,\;\;A\_nOE=>nOE\;,
             A ddr_{In} = > A A ddr_{In}
             Addr_Out = > Addr,
             In 1 = > In 1, In 2 = > In 2,
             In_Stop=>In_Stop,
             Data_AM_and_Stop_Port=>Data_and_Stop);
85 Mem_FSM_dHS_clock_I: Mem_Grant_Control
      Port map ( Clock_M=>Clock_M,
              AC_Reset=>Reset,
              AC_Start => A_Start,
              A_Req_AC=>A_Req_AC, B_Req_AC=>B_Req_AC,
              AC_Grant_A=>AC_Grant_A, AC_Grant_B=>AC_Grant_B);
90
   Memory_I: Memory_DxW
      generic map(D=>16, W=>33, A_W=>4)
      Port map (nCS=>nCS, nWE=>nWE, nOE=>nOE,
             Addr=>Addr,
             Data=>Data_and_Stop);
   FSM\_and\_D\_B\_dHS\_mem\_clock\_I: FSM\_and\_D\_B\_dHS\_mem\_clock\_ex
        port map ( Reset=>Reset, B_Start=>B_Start, Clock_B=>Clock_B,
             AC_Grant_B=>AC_Grant_B, TB_Ackn_B=>TB_Ackn_B,
100
             B_Req_AC=>B_Req_AC, B_Ready_TB=>B_Ready_TB,
             B_nCS=>B_nCS, B_nWE=>nWE, B_nOE=>nOE,
             Addr_{In} = > B_Addr_{In}
             Addr_Out = > Addr,
             \label{eq:def-def} {\tt Data\_MB\_and\_Stop\_Port=>} {\tt Data\_and\_Stop} \;,
105
             Out1_Port=>Out1_Port,
             Out1_Stop_Port=>Out1_Stop_Port );
   nCS \le A_nCS and B_nCS;
110
   end FSM_and_D_dHS_mem_clock_ex_top_arch;
```

# C.4.2 FSM Controlling Datapath A

```
VHDL/doubleHS_mem_SMs/FSM_dHS_mem_clock/FSM_and_D_dHS_mem_clock_ex_A.vhd
  -- File:
                   FSM_and_D_dHS_mem_clock_ex_A. vhd
  -- implements:
                   bounding of FSM A and Datapath A
  -- of Model:
                   FSM and separate Datapath (FSM+D)
  -- for Example: FSM, DP(A) --> Memory --> FSM, DP(B)
                           <--> FSM <-->
5 --
                   using double handshake protocol
  library IEEE;
  use IEEE. std_logic_1164.all;
10 use IEEE.std_logic_arith.all;
  Entity FSM_and_D_A_dHS_mem_clock_ex is
     port (Reset, A_Start:
                                 in std_logic;
           Clock_A:
                               in std_logic;
           TB_Ready_A, AC_Grant_A: in std_logic;
1.5
           A_Ackn_TB, A_Req_AC: out std_logic;
           A_nCS, A_nWE, A_nOE: out std_logic;
                                in std_logic_vector(3 downto 0);
           Addr_In:
           Addr_Out:
                                out std_logic_vector (3 downto 0);
           In 1 . In 2 :
                                in std_logic_vector(31 downto 0);
20
           In_Stop:
                                in std_logic;
           Data_AM_and_Stop_Port: out std_logic_vector(32 downto 0));
  end FSM_and_D_A_dHS_mem_clock_ex;
  Architecture FSM_and_D_A_dHS_mem_clock_ex_structural of FSM_and_D_A_dHS_mem_clock_ex is
  -- outputs
  signal Data_AM: std_logic_vector (31 downto 0);
  signal Data_AM_Stop: std_logic;
  signal DP_Reset:
                      std_logic;
  signal ld_A, ld_B, ld_Stop: std_logic;
  signal StopMsg: std_logic;
  signal ALU_M: std_logic_vector (1 downto 0);
35 signal ld_O , A_tri_en : std_logic ;
  signal A_ld_Addr, A_inc_Addr: std_logic;
  signal CMP: std_logic_vector(1 downto 0);
  signal MUX_sel: std_logic;
40 Component DP
     port (Clock:
                            in std_logic;
           DP_Reset:
                            in std_logic;
           ld_A , ld_B , ld_Stop: in std_logic;
           StopMsg:
                            out std_logic;
           ALU_M:
                            in std_logic_vector (1 downto 0);
45
           ld_O, tri_en:
                            in std_logic;
           ld_Addr, inc_Addr: in std_logic;
                            in std_logic_vector(3 downto 0);
           Addr_In:
           Addr_Out_Port:
                            out std_logic_vector(3 downto 0);
           In1, In2:
                            in std_logic_vector(31 downto 0);
           In_Stop:
                            in std_logic;
           O_Port:
                            out std_logic_vector (31 downto 0);
           Out_Stop:
                            out std_logic);
  end Component;
  {\tt Component FSM\_A\_dHS\_mem\_clock\_ex}
     port (Reset, A_Start:
                                    in std_logic;
           Clock\_A:
                                  in std_logic;
           TB_Ready_A, AC_Grant_A: in std_logic;
60
           A_Ackn_TB, A_Req_AC: out std_logic;
                                  out std_logic;
           A_nCS, A_nWE, A_nOE:
           DP_Reset:
                                  out std_logic;
           ld_A , ld_B , ld_Stop: out std_logic;
```

```
StopMsg:
                                                    in std_logic;
                 ALU_M:
                                                    out std_logic_vector (1 downto 0);
65
                  ld_O , A_tri_en:
                                                    out std_logic;
                  A_ld_Addr, A_inc_Addr:
                                                         out std_logic);
   end Component:
70 begin -- Architecture
   DP_A_I: DP
           Port Map (Clock=>Clock_A, DP_Reset=>DP_Reset,
                           ld_A => ld_A, ld_B => ld_B, ld_S top => ld_S top,
                           StopMsg=>StopMsg, ALU_M=>ALU_M, ld_O=>ld_O, tri_en=>A_tri_en,
                           \label{eq:ddr} \begin{array}{ll} \operatorname{Id} \operatorname{A} \operatorname{d} \operatorname{dr} => & \operatorname{A} \operatorname{Id} \operatorname{A} \operatorname{d} \operatorname{dr} \;, & \operatorname{in} \operatorname{c} \operatorname{A} \operatorname{d} \operatorname{dr} => & \operatorname{A} \operatorname{in} \operatorname{c} \operatorname{A} \operatorname{d} \operatorname{dr} \;, \end{array}
75
                           Addr_In=>Addr_In, Addr_Out_Port=>Addr_Out,
                           In1=>In1, In2=>In2,
                           In_Stop=>In_Stop,
O_Port=>Data_AM, Out_Stop=>Data_AM_Stop);
   FSM\_A\_dHS\_mem\_clock\_I: \ FSM\_A\_dHS\_mem\_clock\_ex
           \label{eq:continuous_continuous_relation} Port \ Map \ ( \ Reset => Reset \ , \ A \_S \ tart => A \_S \ tart \ ,
                           C lock_A = > C lock_A,
                           TB\_Ready\_A => TB\_Ready\_A\;, \quad AC\_Grant\_A => AC\_Grant\_A\;,
                           A\_A\,ckn\_TB => A\_A\,ckn\_TB\;,\quad A\_Req\_AC => A\_Req\_AC\;,
85
                           A_nCS=>A_nCS, A_nWE=>A_nWE, A_nOE=>A_nOE,
                           DP_Reset=>DP_Reset,
                           ld_A => ld_A, ld_B => ld_B, ld_S top => ld_S top,
                           \label{eq:stopMsg} $$ StopMsg, ALU_M=>ALU_M, Id_O=>Id_O, A_tri_en=>A_tri_en, $$
                           A_ld_Addr => A_ld_Addr, A_inc_Addr => A_inc_Addr);
90
   -- Entity Outputs
   Data_AM_and_Stop_Port <= Data_AM_Stop & Data_AM;
95 end FSM_and_D_A_dHS_mem_clock_ex_structural;
```

## C.4.3 FSM of A

```
VHDL/doubleHS_mem_SMs/FSM_dHS_mem_clock/FSM_dHS_mem_clock_ex_A.vhd
  -- File:
                   FSM\_dHS\_mem\_clock\_ex\_A . vhd
  -- implements:
                   FSM A
  -- of Model:
                   FSM and separate Datapath (FSM + D)
  -- for Example: FSM, DP(A) --> Memory --> FSM, DP(B)
                           <--> FSM <-->
                   using double handshake protocol
  library IEEE;
  use IEEE. std_logic_1164.all;
10 use IEEE.std_logic_arith.all;
  Entity FSM_A_dHS_mem_clock_ex is
     port (Reset, A_Start:
                                   in std_logic;
            Clock_A:
                                 in std_logic;
            TB_Ready_A, AC_Grant_A: in std_logic;
1.5
            A_Ackn_TB, A_Req_AC: out std_logic;
           A_nCS, A_nWE, A_nOE:
                                     out std_logic;
                                   out std_logic;
            DP_Reset:
           ld_A , ld_B , ld_Stop :
                                   out std_logic;
           StopMsg:
                                   in std_logic;
20
           ALU_M:
                                   out std_logic_vector (1 downto 0);
           ld_O, A_tri_{en}:
                                   out std_logic;
            A_ld_Addr, A_inc_Addr: out std_logic);
  end FSM_A_dHS_mem_clock_ex;
  Architecture FSM_A_dHS_mem_clock_ex_behavioral of FSM_A_dHS_mem_clock_ex is
  type State_Set is (S_BEGIN, S_1, S_2, S_3, S_4, S_5, S_6, S_7, S_8, S_9, S_END);
30 signal next_STATE, STATE: State_Set;
  begin -- Architecture
  transition: Process
35 begin
    wait until (Clock_A'event and Clock_A = '1');
    STATE <= next_STATE;
  end process;
40 behavior1: process(STATE, A_Start, TB_Ready_A, AC_Grant_A, StopMsg)
  begin
      if (Reset = '1') then
           DP_Reset  <= '1';
            A_Ackn_TB <= '0';
           A Req AC  <= '0';
45
           ld_A
                      <= '0';
            ld_B
                      <= '0';
                      <= '0';
            ld_Stop
           ALU_M
                      <= "--";
                      <= '0';
           ld_O
            A_ld_Addr <= '0';
            A = i n c = A d d r < = '0';
           A_tri_en <= 0;
            A_nCS <= '1';
           A_nWE <= 'Z';
55
           A_nOE <= 'Z';
        next_STATE <= S_BEGIN;
      else
        case STATE is
60
          when S_BEGIN => DP_Reset <= '1';
                           A\_Ackn\_TB <= '0';
                           A_Req_AC <= '0';
```

```
ld_A
                                            <= '0';
                                             <= 0,0,
                                 ld_B
65
                                 ld_Stop
                                            <= '0';
                                             <= "--";
                                 ALU_M
                                 ld_O
                                            <= '0';
                                 A_ld_Addr <= '0';
                                 A = i n c = A d d r < = '0';
70
                                 A_{\text{trien}} <= '0';
                                 A_nCS <= '1';
A_nWE <= 'Z';
                                 A_nOE <= 'Z';
75
                                if A\_Start = '1'
                                  then next_STATE <= S_1;
                                  else next_STATE <= S_BEGIN;</pre>
                                end if;
80
             when S_1 =>
                                 DP\_Reset <= '0';
                                 A\_Ackn\_TB <= '0';
                                 A_R = q_A C \quad <= \ '1';
                                            <= '0';
                                 ld_A
                                 ld_B
                                            <= '0';
85
                                            <= '0';
                                 ld_Stop
                                            <= "--";
                                 ALU_M
                                            <= '0';
                                 ld_O
                                 A_ld_Addr <= '1';
                                 A = i n c = A d d r < = '0';
90
                                 A_tri_en <= '0';
                                 A_nCS <= ',1';
A_nWE <= ',Z';
                                 A_nOE <= 'Z';
95
                                if (AC_Grant_A = '1')
                                  then next_STATE \leq S_2;
                                  else next\_STATE \le S_1;
                                end if;
100
                                 \mathrm{DP} \text{-}\mathrm{Reset} \quad <= \ '0';
             when S_2 =>
                                 A\_Ackn\_TB <= '0';
                                 A_Req_AC <= '1';
                                 ld_A
                                          <= '0';
                                            <= '0';
105
                                 ld_B
                                 ld_Stop
                                            <= 0, 0, 0, 0
                                            <= "--";
                                 ALU_M
                                 ld_O
                                            <= '0';
                                 A_ld_Addr <= '0';
                                 A = i \, n \, c = A \, d \, d \, r < = '0';
110
                                 A \perp tri en <= '1';
                                A_nCS <= '0';
A_nWE <= '1';
A_nOE <= '1';
115
                                if (TB_Ready_A='1')
                                  then next_STATE<= S_3;
                                  else next_STATE<= S_2;
                                end if;
120
              when S_3 =>
                                 DP_Reset <= '0';
                                 A_Ackn_TB <= '1';
                                 A_Req_AC <= '1';
                                            <= '1';
                                 ld_A
                                             <= '1';
                                 ld_B
125
                                 ld_Stop
                                            <= ',1';
                                             <= "11";
                                 ALU_M
                                 ld_O
                                            <= '0';
                                 A_ld_Addr <= '0';
```

```
A = i n c = A d d r < = '0';
130
                                   A _tri_en <= '1';
                                  A_nCS <= '0';
A_nWE <= '1';
A_nOE <= '1';
                                 if (TB_Ready_A='0')
                                    then next_STATE\leq= S_4;
                                    else next_STATE<= S_3;
                                 end if;
140
                                  \mathrm{DP} \_\mathrm{Reset} \quad <= \ '0';
               when S_4 =>
                                  A\_Ackn\_TB <= '0';
                                  A_Req_AC <= '1';
                                               <= '0';
                                  ld_A
                                               <= '0';
                                  ld_B
145
                                  ld_Stop
                                              <= '0';
                                               <= "11";
                                  ALU_M
                                              <= '1';
                                  ld_O
                                  A_ld_Addr <= '0';
                                  A_i n c_A d d r <= '0';
150
                                   A = tri = en < = '1';
                                  A_nCS <= ',0';
A_nWE <= ',1';
                                  A_nOE <= '1';
155
                                 next_STATE \le S_5;
               when S_5 =>
                                  DP\_Reset <= '0';
                                  A_Ackn_TB <= '0';
                                  A \operatorname{Req} AC \quad <= \ '1';
160
                                              <= '0';
                                  ld_A
                                               <= '0';
                                  ld_B
                                  ld_Stop
                                              <= '0';
                                               <= "--";
                                  ALU_M
                                  ld_O
                                              <= '0';
165
                                   A_ld_Addr <= '0';
                                  A = i n c = A d d r < = '0';
                                  A = t r i = e n <= '1';
                                  A_nCS <= '0';
A_nWE <= '0';
A_nOE <= '1';
170
                                 next\_STATE \le S\_6;
               when S_6 =>
                                  \mathrm{DP\_Reset} \quad <= \ '0';
175
                                  A_Ackn_TB <= '0';
                                  A_Req_AC <= '1';
                                               <= '0';
                                  ld_A
                                  ld_B
                                               <= 0,0,0;
                                  ld_Stop
180
                                              <= '0';
                                               <= "--":
                                  ALU_M
                                              <= '0';
                                  ld_O
                                   A_ld_Addr <= '0';
                                   A = inc = A ddr < = '0';
185
                                   A _{tri_en} <= '1';
                                  A_nCS <= '0';
A_nWE <= '0';
                                  A_nOE <= '1';
                                 next\_STATE \le S\_7;
190
               when S_7 = >
                                  DP_Reset <= '0';
                                  A\_Ackn\_TB <= '0';
                                  A\_Req\_AC \quad <= \ '1';
                                          <= '0';
                                  ld_A
```

```
ld_B
                                             <= '0';
                                            <= '0';
                                 ld_Stop
                                 ALU_M
                                             <= "--";
                                            <= '0';
                                 ld_O
                                 A_ld_Addr <= '0';
200
                                 A = i n c = A d d r < = '0';
                                 A = tri = en < = '1';
                                 A_nCS <= '0';
A_nWE <= '1';
                                 A_nOE <= '1';
205
                                next\_STATE \le S\_8;
              when S_8 =>
                                 DP\_Reset <= '0';
                                 A\_Ackn\_TB <= '0';
210
                                 A_Req_AC <= '1';
                                 ld_A
                                            <= '0';
                                             <= '0';
                                 ld_B
                                 \operatorname{Id} \_{\operatorname{Stop}}
                                            <= '0';
                                             <= "--";
215
                                 ALU_M
                                             <= '0';
                                 ld_O
                                 A_ld_Addr <= '0';
                                 A = i n c = A d d r < = '1';
                                 A_{\text{trien}} <= '1';
                                 A_nCS <= ',0';
A_nWE <= ',1';
220
                                 A_nOE <= '1';
                                 if (StopMsg='1')
225
                                   then next_STATE \le S_9;
                                   else next_STATE<=S_2;
                                 end if;
              when S_9 =>
                                 DP\_Reset <= '0';
                                 A_Ackn_TB <= '0';
                                 A_Req_AC <= '0';
                                 ld_A
                                            <= '0';
                                             <= '0';
                                 ld_B
                                 ld_Stop
                                            <= '0';
                                             <= "--";
                                 ALU_M
235
                                 ld_O
                                             <= '0';
                                 A_ld_Addr <= '0';
                                 A = i n c = A d d r < = '0';
                                 A_tri_en <= '0';
                                A_nCS <= '1';
A_nWE <= 'Z';
A_nOE <= 'Z';
240
                                if (AC_Grant_A = '0')
                                  then next_STATE<=S_END;
245
                                  else next_STATE<=S_9;
                               end if;
              when S_END=> -- nothing
250
            end case;
         end if;
   end process;
255
   end FSM_A_dHS_mem_clock_ex_behavioral;
```

## C.4.4 Datapath of A (identical B)

```
VHDL/doubleHS_mem_SMs/Datapath/DP.vhd
  -- File:
                  DP. vhd
  -- implements:
                  Datapath (instanced twice: for A and for B)
  -- for Example: FSM, DP (A) --> Memory --> FSM, DP (B)
                           <--> FSM <-->
5 - -
                   using double handshake protocol
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE. std_logic_arith . all;
  Entity DP is
     port (Clock:
                            in std_logic;
           DP_Reset:
                            in std_logic;
           ld_A , ld_B , ld_Stop: in std_logic;
           StopMsg:
                            out std_logic;
1.5
           ALU_M:
                            in std_logic_vector (1 downto 0);
                            in std_logic;
           ld_O, tri_en:
           ld_Addr, inc_Addr: in std_logic;
           Addr_In:
                            in std_logic_vector(3 downto 0);
                            out std_logic_vector (3 downto 0);
           Addr_Out_Port:
20
           In 1, In 2:
                            in std_logic_vector(31 downto 0);
           In_Stop :
                            in std_logic;
           O_Port:
                            out std_logic_vector (31 downto 0);
           Out_Stop:
                            out std_logic);
25 end DP ;
  Architecture DP_schematic of DP is
     -- connection signals
     signal A, B, O: std_logic_vector(31 downto 0);
     signal ALU_Out:
                      std_logic_vector (31 downto 0);
                       std_logic;
     signal Stop:
     signal Addr_Out: std_logic_vector(3 downto 0);
     signal HL, LL:
                       std_logic;
35
     component Reg_32bit
        Port ( Clock:
                          in std_logic;
                Reset:
                          in std_logic;
                          in std_logic;
                Load:
                Data_In: in std_logic_vector (31 downto 0);
40
                Data_Out: out std_logic_vector (31 downto 0));
     end component;
     component Latch_impl
        Port ( Clock:
                           in std_logic;
45
                Reset:
                           in std_logic;
                Load:
                           in std_logic;
                Data_In:
                           in std_logic;
                Data_Out: out std_logic);
     end component;
     component Counter_4
        Port ( Clk: in std_logic;
                Reset: in std_logic;
55
                Ld: in std_logic;
                C_In: std_logic_vector(3 downto 0);
               D: in std_logic;
                En: in std_logic;
               C: out std_logic_vector(3 downto 0));
60
     end component;
     component ALU_32
        Port ( Mode:
                            in std_logic_vector (1 downto 0);
```

```
Data_In1:
                             in std_logic_vector (31 downto 0);
                 Data_In2:
                             in std_logic_vector (31 downto 0);
65
                 Data\_Out:
                             out std_logic_vector (31 downto 0));
      end component;
      component tri
         generic (W: integer);
70
         port (Enable:
                           in std_logic;
                           in std_logic_vector(W-1 downto 0);
                 Data_In:
                 Data_Out: out std_logic_vector (W-1 downto 0));
      end component;
75
      component tri_bit
         port (Enable:
                           in std_logic;
                 Data_In: in std_logic;
                 Data_Out: out std_logic );
      end component;
80
   begin
   Register_A: Reg_32bit
     Port Map ( Clock=>Clock, Reset=>DP_Reset, Load=>ld_A, Data_In=>In1, Data_Out=>A);
   Register_B: Reg_32bit
      Port Map ( Clock=>Clock, Reset=>DP_Reset, Load=>ld_B,
                  Data_In=>In2, Data_Out=>B);
   Latch_Stop: Latch_impl
      Port Map ( Clock=>Clock, Reset=>DP_Reset, Load=>ld_Stop,
                  Data_In=>In_Stop, Data_Out=>Stop);
   Addr_Counter: Counter_4
      Port Map ( Clk=>Clock, Reset=>DP_Reset, Ld=>ld_Addr,
                  C_In = A ddr_In, D = LL, E_In = inc_A ddr,
                  C=>Addr_Out);
100
   ALU: ALU_32
      Port Map ( Mode=>ALU_M, Data_In1=>A, Data_In2=>B,
                  Data_Out=>ALU_Out);
105 Register_O: Reg_32bit
      Port Map ( \bar{C}lock => Clock, Reset => DP_Reset, Load => ld_O,
                  Data_In=>ALU_Out, Data_Out=>O);
   O_tri: tri
      generic map ( ₩=>32 )
      Port map ( Enable => tri_en , Data_In=>O, Data_Out=>O_Port );
   Out_Stop_tri: tri_bit
      Port map ( Enable => tri_en , Data_In => Stop , Data_Out => Out_Stop );
   StopMsg_tri: tri_bit
      Port map ( Enable => tri_en , Data_In => Stop , Data_Out => StopMsg );
   Addr_Out_tri: tri
      generic map ( W=>4 )
120
      Port map ( Enable => tri_en , Data_In=>Addr_Out, Data_Out=>Addr_Out_Port );
   HL <= '1';
   LL <= '0';
125
   end DP_schematic:
```

## C.4.5 Memory

```
VHDL/doubleHS_mem_SMs/FSM_dHS_mem_clock/Mem_ex.vhd
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE.std_logic_arith.all;
  use IEEE. std_logic_unsigned . all;
  -- File:
                   Mem\_ex.vhd
  --implements: Memory
  -- for Example: FSM, DP (A) --> Memory --> FSM, DP (B)
                            <--> FSM <-->
10 --
                    using \quad double \quad handshake \quad protocol
  entity Memory_DxW is
     generic (D, W, A_W: integer);
Port (nCS, nWE, nOE: in std_logic;
            Addr: in std_logic_vector (A_W-1 downto 0);
1.5
            Data: inout std_logic_vector(W-1 downto 0));
  end Memory_DxW; -- pragma template
  Architecture Memory_DxW_arch of Memory_DxW is
  type mem_array is ARRAY(D-1 downto 0) of std_logic_vector(W-1 downto 0);
  signal mem: mem_array;
25 begin
  read_write: process (nCS, nWE, nOE, Addr)
  begin
    if (nCS = '0') then
      if (nWE = '0') then
        mem(conv_integer(Addr)) <= Data;
       elsif (nOE = ', 0') then
        Data <= mem(conv_integer(Addr));
       else
        Data \le (others=>'Z');
35
      end if;
    end if:
  end process;
40 end Memory_DxW_arch;
```

#### C.4.6 Arbiter (FSM)

```
VHDL/doubleHS_mem_SMs/FSM_dHS_mem_clock/Mem_FSM_dHS_clock_ex.vhd
                     Mem\_FSM\_dHS\_clock\_ex. vhd
  -- File:
  -- implements: FSM for granting memory access
  -- for Example: FSM, DP (A) --> Memory --> FSM, DP (B)
                              <--> FSM <-->
 5 — —
                     using double handshake protocol
  library IEEE;
  use IEEE. std_logic_1164.all;
  use IEEE. std_logic_arith . all;
  Entity Mem_Grant_Control is
     Port ( Clock_M: in std_logic;
              AC_Reset: in std_logic;
AC_Start: in std_logic;
              A \_ Req \_ AC \; , \; \; B \_ Req \_ AC \; ; \quad \textbf{in} \quad std \_ logic \; ;
1.5
              AC_Grant_A , AC_Grant_B: out std_logic );
  end Mem_Grant_Control;
20 Architecture Mem_Grant_Control_beh of Mem_Grant_Control is
  type State_Set is (S_0, W, R);
  signal next_STATE, STATE: State_Set;
25 begin
  transition: Process
  begin
    wait until (Clock_M'event and Clock_M = '1');
   STATE <= next STATE;
  end process;
  Control: Process(STATE, AC_Start, A_Req_AC, B_Req_AC)
  begin
       if (AC_Reset = '1') then
35
         AC_Grant_A <= 0.7;
         AC_Grant_B \le 0;
         next\_STATE \le S\_0;
       else
         case STATE is
40
           when S_0 => AC_Grant_A <= '0';
                         AC_Grant_B <= '0';
                         if (A_Req_AC = '1')
                            then next_STATE<=W;
45
                          elsif (B_Req_AC = '1')
                           then next_STATE <= R;
                          else
                           next_STATE \le S_0;
                         end if;
           when W=>
                         AC_Grant_A <= '1';
                         AC_Grant_B \le 0;
                         if (A_Req_AC = '0')
55
                           then next_STATE <= S_0;
                          _{
m else}
                           next_STATE <= W;
                         end if:
60
           when R = >
                         \mathrm{AC\_G\,ran\,t\_A} \quad <= \ ^{,}0\ ^{,};
                         AC_Grant_B \le '1';
```

## C.4.7 FSM Controlling Datapath B

```
\label{lem:clock} \mbox{WHDL/double} \mbox{HS\_mem\_SMs/FSM\_dHS\_mem\_clock/FSM\_and\_D\_dHS\_mem\_clock\_ex\_B.vhd \mbox{WHDL/double} \mbox{Wholl} \mbox{Who
    -- File:
                                      FSM_and_D_dHS_mem_clock_ex_B. vhd
                                     bounding of FSM B and D at a path B
    -- implements:
    -- of Model:
                                     FSM and separate Datapath (FSM + D)
    -- for Example: FSM, DP(A) --> Memory --> FSM, DP(B)
                                                     <--> FSM <-->
 5 --
                                      using double handshake protocol
    library IEEE;
    use IEEE. std_logic_1164.all;
10 use IEEE. std_logic_arith . all;
    Entity FSM_and_D_B_dHS_mem_clock_ex is
           port (Reset, B_Start:
                                                                 in std_logic;
                                                              in std_logic;
                       Clock_B:
                       AC_Grant_B, TB_Ackn_B: in std_logic;
1.5
                       B_Req_AC, B_Ready_TB: out std_logic;
                       B_nCS, B_nWE, B_nOE: out std_logic;
                                                                in std_logic_vector(3 downto 0);
                       Addr_In:
                       Addr_Out:
                                                                out std_logic_vector (3 downto 0);
                       Data_MB_and_Stop_Port: in std_logic_vector (32 downto 0);
20
                       Out1_Port:
                                                              out std_logic_vector (31 downto 0);
                       Out1\_Stop\_Port:
                                                              out std_logic);
    end FSM_and_D_B_dHS_mem_clock_ex;
     Architecture FSM_and_D_B_dHS_mem_clock_ex_behavioral of FSM_and_D_B_dHS_mem_clock_ex is
    -- outputs
    signal Out1: std_logic_vector (31 downto 0);
    signal Out1_Stop: std_logic;
    signal Data_MB: std_logic_vector (31 downto 0);
    signal Data_MB_Stop: std_logic;
    signal DP_Reset:
                                             std_logic;
35 signal ld_A , ld_B , ld_Stop: std_logic;
    signal StopMsg: std_logic;
    signal ALU_M: std_logic_vector (1 downto 0);
    signal ld_O , B_tri_en : std_logic ;
    signal B_ld_Addr, B_inc_Addr: std_logic;
40 signal CMP: std_logic_vector(1 downto 0);
    signal MUX_sel: std_logic;
    Component DP
           port (Clock:
                                                        in std_logic;
                       DP_Reset:
                                                        in std_logic;
                       ld_A , ld_B , ld_Stop: in std_logic ;
                                                        out std_logic;
                       StopMsg:
                                                        in std_logic_vector (1 downto 0);
                       ALU M:
                                                        in std_logic;
                       ld_O , tri_en :
                       ld_Addr, inc_Addr: in std_logic;
                       Addr_In:
                                                        in std_logic_vector(3 downto 0);
                       Addr_Out_Port:
                                                        out std_logic_vector (3 downto 0);
                       In1, In2:
                                                        in std_logic_vector(31 downto 0);
                       In_Stop:
                                                        in std_logic;
55
                       O_Port:
                                                        out std_logic_vector (31 downto 0);
                       Out_Stop:
                                                        out std_logic);
    end Component;
    Component FSM_B_dHS_mem_clock_ex
          port (Reset, B_Start:
                                                                        in std_logic;
                       Clock_B:
                                                                    in std_logic;
                       AC_Grant_B, TB_Ackn_B: in std_logic;
                       B\_Req\_AC\;,\;\;B\_Ready\_TB:\;\;\mathbf{out}\;\;\mathbf{std\_logic}\;;
```

```
B_nCS, B_nWE, B_nOE:
                                     out std_logic;
             DP_Reset:
                                     out std_logic;
65
             ld_A , ld_B , ld_Stop :
                                     out std_logic;
             StopMsg:
                                     in std_logic;
             ALU_M:
                                     out std_logic_vector (1 downto 0);
             ld_O, B_tri_en:
                                     out std_logic;
             B_ld_Addr, B_inc_Addr: out std_logic);
70
   end Component;
   begin -- Architecture
   DP_B_I: DP
        Port Map (Clock=>Clock_B, DP_Reset=>DP_Reset,
                    ld_A = > ld_A, ld_B = > ld_B, ld_S top = > ld_S top,
                    StopMsg=>StopMsg, ALU_M=>ALU_M, ld_O=>ld_O, tri_en=>B_tri_en,
                    ld_Addr=>B_ld_Addr, inc_Addr=>B_inc_Addr,
                    A\,d\,dr\_In => A\,d\,dr\_In\;,\quad A\,d\,d\,r\_O\,u\,t\_P\,or\,t => A\,d\,d\,r\_O\,u\,t\;,
                    In1 = > Data_MB, In2 = > Data_MB,
80
                    In_Stop => Data_MB_Stop,
                    O_Port=>Out1, Out_Stop=>Out1_Stop);
   FSM\_B\_dHS\_mem\_clock\_I: \ FSM\_B\_dHS\_mem\_clock\_ex
        Port Map (Reset=>Reset, B_Start=>B_Start,
                    Clock_B=>Clock_B,
                    AC_Grant_B=>AC_Grant_B, TB_Ackn_B=>TB_Ackn_B,
                    B_Req_AC => B_Req_AC \;, \; \; B_Ready_TB => B_Ready_TB \;,
                    B_nCS=>B_nCS, B_nWE=>B_nWE, B_nOE=>B_nOE,
                    DP_Reset=>DP_Reset,
90
                    ld_A => ld_A, ld_B => ld_B, ld_S top => ld_S top,
                    StopMsg=>StopMsg, ALU_M=>ALU_M, ld_O=>ld_O, B_tri_en=>B_tri_en,
                    B_ld_Addr=>B_ld_Addr, B_inc_Addr=>B_inc_Addr);
95 -- Entity Outputs
   Out1_Port \le Out1;
   Out1_Stop_Port <= Out1_Stop;
   Data_MB <= Data_MB_and_Stop_Port(31 downto 0);
100 Data_MB_Stop <= Data_MB_and_Stop_Port(32);
   end FSM_and_D_B_dHS_mem_clock_ex_behavioral;
```

#### C.4.8 FSM of B

```
\label{thm:clock} VHDL/double HS\_mem\_SMs/FSM\_dHS\_mem\_clock/FSM\_dHS\_mem\_clock\_ex\_B.vhd
  -- File:
                    FSM\_dHS\_mem\_clock\_ex\_B. vhd
  -- implements:
                   FSM B
  -- of Model:
                   FSM and separate Datapath (FSM + D)
  -- for Example: FSM, DP(A) --> Memory --> FSM, DP(B)
                            <--> FSM <-->
                    using double handshake protocol
  library IEEE;
  use IEEE. std_logic_1164.all;
10 use IEEE. std_logic_arith . all;
  Entity FSM_B_dHS_mem_clock_ex is
     port (Reset, B_Start:
                                   in std_logic;
            Clock_B:
                                  in std_logic;
            AC_Grant_B, TB_Ackn_B: in std_logic;
1.5
            B_Req_AC, B_Ready_TB: out std_logic;
            B_nCS, B_nWE, B_nOE:
                                      out std_logic;
            DP_Reset:
                                    out std_logic;
            ld_A , ld_B , ld_Stop :
                                   out std_logic;
            StopMsg:
                                   in std_logic;
20
            ALU_M:
                                    out std_logic_vector (1 downto 0);
            ld_O, B_tri_en:
                                        out std_logic;
            B_ld_Addr, B_inc_Addr:
                                        out std_logic);
  end FSM_B_dHS_mem_clock_ex;
  Architecture FSM_B_dHS_mem_clock_ex_behavioral of FSM_B_dHS_mem_clock_ex is
  type State_Set is (S_BEGIN, S_1, S_2, S_3, S_4, S_5, S_6, S_7, S_8, S_9, S_10, S_END);
30 signal next_STATE, STATE: State_Set;
  begin -- Architecture
  transition: Process
35 begin
    wait until (Clock_B'event and Clock_B = '1');
    STATE <= next_STATE;
  end process;
40 behavior1: process (STATE, B_Start, AC_Grant_B, TB_Ackn_B, StopMsg)
  begin
      if (Reset = '1') then
            DP_Reset
                       <= '1';
                       <= '0';
            B_Req_AC
            B_Ready_TB <= '0';
45
                        <= '0';
            ld_A
            ld_B
                        <= '0';
                        <= '0';
            ld_Stop
            ALU_M
                       <= "--";
            ld_O
                        <= '0';
            B_ld_Addr <= '0';
            B_{inc}Addr <= '0';
            B_tri_en <= '0';
            B_nCS <= '1';
            \mathrm{B\_nWE} \ <= \ '\mathrm{Z} \ ';
55
            B_nOE <= 'Z';
        next_STATE <= S_BEGIN;
      else
        case STATE is
60
          when S_BEGIN => DP_Reset
                                        <= '1';
                            B_Req_AC
                                       <= '0';
                            B_Ready_TB <= '0';
```

```
ld_A
                                           <= '0';
                                           <= '0';
                               ld_B
65
                               ld_Stop
                                           <= '0';
                                           <= "--":
                               ALU_M
                               ld_O
                                           <= '0';
                               B_i n c_A d d r <= '0';
70
                               B_tri_en <= '0';
B_nCS <= '1';
B_nWE <= 'Z';
                               B_nOE <= 'Z';
75
                               if B\_Start = '1'
                                 then next_STATE <= S_1;
                                 else next_STATE <= S_BEGIN;</pre>
                               end if;
80
            when S_1 =>
                              \mathrm{DP} \_\mathrm{Reset}
                                          <= '0';
                                         <= '1';
                              B_Req_AC
                              B_Ready_TB <= '0';
                                          <= '0';
                              ld_A
                              ld_B
                                          <= '0';
85
                                          <= '0';
                              ld_Stop
                                          <= "--";
                              ALU_M
                                          <= '0';
                              ld_O
                              B_ld_Addr <= '1';
                              B_i nc_A ddr <= '0';
90
                              B_tri_en <= '0';
                             B_nCS <= '1';
B_nWE <= 'Z';
                             B_nOE <= 'Z';
95
                             if (AC_Grant_B='1')
                               then next_STATE \leq S_2:
                               else next\_STATE \le S\_1;
                            end if;
100
                                         <= '0';
                            DP_Reset
            when S_2 =>
                                        <= '1';
                             B_Req_AC
                             B_Ready_TB <= '0';
                             ld_A
                                       <= '0';
                                         <= '0';
105
                             ld_B
                            \operatorname{ld\_Stop}
                                         <= '0';
                                         <= "--";
                             ALU_M
                             ld_O
                                         <= '0';
                             B_ld_Addr <= '0';
                             B_i n c_A d dr <= '0';
110
                             B_tri_en <= '1';
                            B_nCS <= '0';
B_nWE <= '1';
B_nOE <= '1';
115
                            next_STATE \le S_3;
            when S_3 = >
                                         <= '0';
                            DP_Reset
                                        <= '1';
                             B_Req_AC
                             B_Ready_TB <= '0';
120
                                         <= '0';
                             ld_A
                                         <= '0';
                             ld_B
                             ld_Stop
                                         <= '0';
                                         <= "--";
                             ALU_M
                                         <= '0';
125
                             ld_O
                             B_1d_Addr <= '0';
                             B_{inc}Addr <= 0;
                             B_tri_en <= '1';
                             B_nCS <= '0';
```

```
B_nWE <= '1';
B_nOE <= '0';
130
                               next\_STATE \le S\_4;
                                          <= '0';
             when S_4 =>
                               DP_Reset
                               B_Req_AC
                                           <= '1';
                               B_Ready_TB \le 0;
                                            <= '1';
                               ld_A
                                            <= '1';
                               ld_B
                               ld_Stop
                                            <= '1';
140
                                            <= "10":
                               ALU_M
                                            <= '0';
                               ld_O
                               B_1d_Addr <= '0';
                               B_inc_Addr <= '0';
                               B_tri_en <= '1';
145
                              B_nCS <= '0';
B_nWE <= '1';
B_nOE <= '0';
                               next\_STATE \le S\_5;
150
                                          <= '0';
             when S_5 =>
                               DP_Reset
                               B_Req_AC <= '1';
                               B_Ready_TB \le 0;
                               ld_A
                                           <= '0';
155
                                            <= '0';
                               ld_B
                                            <= '0';
                               \operatorname{ld} \_S \operatorname{top}
                               ALU_M
                                            <= "10";
                                            <= '1';
                               ld_O
                               B_ld_Addr <= '0';
160
                               B_{inc}Addr <= '0';
                               B_tri_en <= '1';
                              B_nCS <= '0';
B_nWE <= '1';
B_nOE <= '1';
165
                               next\_STATE \le S\_6;
             when S_6 =>
                               DP_Reset
                                           <= '0';
                                           <= '1';
170
                               B_Req_AC
                               B_Ready_TB <= '0';
                                          <= '0';
                               ld_A
                                            <= '0';
                               ld_B
                                           <= '0';
                               \operatorname{ld\_Stop}
                               ALU_M
                                            <= "--";
175
                                            <= '0';
                               ld _O
                               B_ld_Addr <= '0';
                               B_{inc}Addr <= '0';
                               B_tri_en <= '1';
                              B_nCS <= '0';
B_nWE <= '1';
B_nOE <= '1';
180
                               next_STATE \le S_7;
185
                                           <= '0';
             when S_7 =>
                               DP_Reset
                               B_Req_AC <= '1';
                               B_Ready_TB <= '1';
                               ld_A
                                           <= '0';
                                            <= '0';
                               ld_B
190
                               ld\_Stop
                                            <= '0';
                                            <= "--":
                               ALU_M
                                           <= '0';
                               ld_O
                               B_- l \, d_- A \, d \, d \, r \quad <= \; , \, 0 \; , \; ;
                               B_{inc}Addr <= 0;
195
```

```
B_tri_en <= '1';
                             B_nCS <= '0';
B_nWE <= '1';
B_nOE <= '1';
200
                              if (TB_Ackn_B='1')
                                then next_STATE<=S_8;
                                else next\_STATE \le S\_7;
                              end if;
205
             when S_8 =>
                              \mathrm{DP} \_\mathrm{Reset}
                                          <= '0';
                                          <= '1';
                              B_Req_AC
                              B_Ready_TB \le 0;
                                          <= '0';
                              ld_A
                              ld_B
                                          <= '0';
210
                                          <= '0';
                             1d_Stop
                              ALU_M
                                          <= "--";
                                          <= '0';
                              ld_O
                              B_1d_Addr <= '0';
215
                              B_{inc}Addr <= '0';
                              B_tri_en <= '1';
                             B_nCS <= '0';
B_nWE <= '1';
B_nOE <= '1';
220
                              if (TB_Ackn_B='0')
                                then next_STATE<=S_9:
                                else next_STATE<=S_8;</pre>
                              end if;
225
                             DP_Reset
                                          <=\ ,0\ ,;
             when S_9 =>
                                          <= '1';
                              B_Req_AC
                              B_Ready_TB \le 0;
                                          <= '0';
                              ld_A
                                          <= '0';
                              ld_B
                             ld\_Stop
                                          <= '0';
                                          <= "--";
                              ALU_M
                                          <= '0';
                              ld_O
                              B_ld_Addr <= 0
                              B_inc_Addr <= '1';
235
                              B_tri_en <= '1';
                             B_nCS <= '0';
B_nWE <= '1';
B_nOE <= '1';
240
                              if (StopMsg='1')
                                then next_STATE<=S_10;
                                else next_STATE \le 2;
                              end if;
245
             when S_10 =>
                             DP_Reset
                                          <= '0';
                                          <= '0';
                              B_Req_AC
                              B_Ready_TB <= '0';
                                          <= '0';
                              ld_A
                                          <= '0';
                              ld_B
250
                                          <= '0';
                              ld_Stop
                                          <= "--":
                              ALU_M
                                          <= '0';
                              ld_O
                              B_ld_Addr <= '0';
                              B_{inc}Addr <= '0';
255
                              B_tri_en <= '0';
                             B_nCS <= '1';
B_nWE <= 'Z';
                             B\_nOE <= 'Z';
260
                              if (AC_Grant_B='0')
```

```
then next_STATE<=S_END;
else next_STATE<=S_10;
end if;

when S_END=> -- nothing (Procedure quits)

end case;
end if;

270
end process;
end FSM_B_dHS_mem_clock_ex_behavioral;
```

# C.4.9 Datapath of B identical to A (see A)

The datapath for the receiver B is identical to the datapath of the sender  $A. \rightarrow See$  datapath for sender A.