## AUTOMATIC PLA SYNTHESIS FROM A DDL-P DESCRIPTION

S. Kang<br>W. M. vanCleemput

Center for Integrated Systems<br>Department of Electrical Engineering<br>Stanford University<br>Stanford, California 94305

Abstract - This paper describes an automatic PLA aynthesis (APLAS) system which automatically generates a PLA for the control function of a design from a DDL-P description of a digital system. APLAS can also minimize and partition the PLA to meet the design constraints.

This is a very convenient tool for designing finite state machines. The control circuit of any digital system for which a state diagram can be drawn can be designed easily using this system.

## 1. Introduetion

Because of its regular structure, a programmable logic array (PLA) eases many difficult problems associated with hardware bynthesis. This prompts more use of PLA's these days, especially in VLSI. The main objective of our work was to develop a practical PLA synthesis procedure including PLA optinization techniques. The emphasis was put on VISII applications, rather than on FPLA's.

Considering the fact that most data operators are more or less standardized while the control circuitry varies from machine to machine, we use PL $\Lambda$ 's only for implementing the control, not for data manipulation. Also by doing so, the machine structure required by the design specification can be kept during the design process.

Since a PLA is a two-level structure, it has some inherent redundancy from the classical point of view, whicl put some constraints on ils usc. But by employing the design methodology of random logic, the PLA can be optimized, which can be done in three ways: minimization, partitioning and folding. We considered two of them, minimization and partitioning.

Although we have developed many algorithms, we can not properly present them in this paper due to limited space. Therefore the emphasis is placed on the results. For the algorithms, we advise the reader to refer to [1].

## 2. Overview of APLAS

Figure 1 shows the basic structure of APLAS. The first' program, SALT (Stauford Automatic Logic Translator), translates the DDL-P description of a digital system into Boolean equations, which the second program, SPAM (Stanford Programmable Array Minimizer), minimizes and converts into PLA format. The third program, PAPA (Programmable Array Partitioner), is used to partition a large PLA into smaller PLA's to meet the design constraints.

These programs are relatively independent of each other and each can be used separately. Figure 2 shows the stcps required in designing a digital circuit using APLAS.

In the next sections, DDL-P and the three subsystems will be discussed.


Figure 1. API_AS system


Figure 2. Design Steps of APLAS

## 3. DDL.P

DDL-P $[8,10]$ is a Stanford version of DDL [6-8] (Digital Design Language), a register transfer language. In this section, the philosophy of DDL-P will be discuased.

### 3.1 DDL-P Machine Model

The DDL-P machine consists of two main parts, OPERATION and CONTROL (Figure 3). The OPERATION part conaists of registers, ALU, memory, etc. which basically store and/or change data according to the instructions generated by the CONTROL part.

The CONTROL part is a eystem controller which goea through certain steps and insues command signals in each step, which control the actions in the OPERATION part. In turn, the OPERATION sends mome information to the CONTROL, which is used to determine what next step the machine should take and/or what signals the CONTROL should generate In the next step. The reader can consider the CONTROL part to be a finite atate machine. The DDL-P language also supports a more complicated machine itructure (interpretively linked machines), which is not supported by APLAS.


Figure 3. DDL-P Machine Model

Example - BLACKJACK machine
". blackjack machtme...
register score[5], cardbuf[5]. ff.
terminal hit, broke, stand,
value [1:5] = input(1.value).
YCRD $=$ IMPUT ( $1, Y C R D$ ) .
YL17 $=\operatorname{sCORE}$ 17. $\mathrm{n}^{22}=\operatorname{sCORE}$ 222.
mace $=$ cardbuf\#1.
OPERATION
TPT $=$ [CARDBUF $<-$ SD10]. TMT $=$ [CARDBuF $<-$ 5D22].
TVC $=\{$ CARDBUF $<-\mathrm{VALUE}]$, IhIt $=\left\{\mathrm{HIT}=1 \mathrm{Bi}_{\mathrm{i}}\right\}$.
ISTD $=[$ STAND $=1 B 1]$, IBRK $=[$ BROKE $=1 B 1]$,
CLS $=[$ SCORE $<-5 D 0]$.
ADD $=$ [SCORE $<-(\operatorname{SCDRE}(+)$ CARDBUF)TAIL 5).
$\mathbf{K F F}=[\mathrm{FF}<-1 \mathrm{DO}] . \mathrm{JFF}=[\mathrm{FF}<-1 \mathrm{D} 1 \mathrm{l}$ ].
CONTROL

$$
\begin{aligned}
& \text { CLS, KFF, -> B/ } \\
& \text { IHIT,TVC, IF YCRD THEN } \rightarrow \text { C ELSE } \rightarrow \text { b ENDIF/ } \\
& \text { If YCRD THEN } \rightarrow \text { C ELSE } \rightarrow \text { D ENDIF/ } \\
& \text { add. IF mace +FF then } \rightarrow \text { f else } \rightarrow \text { e enoif/ } \\
& \text { JFF. TPT, -> D/ } \\
& \text { IF YLit THEN } \rightarrow \text { B 日les } \rightarrow \text { C ENDIF/ } \\
& \text { If K22 THEN } \rightarrow \text { n ElSE } \rightarrow \text { H ENDIF/ } \\
& \text { KFF, TNT, IF FF THEN } \rightarrow \text { D ELSE } \rightarrow \mathrm{J} \text { ENDIF/ } \\
& \text { IBRK, IF YCRD THEN } \rightarrow \text { A ELSE } \rightarrow \text { J ENDIF/ } \\
& K \text { : ISTD. IF YCRD THEN } \rightarrow \text { A ELSE } \rightarrow K \text { ENDIF/. } \$
\end{aligned}
$$

In the above example, the CON'rROL section in the description corresponds to the control unit in Figure 4. The other parts of the machine are described in TER:MINAL and OPERATION which thow implicitly or explicitly how the components are interconnected and at what points the flow of data is controlled. For example, some combinational networks are explicitly described in TERMINAL (e.g. YL17, YL22, etc.) while the ADD function described in OPERATION implicitly assumes the existence of a combinational network (adder) and a control point.

How all these actions are controlled to nake this circuit a blackjack machine is described in the CONTROL section. This CONTROL section can be translated into Doolean equationa by SALT, which can be directly mapped into physical hardware.


Figure 4. Blackjack Machine

### 3.2 Characteristics of DDL-P

If carefully used, a DDL.P description can contain all the structural information as well as functional behavior. It is suitable for a digital system which is synchronized by one master clock. At the current stage, DDL-P does not support concurrency of mudular blocks, which limits its use to a simple machine structure.

One of the most important and distinct characteristics of DDL-P is that it has a clear boundary between data How and control flow. This can be painful to a designer who wants to describe a digital system in DDL-P. On the other hand, it is a convenient feature for a designer who wants to synthesize physical hardware from a DDL-P description.

## 4. SALT

The basic function of SALT is to translate the DDL-P description of a digital machine into Boolean equations that can be realized by physical hardware. The entire process is interactive and consists of three major steps.

First SALT interprets the input and sets up various tables for further processing. The next step is the state assignment. A user can do this manually, or abk SALT to do it automatically. Finally, SALT generates output equations for the control signals and fip-flop equations for the next state functions.

Currenily SALT gencrates Boolean equations only for the control circuitry. The operation part (data paths) should be designed manually or by other design aids to make the design complete.

### 4.1 CONTROL Section Implementation

There are eeveral ways of realizing CONTROL. One of them, which is used in SALT, is shown in Figure 5. CONTROL gets signals (qualitiers, e.g. YCRD, etc. in Figure 4) from the outside world (OPERATION) and combines them with the present state information which is internally stored in the state fip-llops. And then it generates control signals (instructions, e.g. TPT, ADD, etc. in Figure 4) to the outside world and next state function which will be stored in the state filp-flops.


Figure 5. Control Realization

SALT gencrates Boolean equations for the instructions and next state functions in terms of the qualifiers and the present state codes. The equations are used to build the hardware. Random logic or PLA's can be used as building blocks depending on the circumstances.

### 4.2 Decoder in TERMINAL

Conceptually it is possible to put everything in the CONTROL section of the DDJ.-P description, which should be included in the control circuit. But for practical reasons, it is not desirable.

For example, suppose we have 8 qualifiers which are not disjoint. Then we have up to 256 different conditions and should consider all or pari of them in each state. If wo use the 8 qualifiers directly in the CONTROL section of the description, we have to generate all thosc conditions in evcry state, which is quite inefficient. It is also difficult to read and understand that type of description.

As a way of avoiding that difficulty, we decode the qualifiers to generate the mutually exclusive signals, and consider them separately in the CONTROL section. This decoding section may be included in the TERMINAL section of the description.

If the designer wants the decoder in the real hardware, there is no problem. But if he merely uses the decoder to enhance the readability of the description, the description itself does not reflect the real hardware exactly. Therefore we should have a means to tell SAITT that the decoder does not exist and should be included in the control circuit. This is done as a comment in the TEILMINAL Bection.

### 4.3 State Assignment

State assigument is one of the important steps in SALT. Even though many methods have been proposed for the state assignment process [11-14], the practical method for finding the optimum state assignment has not been found, yet.

In real design, reducing the gate complexity is ouly one objective. In fact, for a proper circuil operation, race aid hazard problems should be considered and they have to be eliminated at any cost.

SALT provides two ways of atate assignment: manual and sutomatic. Automatic asagnment is provided as a simple way of doing the asignment without too much concern with timing probleme.

## Manual state assignment

The user must select the number of flip-flops. After chooring the radir for atate assignment, the user can type in the atate codes for each state. SALT does not accept identical codes for two different states. During the procesa, the user can ree the result up to the current position, change the previous code, or drop everything and restart the whole process.

## Automatic state assignment

SALT always ures the minimum number of FF's. The user can choose between two types of codes: binary and Gray. The user may use these codes as a otarting point for his search for better atate state assignment. SALT assigns the selected code to the states in the written order of the description.

It should be noted that for an odd number of states, Gray code cannot be formed, i.e., the starting state and the last atate can not be unit distance apart. SALT does not overcome this difficulty and is satisfied with distance two between the first and last states in the description.

### 4.4 Selection of Flip-Flop types

Together with state assignment, the selection of fip-flops affects the hardware cost. Since there are many kinds of flipflops, it is not easy to choose any particular type of flip-fiop for a given job. At the moment, it reems that only the desig.er's experience, intuition and/or preference can justify the choice of the flip-flops.

Currently three types of flip-fiops are used in SALT: JK, D and T. JK flip-fiops tend to result in less hardware if used with conventional combinational networks, but need twice the number of next state functions compared to D or T fip-flops. D flip-flops are preferable because of their simplicity if a PLA is used for the combinational network.

### 4.5 Example

The output of SALT is a met of Boolean equations. Most identifiers used in the equationa come from the description. Some identifiers related to the atate flip-flops are internally created by SALT. The atate flip-fiops are implicitly numbered by integers and their outputs and inputs are referenced as

| 0 On | output of the n th F |
| :---: | :---: |
| Dr | input to the a th D FF |
| Tn | input to the n th T FF |
| Jn . Kn | lapute to the n |

When not all atate codes are used, the unused codes can be don't-cares depending on the conditions. For that purpose, SALT puts out the number of flip-flops and the used codea for atate assignment if the poasible codes are not used up.

The following is an example output of the blackjack machine of which the DDL-P description is given in the previous nection.


Some results of SALT are shown in Table 1.
Table 1. SALT Execution

| DDL-P Input | $\begin{aligned} & \text { Quali-- } \\ & \text { fiers } \end{aligned}$ | Control Signals | $\left\lvert\, \begin{aligned} & \# \text { \# of } \\ & \text { Suates } \end{aligned}\right.$ | Statc Codes | FF | $\begin{gathered} \text { Ex. } \\ \text { Time } \end{gathered}$ |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| Modulo 3/4 counter | 1 | 2 | 4 | Gray | IK | 1.096 s |
| Traffic Light Controller | 3 | 5 | 4 | Gray | D | 1.233 s |
| 5-Bit Shift Registers | 0 | 0 | 5 | Binary | D | 1.502 s |
| Decade counter | 0 | 1 | 10 | Binary | D | 1.463 s |
| Blackjack Machine | 5 | 10 | 10 | Gray | JK | 1.984 s |
| Prime Number Counter | 3 | 1 | 19 | Binary | D | 2.643 s |
| Intel 8008 Microprocessor | 26 | 41 | 18 | Gray | D | 4.425 s |
| Intel 8080 Microprocessor | 60 | 61 | 19 | Gray | D | 6.112 s |
| PDP 11 | 95 | 54 | 112 | Gray | D | 61.201 s |

## 5. 8PAM

SPAM (Stanford Programmable Array Minimiser) is a program to minimize a multiple output Boolean switching function. The switching function is assumed to be implemented on a PLA. Since the number of products is important in a PLA implementation, the primary function of SPAM is to minimize the number of products which cover the given awitching function.

SPAM accepts inputs of three different formats: Boolean equations, truth table, or SALT results. It then converts the function to sum-of-products form, minimizes the number of products and outputs the result in the PLA format.

After minimizing the number of products, SPAM tries to reduce the connections in the AND and OR arrays. The fewer connections in the AND and OR arrays, the smaller the AND and OR gates become in a random logic implementation. Therefore SPAM can be used to minimize two-level random logic (AND-OR, NOR-NOR, NAND-NAND) as well as PLA's.

SPAM does not guarantce an optimal colution, but most solutions are near optimal. Absolute minimality is often achieved especially, for relatively small problems.

The current version of SPAM can handle 72 inputs, 144 outputs and several thousand products. These limits may be changed easily as they are essentially a function of the available address space for the program.

### 5.1 Example

SPAM recognizes the input format with the first character. The truth table form must start with '\#'. SALT result starts with ' $\$$ '. Anything else will be considered to be Boolean equations. The input is essentially freeformat. Blanks may be inserted at will to improve readability, except the blanks should not be embedded in identifiers of Boolean equations. Cemments may also appear anywhere blanks are allowed.

The following output came from the SALT result of Blackjack machine given in the previous section.

SOLUTION : B INPUTS 14 OUTPUTS 18 PRODUCTS

6. 0-00---- $11 \ldots . . . .1 \ldots$
7. 11---0---.............. 1
B. 1-11--1- .1.....1...1.
9. 11---1--- ...............
10. 0-1~--- .............
11. 0010----...1....... 1 .
12. 1-010-2-- ..............
13. 1-0-0-m-.............. 1
14. 1-01~-- $-. . . . . .1 . .$.
15. 1010-ッ-.. ................ 1
16. 1010-0-0-............11.
17. -010----1 ................. 1
18. -010---1- ................. 1

### 5.2 Results

Some results of SPAM are shown in Table 2.

Table 2. SPAM Exccution

| Source | InputType | in/Out | \# or Products |  | Fxccution Time |
| :---: | :---: | :---: | :---: | :---: | :---: |
|  |  |  | Inicial * | Final |  |
| Modula 3/4 counicr | Salt | 3/6 | 8/10 | 5 | 0.31 s |
| Traffic light Controller | Satit | $5 / 7$ | 28/0 | 10 | 0.51 s |
| 5-bit Shift Register | SAIT | 5/5 | 5/11 | $s$ | 0.45 s |
| Decade counter | SAIT | 4/5 | $16 / 2$ | 9 | 0.53 s |
| Blackjack Machine | SAIT | 9/14 | $40 / 2$ | 18 | 1.30 s |
| Prime Number Counter | SAIT | 8/6 | 77/3 | 37 | 4.55 s |
| Intel 8008 Microprocessor | SAIT | 31/46 | 148/3 | 68 | 19.31 s |
| Intel 8080 Mic 保rucessor | Snit | 65/66 | 181/3 | 129 | 43.00 s |
| PIJP 11 | SAIT | 102/61 | 1651/1 | 348 | 156.00 s |
| $4 \times 4$-bit Multiplier | It. | 8/8 | 225/0 | 129 | 92.12 s |
| 4-bit ^dder | 'Tr. | 8/5 | 255/0 | 75 | 26.95 s |
| 4-bit Adder (with carry-in) | It. | 9/5 | 511/0 | 135 | 129.94 s |
| PIAI | 'Tr. | 9/15 | 33/29 | 29 | 5.42 s |
| PL^ 2 | Tr. | 8/14 | 33/28 | 32 | 7.29 s |
| PLA 3 | 1 t . | 23/40 | 273/273 | 165 | 390.00 s |

* Initial number of products $=>$ normal products/don't care products

Tr. : Truth cable
6. PAPA

In a real circuit, a large PLA tends to be quite wasteful or not fast enough to support the other parts of the aystern. In this case, we can split it into several smaller PLA's to reduce the chip area and/or improve the speed.

PAPA in a program which does this job. It alao has a redundancy removal routine which detects input and output redundancy [1] from a given PLA. The chip area and the overhead of using the many PLA's are the major factors considered in partitioning.

### 6.1 Erample

PAPA can only accept the truth table input. The output is the partitioned truth table with come information about the result of partitioning such as the size of each partitioned PLA and the reduction ratio of chip area. The order of output lines is changed by PAPA.

The following example is the result of Blackjack machine whose truth table is given in the previous section. The ordering of outputs is omitted on purpose for clarity.

```
SOLUTION : 2 PLA'&
PLA 1 : 0 INPUTS O OUTPUTS 14 PRODUCTS
PLA . 2: & INPUTS 6 OUTPUTS 4 PRODUCTS
TOTAL AREA : IMITIAL 
REDUCIION RATIO :0.746
    000000000 000000000111111
    123456789 12345678901234
    1-010---- 1.......
        11---1--- 1.......
        0-01----- 111.....
        1010--0-- 1.....1.
        0-00-----1..11...
        1-11---0-1...11.1
        0--11----1,...1.
        1-11---1-....111.
        0-1-----*.........
        11---0--- ......11
        -010---1- ........1
        -010----1 ....... 1
        1010----- ........ 1
        1-0-0--=- ........1
        lll
```


### 6.3 Results

Some results of PAPA are shown in Table 3. All inpute are the PLA'I minimized by SPAM. Overhcad of using many PLA's is also accounted for calculating the reduction ratio. The partitioned PLA's are not minimized by SPAM, which may be poszible.

Table 3. PAPA Execution

| INPUT (in/outproduct) | \# of PLA's | Arca |  | $\left\|\frac{1}{\text { Reduction }}\right\|$ | Execution Time |
| :---: | :---: | :---: | :---: | :---: | :---: |
|  |  | Initial | Final |  |  |
| Blackjack (9/18/17) | 3 | 612 | 314 | 1.89 | 0.17 s |
| 8080 A (65/66/129) | 16 | 25284 | 4893 | 4.33 | 5.83 s |
| 8080 B (14/66/247) | 8 | 23218 | 9354 | 2.23 | 4.32 s |
| 4-bit MulL (8/8/129) | 4 | 3096 | 2460 | 1.18 | 0.39 s |

* $1 /$ Reduction $=\operatorname{lnital}$ area $/($ Final arca + Overhcad $)$


## 7. Conelusiona

We have presented an automatic PLA synthesis system with some results on its performance. We successfully used this system to synthesize the control circuitry of microprocessors cimilar to the Intel 8008 and the 8080.

The system includes a PLA minimizer and a PLA partitioner for better use of PLA's. The PLA minimizer is powerful enough to aecept a awitching function with 72 inputs, 144 outputs and several thousand products. The performance of the partitioner largely depends on a PLA format. But it can reduce the chip area drastically ( sometimes more than $50 \%$ ) in many cases.

## REFERENCES

1. S. Kang, Minimization, Partitioning and Synthesis of Programmable Logic Arrays, Ph.D. dissertation, Dept. of EE, Stanford University, to be published.
2. S. Kang and W. M. vanCleemput, SALT user's manual, Computer Systems Lab. Tech. Report No. 203, Stanford University, Mar. 1981.
3. S. KANG and W. M. vanCleernput, SPAM user's manual, Computer Systems Lab. Tech. Report No. 204, Stanford University, Mar. 1981.
4. W. M. vanCleemput, Design automation at Stanford, Computer Systems Lab. Tech. Report No. 178, Stanford University, July 1979.
5. W. M. vanCleemput, Design automation at Stanford, Computer Systems Lab. Tech. Report No. 184, Stanford University, Feb. 1980.
6. J. R. Duley and D. L. Dietmeyer, "A digital syatem design language (DDL)," IEEE Trans. Comput., vol. C-17, pp. 850-861, Sept. 1968.
7. J. R. Duley and D. L. Dietmeyer, "Tranalation of a DDL digital aystem specification to Boolean equations," IEEE Trans. Comput., vol. C-18, pp. 305-313, Apr. 1969.
8. R. L. Arndt and D. L. Dietmeyer, "DDLSIM - A digital deaign language bimulator," Proc. National Electronics Conf., vol. 26, pp. 116-118, Dec. 1970.
9. W. E. Cory, J. R. Duley and W. M. vanCleemput, An introduction to the DDL-P language, Computer Systems Lab. Tech. Report No. 163, Stanford University, Mar. 1879.
10. W. E. Cory,J. R. Duley and W. M. vanCleemput, DDLP Command Languago Manual, Computer Systems Lab. Tech. Report No. 164, Stanford University, Mar. 1979.
11. D. B. Armstrong, "A programmed algorithm for assigning internal codes to requential machincs," IRE Trans. Electron. Comput., vol. EC-11, pp. 466-172, Aug. 1062.
12. D. B. Armstrong, "On the efficient assignment of internal codes to sequential machines," IRE Trans. Electron. Comput., vol. EC-11, pp. 611-622, Oct. 1962.
13. T. A. Dolotta and E. J. McCluskey, "The coding of internal states of sequential circuits," IEEE Trans. Electron. Comput., vol. EC-13, pp. 549-563, Oct. 1964.
14. J. R. Story, H. J. Harrison, and E. A. Reinhard, "Optimum atate assignment for oynchronous sequential circuits," IEEE Trans. Comput., vol. C-21, pp. 1365-1373, Dec. 1972.
15. P. Bricaud and J. Campbell, "Multiple output PLA Minimization : EMIN," Wescon 78.
16. S. J. Hong, R. G. Cain and D. L. Ostapko, "MINI : A heuristic approach for logic minimization," IBM J. Res. Develop., vol. 18, pp. 443-458, Sept. 1974.
17. H. Langenbacher, "Boolean minimization for logic arrays," M. S. Thesis, Dept. of EE, San Diego State Univ. , Fall, 1979.
