Copyright © 1980, by the author(s).
All rights reserved.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission.

# HIGH SPEED CMOS A/D <br> <br> CONVERSION TECHNIQUES 

 <br> <br> CONVERSION TECHNIQUES}
by
William C. Black, Jr.

Memorandum No. UCB/ERL M80/54
1 November 1980

# HIGH SPEED CMOS A/D CONVERSION TECHNIQUES 

by
William C. Black, Jr.

Memorandum No: UCB/ERL M80/54 November 1980

```
    ELECTRONICS RESEARCH LABORATORY
    College of Engineering
University of California, Berkeley
    94720
```

TABLE OF CONTENTS
CHAPTER 1 - INTRODUCTION ..... 1
CHAPTER 2 - HIGH SPEED A/D CONVERTER APPLICATION AREAS ..... 3
2.1 Digital Time-Base Correction ..... 3
2.2 Digital Video Transmission and Storage ..... 4
2.3 Real Time Image Processing ..... 6
2.4 Digital Output Monolithic Imager ..... 7
2.5 Wide Dynamic Range Digital Audio ..... 7
CHAPTER 3 - CHARACTERIZATION AND DESIGN OF HIGH SPEED MONOLITHIC ..... 9 A/D CONVERTERS
3.1 Characterization of High Speed A/D Converters ..... 9
3.1.1 Quasi-Static Characteristics ..... 9
3.1.2 Dynamic Characteristics ..... 15
3.2 Flash Converter Technique ..... 23
3.2.1 Comparator Input Bias Currents ..... 26
3.2.2 Resistor Matching ..... 26
3.2.3 Comparator Intrinsic Offset ..... 34
3.2.4 Dynamic Ladder Tap Loading ..... 36
3.2.5 Clock Skew Between Comparators ..... 38
3.2.6 Comparator Phase Delay ..... 40
3.3 Successive Approximation Technique ..... 41
3.3.1 Weighted-Capacitor Methods ..... 44
3.3.2 Capacitor Matching Requirement ..... 50
3.3.3 Speed Limitations ..... 57
3.4 Pipeline Converters ..... 62
3.4.1 Divided Reference Pipeline ..... 64
3.4.2 Multiplied Remainder Pipeline ..... 69
3.4.3 Pipeline Performance Limitations ..... 71
3.5 Hybrid Converters ..... 72
3.5.1 Two-Step Flash Converter ..... 73
3.5.2 Flash-Pipeline Converter ..... 74
CHAPTER 4 - PRECISION ELEMENTS ..... 79
4.1 Resistor Matching Considerations ..... 79
4.2 Characteristics of Planar Diffused Resistor Structures ..... 82
4.3 Characteristics of Diffused Resistors Due to Sidewalls ..... 87
4.4 Compensation of Voltage Dependence in a Diffused Monolithic ..... 97 Resistor
4.5 Temperature Coefficient of Resistors ..... 101
4.6 Matching Consideration in Precision Capacitors ..... 103
4.7 Capacitance of Nonplanar Structures ..... 107
4.8 Voltage Coefficient of MOS Capacitors ..... 113
4.9, Temperature Coefficient of MOS Capacitors ..... 117
CHAPTER 5 - ANALOG CIRCUIT BUILDING BLOCKS ..... 120
5.1 Source Followers ..... 121
5.1.1 Frequency Response and Gain ..... 121
5.1.2 Transfer Characteristic Nonlinearity ..... 123
5.1.3 Elimination of Intrinsic Offset ..... 125
5.2 Single Gain Stage Linear Amplifiers ..... 127
5.2.1 Class-A Amplifiers ..... 127
5.2.2 Class-B Amplifiers ..... 133
5.2.3 Methods for Increasing Bandwidth ..... 136
5.3 Two Gain-Stage Linear Amplifiers ..... 140
5.3.1 Compensation Techniques ..... 142
5.3.1A Simple Pole-Splitting ..... 142
5.3.1B Pole-Splitting with Feedforward Blocking ..... 144
5.3.1C R-C Compensation Method ..... 145
5.4 Integrators ..... 149
5.4.1 Dyanmic Characteristics ..... 150
5.4.2 Power Supply Rejection ..... 154
5.4.2A General Case ..... 154
5.4.2B Effect of Offset Cancellation Cycle ..... 157
5.5 Output Stages ..... 158
5.6 Voltage Comparators ..... 164
5.6.1 Single Ended Comparators ..... 166
5.6.2 Differential Comparators ..... 169
5.6.3 Bandwidth Considerations ..... 171
5.6.3A Reset Cycle ..... 171
5.6.3B Compare Cycle ..... 175
5.7 Noise in Analog Circuits ..... 176
CHAPTER 6 - TIME INTERLEAVED CONVERTER ARRAYS ..... 180
6.1 Comparison with Other Methods ..... 180
6.2 Error Analysis ..... 187
6.2.1 Noise and Distortion Error Power ..... 187
6.2.2 Spectral Distribution of Noise and Distortion ..... 197
6.3 A Monolithic Time-Interleaved Converter Array ..... 203
6.3.1 Chip Description ..... 203
6.3.2 Experimental Results ..... 230
6.3.3 Anticipated Characteristics of Implementations in ..... 239 an Advanced Process
6.4 Application to a Digital Signal Processor ..... 240
CHAPTER 7 - CONCLUSIONS ..... 244
REFERENCES
APPENDIX A
APPENDIX B

# William C. Black, Jr. Dept. of Electrical Engineering and Computer Sciences 

Chairman of Committee

ABSTRACT

Several high speed analog to digital (A/D) conversion techniques are examined for potential application in large-scale integrated (LSI) complementary-symmetry metal-oxide-semiconductor (CMOS) converters. These techniques, both new and old, are considered with regard to fundamental limitations and ease of practical CMOS realization.

One of the new techniques examined, is demonstrated in a high speed 7-bit A/D converter chip. In this chip, an array of $A / D$ converters with interleaved sampling times are used as if they were effectively a single converter operating at a much higher effective sampling rate. This array method allows a reduction in the required circuit die size when compared with other high speed techniques (in the same fabrication technology) and may be used to achieve higher sampling rates than those obtainable with any single converter. Using a $10 \mu \mathrm{~m}$ metal-gate CMOS integrated circuit process, the chip described here achieves a conversion rate of 2.5 MHz with full linearity and displays a signal to noise ratio of 39 dB for a 100 kHz input signal. Conversion at reduced linearity is achieved up to 4 MHz .

## CHAPTER 1

## INTRODUCTION

The advantages of digital signal processing and storage techniques could be exploited in many new application areas if high-speed integrated circuit (IC) analog-to-digital (A/D) converters were available at a sufficiently low cost. As present day high-speed converters use techniques which require large IC die sizes and/or fairly exotic fabrication processes, these circuits have remained too expensive for many applications. Furthermore, these $A / D$ techniques will be exceedingly difficult to integrate along with a VLSI digital signal processor, because of the large $A / D$ die size and/or process requirements.

The objective of this research has been to study alternative highspeed $A / D$ conversion techniques amenable to realization in MOS LSI. In this dissertation, several techniques will be described which may allow a substantial reduction in high-speed analog interfacing cost, and hence allow a number of new application areas to become economically realizable. In the second chapter, several digital application areas are described which are of this type. While this list is not intended to be exhaustive, it demonstrates the broad range of application areas that may be impacted by developments in inexpensive high-speed $A / D$ converters. The third chapter covers a number of specific $A / D$ conversion methods, both old and new, and discusses the inevitable tradeoffs between speed, IC chip area and power that exist for each converter type. Chapter 4 deals with the fundamental characteristics and limitations of precision elements in integrated circuit processes, while in Chapter 5, a set of basic analog circuit building blocks are described. The sixth chapter
describes the analysis, design and experimental performance of a new single-chip $A / D$ converter, which can provide substantial improvements in effective conversion, speed, and a significant reduction in die area compared to alternative designs for similar specifications. Conclusions are presented in Chapter 7.

## CHAPTER 2

## HIGH-SPEED A/D CONVERTER APPLICATION AREAS

Perhaps the largest single application of high-speed monolithic converters would be in the area of digital video. Digital techniques are presently employed in most television studios, but are not yet economical in home television receivers. A number of new techniques could be applied in both areas if less expensive high-speed converters were available. Similarly, digital techniques would have profound effects in low-cost imaging systems and in high fidelity audio equipment if the required hardware could be sufficiently integrated. A few of the application areas that would be impacted by the development of inexpensive high-speed $A / D$ converters are presented below.

### 2.1 Digital Time-Base Correction

A common problem in television studios is that of synchronizing broadcast sources. As different sources may be dubbed together or asynchronously switched, they must be synchronized to avoid annoying "page flipping." This problem is particularly severe when mobile stations or satellites are employed, as the signals may be variably phased and doppler shifted in addition to being asynchronous. Many systems that were used in the late sixties and early seventies required that each broadcast source use a rubidium atomic-standard, which would not drift more than a few microseconds per day. The broadcast signal was then electrically delayed [1] by an adjustable amount which would allow synchronization. Other methods employed conventional phase-locking techniques $[2,3]$ but required that a communication line exist between
the studio and the broadcast source (typically a phone line). More recently, digital techniques have been used to achieve synchronization within a line (corrects for magnetic tape jitter, [4]), or within an entire frame [5,6]. This latter technique is the most generally useful, as gross changes in phase or doppler shift will not result in synchronization loss.

As line synchronizers employ relatively little memory (typically a few video lines of approximately 512 bytes each), the price of the unit is dominated by the cost of the $A / D$ and $D / A$ converters. Thus, if an $A / D$ and D/A subsystem could be made with a sufficiently small die area, a monolithic line synchronizer should be relatively easy to implement. This device would have possible use in nearly all analog video recorders, if it could be made at a low enough cost.

Although frame synchronizers employ a fairly large digital memory (typically about 300 Kbytes), the overall cost is still affected by the complexity of the analog interfacing. While a reduction in frame synchronizer cost would have some impact on television studios, extensions of the technique into home systems with multiple broadcast sources (multiple channels on a split screen for instance), appear possible.

### 2.2 Digital Video Transmission and Storage

Digital transmission and storage of video information is somewhat involved, due to the relatively high bit rates required. A number of advantages, however, could be obtained from the use of this technique.

A studio which employed a digital image transmission system between various types of equipment, is more easily adapted for use with different video standards than equipment designed for operation with one standard
only. As Europe uses two incompatible standards, PAL and SECAM, which are also incompatible with the American NTSC standard, commonality of equipment is presently next to impossible. A studio which was entirely digital could be adapted to any analog standard just prior to transmission [7]. In addition, this technique would allow simple interfacing to digital image processors and storage devices, and reduces the hardware requirements of digital frame synchronizers and dubbing equipment as the encoding has already been performed.

Digital video transmission via satellite is under active consideration, as in certain cases it will result in a reduction in the required satellite transmission power [8]. Further, this technique may allow a reduction in receiver sensitivity requirements and hence potentially lower cost in direct "satellite to home" broadcast applications.

Digital video recording methods offer many advantages over analog techniques, including high picture quality, little or no video degradation after multiple program generations and no need for adjustment. This technique has been commonly used with small disk or semiconductor memories, and has recently been demonstrated with a fairly standard magnetic tape [9]. This technique would be highly desirable in home videotape players, if a method could be obtained for reducing the required bit rate. A number of techniques have been investigated, including subNyquist sampling [10] and differential coding [11]. The first method takes advantage of the statistical properties of most video images to allow a reduction in the minimum required sampling rate, although comb filtering in the reconstructed waveform is required, and some types of image aberration may occur. The second method of bit rate reduction utilizes psychological perception characteristics to reduce the required number
of encoded bits. As the techniques which allow the greatest reduction in bit rate are also the most hardware intensive [12], the optimal recording economy will be dictated by a compromise between the two extremes. An integrated video codec and a simple video processor would have profound effects upon the overall cost of a digital video recorder, and quite possibly enable this technique to be applied in even low-cost units.

### 2.3 Real Time Image Processing

Digital image processing has typically been used in only a few application areas, because of the relatively high costs involved. It is, for instance, routinely used in the enhancement and correction of satellite photographs. This technique is rarely used in real time systems, however, as the analog converter and signal processor bandwidths are usually quite high, and as a result quite expensive. If the cost of these components could be reduced, however, real time image processing could be used in even low level application areas. One of the most discussed application of this type, is the commercial television receiver. By utilizing fairly straightforward techniques, vertical and horizontal image enhancement, low level coring ${ }^{1}$, high level clipping and adaptive gain control are all easily accomplished [13]. Further, digital color correction and control is fairly simple to implement. All of these techniques appear to be possible in a one or two chip codec-video processor system, which could be used alone or with shared components, in for example, a digital video recorder.

[^0]
### 2.4 Digital Output Monolithic Imager

A semiconductor imager which employed an on-chip digital encoder would have considerable reduced hardware interfacing requirements when used in a digital video system. If the $A / D$ converter could be made sufficiently small, some degree of image processing may even be economical on-chip. This would allow, for instance, sophisticated digital motion detectors to be realized in as few as one or two chips.

### 2.5 Wide Dynamic Range Digital Audio

Conventional audio recording methods have difficulty in achieving a consistantly wide dynamic range ( $>75 \mathrm{~dB}$ ), because of their susceptability to mechanical imperfections and electronic noise. Recently PCM encoded audio has become popular as an expensive but high performance alternative to analog recending methods $[14,15]$. In this technique, audio input signals are sampled at typically a $40-50 \mathrm{kHz}$ rate with at least 14 bits of resolution [16]. This data is then stored on a high bandwidth recorder (typically a modified video recorder), in a format which allows considerable immunity to tape imperfections and fallout $[17,18]$.

In adapters which allow digital audio recordings to be made on a video tape player (which can still be used for analog video recording), the codec and anti-alias output smoothing filters dominate the unit cost. Using improved monolithic conversion techniques and switched capacitor filters [19] of sufficiently wide dynamic range [20], the entire adapter function could be reduced to a $2-4$ chip circuit. This should allow a considerable reduction from the present $\$ 2000$ unit price [21], and increase the popularity and use of this technique.

Digital encoding of audio would also allow the use of improved equalizers and mixers [22], which could make ever increasing use of audio-band digital signal processing chips [23].

## CHAPTER 3

CHARACTERIZATION AND DESIGN OF HIGH SPEED
MONOLITHIC A/D CONVERTERS

In this chapter the characterization and design of high speed monolithic $A / D$ converters is discussed at length. In Section 3.1 the common methods for evaluating converter performance are shown, including both quasi-static and dynamic input signal tests. Sections 3.2-3.4 discuss the flash, successive approximation and pipeline converter methods, while Section 3.5 demonstrates hybrid, or multiple converter techniques.

### 3.1 Characterization of High Speed A/D Converters

The evaluation of high speed converter performance (whether anticipated or measured) is an important aspect of the design and use of these devices. The tests discussed in this section cover both quasi-static and dynamic input conditions and are generally useful over a wide range of application areas.

### 3.1.1 Quasi-Static Characteristics

Perhaps the most obvious $A / D$ converter test is that of $D C$ linearity. In this test, a slow ramp is applied to the input of the $A / D$ converter as shown in Fig. 3.1. The output of the converter is then reconstructed with a high-quality $D / A$ and compared with the input signal via an $X-Y$ recording mechanism. The resulting pattern for an input signal which extends over the entire dynamic range of the $A / D$ is the converter transfer characteristic. An example of this characteristic for an ideal A/D converter is shown in Fig. 3.2a, while characteristics which result


Fig. 3.1 Measurement of an $A / D$ converter's transfer characteristic.


Fig. 3.2 Quantization characteristics of linear A/D converters. (a) Ideal. (b) Gain error. (c) Offset error. (d) Nonlinearity.
because of various converter nonidealities are shown in $b$ through d. As shown in Fig. 3.2a, the "characteristic line" of a converter is defined in terms of the first and last transition points within the converters dynamic range. The slope of this line indicates the converter gain, while the intercept with the $X$-axis indicates converter offset. As illustrated in Fig. 3.2b and Fig. 3.2c, both gain and offset are somewhat variable in many converter types and in real circuits may result in a reduction in the converters dynamic range. In most applications, however, a small constant error in either the gain or offset is not considered a performance limiting problem. A far worse nonideal effect is that demonstrated in Fig. 3.2d. In this figure a transfer characteristic is shown which exhibits nonlinear behavior. This nonlinearity at low frequencies is invariably caused by simple precision component mismatches within the A/D converter, and is usually described in terms of the integral and differential nonlinearity specifications.

The integral nonlinearity of a converter is defined to be the maximum deviation of the transfer characteristic, from the characteristic line, and is usually expressed in terms of converter LSBs. Similarly, the differential nonlinearity, is defined to be the worst case step size error relative to 1 LSB. In other words, if $\hat{x}$ is the quantized voltage value of $x$, and if the positive going input voltage which is just high enough to create an $A / D$ output code of $D_{n}$ is defined as $V(n)$, then the linearity specifications are given by

$$
\begin{equation*}
\left.I N L \triangleq \max |x-a x-b|\right|_{x_{\min }} ^{x_{\max }} \tag{3.1}
\end{equation*}
$$

[^1]and

$D N L \triangleq \max |1 L S B-V(n+1)+V(n)| \left\lvert\, \begin{aligned} & D_{\max -1} \\ & D_{\min }\end{aligned}\right.$
where $a$ is the converter gain and $b$ is the offset. Note that when the DNL exceeds 1 LSB, either missing codes or nonmonotonicities may result, as shown in Fig. 3.3. These types of converter errors are often regarded as catastrophic in control systems ${ }^{2}$, however in communication systems where the effects of nonlinearities are usually statistical in nature, nonmonotonicities may not result in a significant degradation in performance unless they occur often or are near the quiet channel bias point. In these systems the INL specification tends to be more important, as in many converter types it more accurately reflects the nonlinear performance over the entire input range. ${ }^{3}$

It is interesting to note that in practical test situations the foregoing analysis may usually be simplified. For most binary weighted converters the entire transfer function may be inferred by measuring the input voltage required for each major bit transition (from 0... 0111 to 0... 1000 for example) or by evaluating the differential nonlinearity at

[^2]

Fig. 3.3 Quantization characteristics for converters with a DNL > 1 LSB. (a) Missing code. (b) Non-monotonic behavior.
an odd number of points across the entire input range [26].

### 3.1.2 Dynamic Characteristics

Dynamic tests of a high frequency $A / D$ converter are generally more useful in the evaluation of a device than static tests, as they more accurately reflect the converter's characteristics during real operation. Although different applications may require fairly unique converter test procedures, several types of dynamic tests are generally useful. Four of these tests are described here, they are: single-frequency signal to noise ratio (SNR), noise power ratio (NPR), code density test and gain and phase linearity.

The first of these, the single-frequency signal to noise ratio test, is used extensively in this and subsequent chapters as a performance measure for a variety of converter types. In this test, a typically full scale sinusoidal signal is fed into the $A / D$ to be measured, as shown in Fig. 3.4. The output of the $A / D$ is then reconstructed with a high quality digital-to-analog converter, and bandlimited to the Nyquist rate of the sampler or below. The ratio of the measured power at the original signal frequency to all other measured power is the single frequency signal to noise ratio. ${ }^{4}$ The power at nonfundamental frequencies, or the "error power," is due to the A/D converter displaying both finite resolution and limited linearity. As shown in Fig. 3.5, the converter quantizing error may be seen to be

$$
\begin{equation*}
\varepsilon(x)=\delta(x) \tag{3.3}
\end{equation*}
$$

[^3]

Fig. 3.4 Single-frequency signal to noise ratio test.


Fig. 3.5 Quantization and nonlinearity error in an $A / D$ converter.
where

$$
\begin{equation*}
\delta(x)=\hat{x}-(a x+b) \tag{3.4}
\end{equation*}
$$

and where $a$ and $b$ are error minimization terms corresponding to the best fit converter gain and offset. This results in an average error power (referred to 1 ohm) of simply

$$
\begin{equation*}
E_{p}=\left\langle\delta^{2}\right\rangle \tag{3.5}
\end{equation*}
$$

where $\left\langle\delta^{2}\right\rangle$ indicates the mean or expectation value of $\delta^{2}$, and where a and b in Eq. (3.4) have been chosen to minimize $E_{p}$ for a specified input distribution. In general, for an input signal with probability density $p(x)$,

$$
\begin{align*}
& \langle f(x)\rangle \triangleq \int_{-\infty}^{\infty} f(x) p(x) d x  \tag{3.6}\\
& a \triangleq \frac{\langle\hat{x} x\rangle-\langle\hat{x}\rangle\langle x\rangle}{\left\langle x^{2}\right\rangle-\langle x\rangle^{2}} \tag{3.7}
\end{align*}
$$

and
$b \triangleq\langle x\rangle-a\langle x\rangle$

Note that $a$ and $b$ are potentially quite different from the gain (a) and offset (b) described previously, as the new values take into account all converter nonlinearities and the probability distribution of the input signal, while the old values were only a function of the converter transition points at the very ends of the transfer characteristic.

The determination of these parameters, $a$ and $b$, is only necessary during computer simulations of converter performance and is not required
experimentally. This $S / N$ ratio test is particularly amenable to computer simulation in that it allows a dynamic performance measure to be obtained without the need for frequency domain calculation. All of the necessary calculations may be done in terms of input probability functions and specific converter nonidealities, as will be outlined in a subsequent section.

Another useful dynamic converter measurement, is the noise power ratio test. In this case a bandlimited gaussian noise source is used as the input to the A/D converter, as shown in Fig. 3.6. The power from the reconstructed output in a selected frequency band is measured, and then measured again with the same frequency band notched out of the input source. The ratio of the two power measurements indicates the relative performance level of the converter, as dynamic nonlinearity, quantizing and aperture effects will tend to spill noise power into the notched-out band [27]. This tends to be a more complete test of a converter than a sinusoidal SNR measurement of a single input amplitude. As a rule, however, it is a more difficult test to perform and is computationally more difficult to simulate, as both time and frequency domain effects must be included in the simulation. This test is relatively common for evaluation of voiceband communication systems, but is rarely discussed in the context of high frequency or video bandwidth converters.

Yet another useful converter measurement is the output code density test. In this measurement, a signal with a known probability density is applied to the converter under test as shown in Fig. 3.7a. The output of the $n$-bit converter is then used as a memory pointer to indicate which of the $2^{n}$ memory locations is to be incremented. Assuming each location

Fig. 3.6 Noise power ratio test.


Fig. 3.7 Code density test. (a) Test configuration. (b) Output code density of a nonideal A/D converter for a full scale uniform probability input signal (such as a sawtooth).
was initially zeroed, this makes each location a counter which reflects the relative frequency of a specific output code. As converter nonlinearities will tend to distort the distribution of codes, various types of converter behavior may be deduced from the pattern of memory counts. An example of this is shown in Fig. 3.7b. This figure indicates a typical output frequency map for a nonideal A/D converter with a sawtooth input waveform. Ideally, this input signal should produce a constant frequency of occurrence. Note that nonlinearities produce a nonuniform distribution, while missing codes are indicated by the lack of any counts at all.

A problem with this technique, is that nonmonotonic behavior may not result in a variation of count distribution from that of a circuit which is operating properly. In fact, for certain input distributions, several of the A/D output bits may actually be switched without any change in apparent performance based upon this test. Further, this test can be relatively difficult to perform with high speed converters, in that the memory must be read, the contents incremented and the results subsequently stored all in one conversion cycle. In addition, this test can take a significant time to perform in high resolution converters as a large number of counts, on the average, must be obtained in each memory location. As a rule this converter test may be used to supplement the tests presented previously, but is almost never used alone.

The last test to be described, the differential gain and phase measurement is quite important in color video systems and is included here for completeness. In this test a high frequency sinusoid is impressed upon a linear ramp, which usually has a period equal to the line scan time of an NTSC (in North America) or PAL or SECAM (Europe) video signal. This composite signal is fed into the $A / D$ under test, as
shown in Fig. 3.8, and subsequently reconstructed. The high frequency component in the output is then separated from the ramp, and compared in gain and phase to the same signal in the original composite input, as a function of position along the ramp. As the chrominance component of a color video signal is phase encoded atop the luminance signal, a variation in converter response with input amplitude may result in chromatic aberration at different luminance values. A variety of video relevant converter errors may be detected with tests of this type [28,29].

### 3.2 Flash Converter Technique

The most straightforward procedure for realizing a fast $n$-bit A/D conversion, is by performing $2^{n}$ simultaneous voltage comparisons with $2^{n}$ linearly graduated voltage sources. This general "flash" technique is shown in Fig. 3.9. As is evident from the figure, a resistor string is typically used for the generation of unique tap voltages, while a gate array (possibly in conjunction with a ROM) is used for the required $2^{n}$ to $n$ line conversion. This method has long been used in discrete and hybrid circuits, but has been extremely expensive to implement at resolution levels above 4 or 5 bits. Recently flash circuits have been designed in monolithic bipolar form in initially 4-bit [30] and subsequently 8-bit [31] configurations. Thus far only a single MOS flash circuit has been reported [32], it being a 6-bit converter fabricated in a silicon on sapphire technology.

Depending upon the choice of technology, a number of potential problem areas must be considered in the flash converter design. Several of these including some not-so-obvious ones, are discussed below.


Fig. 3.8 Gain or phase linearity test.


Fig. 3.9 Flash converter method.

### 3.2.1 Comparator Input Bias Currents

In flash circuits which are fabricated in a bipolar technology, the effect of comparator input bias current can be quite severe. This current may result in a substantial nonlinearity in the converter transfer characteristic if it is not compensated for in the chip design. This is particularly true in high resolution converters which must use a large number of comparators, and in converters fabricated in a high speed process which usually will display a low bipolar $\beta$. This problem may be reduced by using an input bias cancellation technique [33-35], although this usually requires the use of a pnp device which is typically slow or not available at all (triple diffused processes ${ }^{5}$ ). Alternately high resistor string bias currents, single or few tap trimming, and the use of a tapered resistor string [36] may be used. None of these latter techniques are particularly desirable, however, in that they force a high power dissipation, increased testing complexity (or use, depending upon where trimming is performed), and very accurate knowledge of input bias currents over temperature. This entire problem may be eliminated by the use of an MOS input device on the comparator, either as part of a combination bipolar/MOS circuit $[37,38]$ or as part of an all MOS design. Both MOS and bipolar designs, however, will be susceptible to the problems discussed below.

### 3.2.2 Resistor Matching

Assuming that the reference source is just a string of series resistors, the voltage at any tap " $k$," may be given by

[^4]\[

$$
\begin{equation*}
V_{K}=\frac{\sum_{i=1}^{k} R_{i}}{\sum_{i=1}^{2^{n}} R_{i}} \cdot\left(v_{r e f}^{+}-v_{r e f}^{-}\right)-v_{r e f}- \tag{3.9}
\end{equation*}
$$

\]

where it is assumed that the $k-t h$ tap is above the $k$ th resistor, and $n$ is the bit resolution of the converter. This indicates that the integral nonlinearity about the nominal line may be given by (in LSBs)

$$
\begin{equation*}
I N L=\max \left|\frac{\sum_{i=1}^{k} R_{i}}{\sum_{i=1}^{2^{n}} R_{i}}-\frac{k}{2^{n}}\right|_{k=0}^{k=2^{n}-1} \cdot 2^{n} \tag{3.10}
\end{equation*}
$$

and the differential nonlinearity by

$$
\begin{equation*}
D N L=\max \left|\frac{R_{k}}{\sum_{i=1}^{2^{n}} R_{i}}-\frac{1}{2^{n}}\right|_{k=1}^{k=2^{n}} \cdot 2^{n} \tag{3.11}
\end{equation*}
$$

As the worst case linearity is actually a function of all of the component mismatches, the required component matching for a specified performance level tends to be probablistic in nature. If the resistors are assumed to have some distribution of values, converter performance in various static and dynamic tests may be simulated. A computer program has been written to perform just this simulation. This program, which is listed in Appendix A, assumes a gaussian distribution of component variations where the standard deviation of error is given by the user. By modifying
the input probability distribution, worst case linearity or signal to noise performance may be determined. This program may also be used to simulate the performance of converter arrays, as discussed in Chapter 6.

Simulated worst case linearity error for 100 flash converters of 7-bit resolution, as determined by this program, is indicated in Fig. 3.10. In this figure, the average and standard deviation of the worst case linearity is shown for error about the best fit line, and error about the nominal characteristic. As is evident from the figure, a $\sigma_{R} / \bar{R}$ of approximately $5 \%$ is required to obtain less than 1 LSB error about both lines most of the time. Calculated full scale SNR performance for the same set of converters is shown in Fig. 3.11. Note that converters which display an average $1 / 2$ LSB INL, exhibit an average SNR of about 41.6 dB .

In real flash converters the effect of linear processing gradients is potentially as, or more severe, than seemingly random variations. This is particularly true since most converters of this type are designed with a linear, or folded linear layout. The effect of a linear resistor gradient on maximum converter error is shown in Fig. 3.12a for an unfolded converter. Similarly, the effects of a resistor gradient upon the error of converters which are folded once and twice, are indicated in Fig. 3.12b. This same linear resistor gradient has been used for simulation of full-scale SNR performance and is shown in Fig. 3.13 for each of the converter topologies mentioned. As is apparent from all of these figures, interdigitating or folding of the converter layout will substantially reduce the degradation of converter performance due to a linear resistor gradient, even if the "folding" is only done once or twice.

Fig. 3. 10 Simulated maximum quantizing error of 1007 -bit flash converters as a function of resistor mismatch. Resistors are assumed to be normally distributed about their nominal value.


Fig. 3.11 Simulated SNR performance of 100 7-bit flash converters as a function of resistor mismatch.

Fig. 3.12a Calculated maximum quantizing error of a 7-bit flash converter which has a linear


Fig. 3.12b Calculated maximum best-fit quantizing errors of "folded" 7-bit flash converters as a function of linear resistor gradient.


Fig. 3.13 Calculated SNR performance of 7-bit flash converters as a function of 1 inear resistor graident and "folding."

### 3.2.3 Comparator Intrinsic Offset

If the $2^{n}-1$ comparators in a flash circuit exhibit a constant $D C$ offset, the effect on the converter's DC transfer function will be a simple offset shift by the same amount. If, however, the comparators exhibit a variation in offsets, converter nonlinearity will result. As this is in addition to resistor nonlinearity, the ultimate matching requirements of the comparators and resistors for a specific performance level are actually coupled together. In fact, the linearity expressions are now given by (in LSBs)

$$
I N L=\max \left|\frac{\sum_{i=1}^{k} R_{i}}{\sum_{i=1}^{2^{n}} R_{i}}+\frac{V_{o s_{k}}}{\left(V_{r e f}^{+}-V_{r e f}^{-}\right)}-\frac{k}{2^{n}}\right| \begin{align*}
& k=2^{n}-1  \tag{3.12}\\
& \cdot 2^{n} \\
& k=0
\end{align*}
$$

and

$$
\begin{equation*}
D N L=\max \left|\frac{R_{k}}{\sum_{i=1}^{2^{n}} R_{i}}-\frac{\left(v_{o s_{k-1}}-V_{o s_{k}}\right)}{\left(V_{r e f}{ }^{+}-V_{r e f}-\right)}-\frac{1}{2^{n}}\right|_{k=2^{n}}^{k=1} \tag{3.13}
\end{equation*}
$$

Note that for cases in which the DNL is greater than 1 LSB, a situation potentially much worse than a simple missing code may develop. If the combination of resistor nonlinearity and comparator offset is sufficiently large that comparator $k$ indicates that $V_{i n}>V_{k}$ but comparator $k-1$ still reads that $V_{i n}<V_{k-1}$, a disallowed internal logic state will occur. Depending upon the specific implementation, this may result in
neither code being represented at the converter output. In fact, in circuits which employ XOR gates and a ROM, the truncated algebraic sum of the two codes is even possible.

If the comparator offsets in a flash circuit are assumed to be normally distributed with mean $\overline{V_{0 S}}$, the required offset variance $\sigma_{0 S}^{2}$ may be determined for a real converter that is to have no missing codes a certain percentage of the time. The mean and variance of the sum of any two comparator offsets is given by simply $2 V_{0 S}$ and $2 \sigma_{0 S}^{2}$ respectively. A comparator offset induced missing code may develop if for any $i, V_{o s}$ $-V_{0 S_{i+1}} \mid>1$ LSB. Further, because there are nearly $2^{n}$ comparator pairs in an n-bit converter, the probability of a missing code occurring at all, is approximately $2^{n}$ times the probability of occurrence in any single pair. Thus, for example, if it is required that no missing codes occur $62 \%$ of the time in an 8 bit converter which has a 2 V dynamic range, then it is straightforward to show that the required offset distribution must be such that,

$$
\begin{equation*}
3 \sqrt{2 \sigma_{O S}^{2}} \leqq 1 L S B \tag{3.14}
\end{equation*}
$$

or

$$
\begin{equation*}
\sigma_{O S} \leqq 1.87 \mathrm{mV} \tag{3.15}
\end{equation*}
$$

This indicates that tight control of offsets is essential if good linearity is to be obtained with good yield. The requirement of 1.87 mV can probably be met in bipolar circuits if care is exercised in the comparator design (as evidenced by [39]), but this may be difficult to achieve in MOS converters even with offset cancellation techniques.

### 3.2.4 Dynamic Ladder Tap Loading

A potentially severe problem in flash circuits, arises due to dynamic loading of the precision ladder structure. As each tap from the resistor string is ultimately connected to a comparator, any input charge-pumping which occurs during normal comparator operation may modify apparent resistor linearity. This charge-pumping may occur due to a comparator offset cancellation cycle, or from comparator regeneration. Dingwall has shown that charge-pumping from offset cancelled comparators (see Chapter 5) may be modeled as a lumped reactance, and further, that for comparators in which $n \geq 6$ the ladder may be represented as a distributed R-Z circuit, as shown in Fig. 3.14 [40]. He has analyzed this network with respect to network linearity, and found that the maximum fractional error $\varepsilon_{\max }$ is given by

$$
\begin{equation*}
\varepsilon_{\max }=\left[V\left(x_{\max }\right)-x_{\max } \cdot V_{\text {ref }}\right] / V_{\text {ref }} \tag{3.16}
\end{equation*}
$$

where $x_{\max }$ is the position of maximum error described by

$$
\begin{equation*}
x_{\max }=\frac{1}{K} \cosh ^{-1}\left[\frac{v_{r e f}-v_{i n} \cdot k \cdot e^{-k x_{\max }}}{2 \cdot k \cdot c_{2}}\right] \tag{3.17}
\end{equation*}
$$

The function $V(x)$ in Eq. (3.16) is given by

$$
\begin{equation*}
V(x)=\frac{V_{r e f}\left(e^{K x}-e^{-K x}\right)}{e^{K}-e^{-K}}+V_{i n}\left[1-e^{-K x}-\frac{1-e^{-K}}{e^{K}-e^{-K}}\right] \tag{3.18}
\end{equation*}
$$

where

$$
\begin{equation*}
K=2^{n} \sqrt{\frac{R_{\text {TAP }}}{|z(\omega)|}} \tag{3.19}
\end{equation*}
$$



Fig. 3.14 Effect of comparator reset cycles on flash converter linearity. (a) Discrete model. (b) Distributed equivalent for a large number of tapes.

$$
\begin{equation*}
c_{2}=\frac{v_{r e f}-v_{i n}\left(1-e^{-K}\right)}{e^{K}-e^{-K}} \tag{3.20}
\end{equation*}
$$

and

$$
\begin{equation*}
c_{1}=\left(c_{2}+v_{i n}\right) \tag{3.21}
\end{equation*}
$$

In the above equations, $R_{\text {TAP }}$ is the resistance between taps in the reference ladder, $|z(\omega)|$ is the comparator input reactance at the comparator reset frequency and the worst case $V_{i n}$ for the ladder shown in Fig. 3.14 is at $V_{i n}=O V$. The tap loading requirements $\left(z(\omega) / R_{T A P}\right)$ for $1 / 2$ LSB linearity errors due to this source alone, are shown in Fig. 3.15 for commonly occurring resolution levels. Note that for a constant comparator input capacitance, the required reference power goes as approximately the square of the resolution level, i.e., 8 bit resolution converters will require 16 times the reference power of a 6 bit device for an equivalent conversion rate. This will have profound effects upon non-SOS converters which will suffer from non-negligible input parasitic capacitance.

An alternative flash operation cycle would be to only offset cancel the comparators during system dead times, rather than between every conversion cycle [41]. This should be feasible in video systems which normally do not use information transmitted during the horizontal line sync., or frame sync. periods, although this may not be possible or convenient in many data acquisition systems.

### 3.2.5 Clock Skew Between Comparators

If the comparators within a flash converter are not clocked at precisely the same time, a clock induced phase skew will develop between different comparators. As will be shown in Chapter 6, this results in a form of aliasing distortion, creating signal related noise spikes


Fig. 3.15 Minimum allowable $\mathrm{R}_{\mathrm{T}} / 2(\omega)$ as a function of bit resolution for $1 / 4$ LSB error due to charge pumping alone.
which are not usually harmonically related to the input frequency. The error power resulting from clock time skew for a sinusoidal input of frequency $\omega$ and amplitude $A$, may be approximated by

$$
\begin{equation*}
E_{p} \simeq \frac{A^{2} \omega^{2} \sigma_{t}^{2}}{2} \tag{3.22}
\end{equation*}
$$

where $\sigma_{t}$ is the variance of sampling period. For a $S / N$ ratio of 40 dB in a full scale sinusoidal signal of frequency $\omega$, this indicates that

$$
\begin{equation*}
\sigma_{t} \leq .0016 \cdot 2 \pi / \omega \tag{3.23}
\end{equation*}
$$

which for a 4.2 MHz video signal becomes $\phi_{t} \leq 380 \mathrm{psec}$. In systems of even lower bandwidth this requires that transmission and loading effects between different comparators be well matched in order to prevent phase skew effects from degrading converter performance.

### 3.2.6 Comparator Phase Delay

In many flash circuit applications, the converter is used without a preceding sample/hold circuit. In these cases the input is directly tied to the comparators, and all conversions are performed "on the fly." A potential problem with this approach is the effect of finite comparator bandwidth upon the reconstructed signal. In particular, because of this limited bandwidth, the comparators will cause nonlinear phase effects in the transmitted signal. In situations which must use relatively high comparator gain (prior to regeneration) this has a profound effect upon the required comparator bandwidth for a stated phase requirement. In fact, for a single pole comparator with pole frequency $p$,

$$
\begin{equation*}
\phi_{\mathrm{err}}(s)=\tan ^{-1}\left(\frac{s}{\mathrm{p}}\right) \tag{3.24}
\end{equation*}
$$

Thus for a $3^{\circ}$ maximum phase shift at the maximum input frequency, it must be the case that $\mathrm{p} / \mathrm{s} \geq 19$. For a 4.2 MHz video signal this
translates to a required pole frequency of nearly 80 MHz . Further, variations in comparator bandwidth in the array may create a reconstructed signal phase which is not constant over different signal levels, or which is phase "noisy." The high bandwidths discussed are difficult to achieve in MOS circuits (especially at low power levels) as the required comparator gains are typically fairly high. As a point of reference, the stated -3 dB frequency of the comparators in the only existing 8-bit monolithic flash converter (which is a bipolar circuit [42]) is 40 MHz .

### 3.3 Successive Approximation Technique

Perhaps the most common A/D conversion algorithm is the n-step sequential, or successive approximation technique. In this approach $n$-sequential approximations to the input voltage are made for an $n$-bit conversion, each typically a factor of two more accurate than the approximation made previously. ${ }^{6}$ In general, this algorithm can take on either of the forms shown in Fig. 3.16. The first of these, the routine employing a successively divided reference, is used in most successive approximation converters which employ $n$-stages of binary weighted components (or the equivalent, e.g., R-2R ladders) for an n-bit conversion. The second algorithm which employs a multiplied remainder scheme, is used in circuits which only have a few precision components, but which require the use of amplifiers [44]. As will be shown in Section 3.4, this latter technique is also useful in easily realizable pipeline converters.

As a converter employing a successive approximation algorithm uses typically $n(o r n+1$ ) cycles for an $n$-bit conversion, very high conversion

[^5]

Fig. 3.16a Successive approximation algorithm using reference division.


Fig. 3.16b Successive approximation algorithm using remainder multiplication.
rates may only be obtained in circuits fabricated in very fast processes. Although a high bandwidth monolithic converter has been realized at least once using this technique in a fast bipolar process [45], and appears to be possible in a few chip implementations built around a recently discussed high speed D/A [46], MOS implementations using this algorithms will be difficult or impossible to realize at conversion rates above a few MHz. This technique in one form appears to be quite useful in array configurations, however, as discussed in Chapter 6,. This particular method, which employs precision binary-weighted capacitors, is discussed in the next section.

### 3.3.1 Weighted Capacitor Methods

A relatively new $A / D$ converter technique that has gained considerable acceptance, is the weighted-capacitor charge-redistribution method. This technique, first introduced by McCreary and Gray [47] typically uses binary-weighted capacitors to carry out a divided reference algorithm similar to that given in Fig. 3.16a. An implementation of a circuit of this type which directly carries out this algorithm is shown in Fig. 3.17. Converter operation proceeds in the sequence shown in this figure. More specifically, converter sampling is performed by tying all of the capacitor bottom plates to the input voltage and shorting the top plate to a fixed voltage such as ground, or the comparator offset point. At the end of the sampling period the bottom plates are reconnected to analog ground while the top plates are allowed to float. This results in a voltage present upon the top plate which is roughly proportional to $-V_{i n}$. The relationship would be exact except for parasitic capacitances present on the top plate, which are usually somewhat voltage variable.


Fig. 3.17a Weighted capacitor A/D converter: Sample mode.


Fig. 3.17b Hold mode.


Fig. 3.17c Successive approximation cycling:

1. Determine sign of $V_{x}$

If $V_{x}<0$; Switch $S_{1}$ to $V_{\text {ref }}{ }^{+}, D_{1}=0$
If $V_{x}>0$; Switch $S_{1}$ to $V_{\text {ref }^{-},} D_{1}=1$
$V_{x}=-V_{\text {in }} \pm V_{r e f} / 2$
2. Determine sign of $V_{x}$

Converter operation proceeds by sequentially flipping each capacitor bottom plate to either $V_{\text {ref }}{ }^{+}$or $V_{\text {ref }}{ }^{-}$(starting with the largest capacitor) depending upon the sign of $V_{x}$. This sequence continues until $n$ capacitors have been switched, at the rate of one capacitor per time slot, for an n-bit plus sign conversion.

As this conversion method requires references to each polarity, and as it is designed for operation with bipolar input signals, it is not necessarily useful in all applications. A modification of this basic scheme which only uses one reference polarity and which operates on unipolar input signals, is shown in Fig. 3.18. In this circuit converter sampling is done as before, by tying the top plates of the capacitors to a fixed voltage such as ground, and the bottom plates to $V_{i n}$. After the sampling period, the bottom plates are all switched to ground and subsequently switched to $V_{\text {ref }}$ at the rate of one capacitor per clock period. After a capacitor has been flipped to $V_{\text {ref }}$, a sign determination is made upon the voltage present on the capacitor top plates. If the voltage is now above ground (or in many implementations, above the comparator offset point), the bottom plate is returned to ground during the next cycle. If, however, the top plate voltage is still below ground (or below the comparator offset point) the bottom plate is left at $V_{\text {ref }}$ during the remainder of the conversion cycle. Note that this procedure differs from the one presented previously in that sign testing here only begins after the first capacitor has been flipped, while previously it was performed before. Thus this latter procedure provides one-bit less resolution than the circuit of Fig. 3.17, for the same number of capacitors. This drawback is acceptable in many applications, however, as the bit cell is usually simpler and only one reference polarity is required.


Fig. 3.18a Unipolar input converter: Sample mode.


Fig. 3.18b Hold mode.


Fig. 3.18c Successive approximation cycling:

1. Switch $S_{1}$ to $V_{\text {ref }}$

Determine sign of $V_{x}$
If $V_{x}<0$; Leave $S_{1}$ on, $D_{1}=1$
If $V_{x}>0$; Switch back, $D_{1}=0$

$$
v_{x}=-v_{i n}+D_{1} v_{r e f} / 2
$$

2. Switch $S_{2}$ to $V_{\text {ref }}$

This circuit may be modified yet again to allow bipolar inputs to be accepted, but with still just a single polarity reference. This method is illustrated in Fig. 3.19. In this approach the largest capacitor is not connected to the input during the sample mode, but instead, has its bottom plate tied to the reference voltage. At the end of the hold interval the bottom plate is connected to ground if the voltage present on the top plate is greater than zero, otherwise it is left at $V_{\text {ref }}$. The remainder of the conversion sequence is then the same as in the unipolar case. Note that the end result of this procedure is a compression of the input range by nominally a factor of two, and a shift in the transfer characteristic by nominally $V_{\text {ref }} / 2$.

### 3.3.2 Capacitor Matching Requirement

The transition voltage between digital code $D$ and $D-1$ may be given by

$$
\begin{equation*}
V(D)=\frac{\sum_{i=1}^{n} D_{i} V_{\text {ref }}}{\sum_{i=1}^{n+1} C_{i}} \tag{3.25}
\end{equation*}
$$

for the unipolar converter of Fig. 3.18, and

$$
\begin{equation*}
V(D)=\frac{\sum_{i=2}^{n} D_{i} C_{i} V_{\text {ref }}}{C_{1}}-\frac{C_{1} D_{1} V_{r e f}}{\sum_{i=2}^{n} c_{i}} \tag{3.26}
\end{equation*}
$$

for the bipolar converter of Fig. 3.19 where $D_{i}$ indicates the $i$-th bit of code representing the number $D$. Using these results, the matching requirements for various distributions of capacitor error may be determined via simulation, as was done in the flash case. It has been shown


Fig. 3.19a Bipolar input converter: Sample mode.


Fig. 3.19b Hold mode.


Fig. 3.19c Successive approximation cycling:

1. Determine sign of $V_{x}$

If $V_{x}<0$; Leave $S_{1}$ on, $D_{1}=0$
If $V_{x}>0$; Switch back, $D_{1}=1$
$V_{x}=-V_{i n} / 2-D_{1} V_{r e f} / 2$
2. Switch $S_{2}$ to $V_{\text {ref }}$

Determine sign of $V_{x}$
If $V_{x}<0$; Leave $S_{2}$ on, $D_{2}=1$
If $V_{x}>0$; Switch back, $D_{2}=0$
$V_{x}=-V_{i n} / 2-D_{1} V_{r e f} / 2+D_{2} V_{r e f} / 4$
3. Switch $S_{3}$ to $V_{\text {ref }}$
experimentally that a least one type of precision capacitor (metal over $n+$ ) exhibits a nearly normal distribution of error [48]. If this is assumed as true, and if it is further assumed that all of the binary weightings are obtained by simply grouping the required number of unit cells together, the typical linearity and SNR performance is easily calculated (via program in Appendix A). The average and standard deviation of worst case error about the nominal and best fit line is shown in Fig. 3.20 for a 7-bit unipolar converter, and in Fig. 3.21 for a single reference bipolar type, for a number of different spreads in unit capacitor size. As is apparent from Fig. 3.20, the unipolar converter's fit to the nominal characteristic is nearly as good as to the best fit line. This contrasts to the bipolar case in Fig. 3.21 and the flash converter considered previously (Fig. 3.10). In these last two converter types, ratio errors will tend to modify best fit gain or offset, as well as result in nonlinearity. Of further interst in these figures, is the relative sensitivity of $\overline{\varepsilon_{\max }}$ to precision component ratio errors. For an average maximum error of $1 \mathrm{LSB}, \sigma_{C} / \bar{C}$ is approximately .06 for the unipolar converter, but is only .04 in the bipolar case. This compares to a required $\sigma_{R} / \bar{R}$ of approximately .06 for the flash converter. Thus, the bipolar input converter exhibits a slightly higher component matching requirement than either of the other two types.

The signal to noise ratio performance for the weighted-capacitor circuits is indicated in Fig. 3.22 for the same distributions of capacitor errors. As is apparent, the unipolar converter is again the slightly better performer of the two, but is not quite as good as the flash converter, whose performance was indicated in Fig. 3.11.


Fig. 3.20 Simulated worst case linearity error of 100 unipolar input weighted-capacitor $A / D$ converters as a function of unit capacitor mismatch.


Fig. 3.21 Simulated worst case linearity error of 100 bipolar input weighted-capacitor $A / D$ converters as a function of unit capacitor mismatch.


Fig. 3.22 Simulated SNR performance of 100 unipolar (a) and 100 bipolar (b) input weighted-capacitor $A / D$ converters as a function of unit capacitor mismatch.

### 3.3.3 Speed Limitations

The ultimate speed limitations of real binary-weighted capacitor circuits, tends to be dominated by simple R-C time constants associated with the largest (most significant) capacitors. Depending upon the implementation, the worst case charging problem is usually during the converters sampling period. It is during this time slot that most or all of the capacitors are being tied across $V_{i n}$ and ground, or $V_{i n}$ and the comparator offset point. An equivalent circuit for this situation is shown in Fig. 3.23a. In this figure it is assumed that the switches to each capacitor are scaled with the capacitor size. This usually isn't practical or desirable beyond the most significant few bits, but gives a conservative estimate of the required charging time for a stated precision level. Resistor $R_{c}$ in this figure represents the top plate switch resistance, or the equivalent input impedance of the comparator during it's offset cancellation cycle, while $R_{s}$ is the switch impedance of the input bus selector. This network may be modeled as a simple R-C circuit as shown in Fig. 3.23b, where the equivalent time constant becomes,

$$
\begin{align*}
\tau & \simeq\left(R_{c}+R_{s}+R / 2\right) \sum_{i=1}^{n+1} C_{i}  \tag{3.27}\\
& \simeq R_{e q} C_{T} \tag{3.28}
\end{align*}
$$

which for a settling accuracy of $1 / 2$ LSB indicates that a minimum acquisition time is needed of the amount

$$
\begin{equation*}
\tau_{a q} \geq(n+1) R_{e q} C_{T} \ln 2 \tag{3.29}
\end{equation*}
$$

If instead of a constant $D C$ level, the input signal is a sinusoid of angular frequency $\omega$, it is easy to show that this network produces an amplitude roll-off of the amount


Fig. 3.23a Weighted-capacitor R-C charging model.


Fig. 3.23b Lumped R-C equivalent.

$$
\begin{equation*}
\frac{v_{\text {in }}^{\prime}}{V_{\text {in }}} \simeq \frac{1}{1+\omega^{2} \tau_{\mathrm{aq}}^{2}} \tag{3.30}
\end{equation*}
$$

where $V_{i n}^{\prime}$ is the sampled value across the capacitors. This network also produces a phase shift in the sampled quantity of the amount

$$
\begin{equation*}
\phi_{\mathrm{err}} \simeq \tan ^{-1}\left(\omega \tau_{\mathrm{aq}}\right) \tag{3.31}
\end{equation*}
$$

As indicated previously, nonlinear phase shift may be deleterious to system operation. Both the phase shift and amplitude rolloff effects may be minimized by using the bipolar double-reference circuit of Fig. 3.17, which uses the smallest amount of capacitance for a given resolution level. This arrangement requires more complicated bit cells, however, which are typically larger and slower than those of either of the other two circuits. Because of this, use of this circuit will not necessarily result in a higher conversion rate. Alternately, the bipolar input converter of Fig. 3.19 may be used. This circuit has the advantage of simple bit cells and is generally faster than the unipolar converter, as the MSB capacitor only goes through one switch to a voltage source, rather than through a pair of switches in series. Although this circuit is not as easily analyzed as its unipolar relation, it is possible to show that the use of this technique has reduced the required sampling time by $20 \%-30 \%$ in a real circuit [49].

Another approach for reducing the charging time constants, which is quite effective in high resolution converters, is to use a split-capacitor array as shown in Fig. 3.24. Although this circuit is functionally equivalent to a binary-weighted array, it requires a substantially smaller total capacitance. This circuit is different from the previously discussed weighted-capacitor circuits, however, in that converter linearity is dependent upon the parasitic capacitance on the capacitor top plates
(in this case in the low weight array). In fact the minimum unit capacitor size in this circuit will usually be determined by the diffusion capacitance from the top plate grounding switch. This capacitance will tend to produce a gain error in the weight of the lower precision array if it is relatively constant, and a nonlinear "bowing" if it is voltage variable. The constant gain error may be compensated for by modifying the weight of coupling capacitor $C_{x}$, while voltage variable effects may be minimized by employing a complementary switch which may be designed to have a small voltage coefficient of capacitance in a given bias region. Similarly, the voltage excursions on $V_{y}$ may be minimized by intentionally employing a dummy MOS capacitor in parallel with $C_{p}$ and scaling $C_{x}$ to maintain a constant lower array gain. This will tend to reduce the nonlinearity error created by a given voltage variable switch capacitance. One final means of improving operation during the sample mode is to pre-sample the input with a separate sample and hold circuit which has a small aperature time and good high frequency properties. This sample and hold circuit may subsequently be used to charge the capacitor array over as long of a time as necessary. Although this results in an apparently slower conversion rate, the actual conversion rate may be faster if shorter time slots may be used for the remainder of the successive approxmiation cycle. This will occur in synchronous converters if the sampling period was forcing the use of longer time slots than were otherwise necessary. In addition, this procedure will reduce the rolloff and phase shift effects caused by directly coupling the capacitor array to the input. In particularly split-array converters, this sample and hold circuit is usually easier to design than circuits designed for use with flash converters, as the required capacitive loads are usually much less.

### 3.4 Pipeline Converters

A characteristic of the converter types previously presented, is that very little of the total circuitry is "active" at any given time. For example, only two of the $2^{n}-1$ comparators in a flash circuit will actually determine the output code. Similarly, in a successive approximation converter, only one or two of the $n$ different bit cells are involved in the conversion process during any time slot. It would be desirable if a technique could be developed which would use more of the converter circuitry a greater percentage of the time. This in theory at least, should allow a higher converter bandwidth to be realized.

One approach is to modify the previously presented successive approximation algorithms into the form presented in Fig. 3.25. In this method distinct circuitry exists for each bit of the conversion process, but no bit cell is idle during any time slot. This is accomplished by having each bit cell operating on a different sample during each clock period and synchronously passing the conversion remainder onto subsequent stages. For an n-bit converter this results in an effective throughput of one conversion per clock cycle, but with a time delay of typically n periods. A difficulty with this approach, however, particularly in high resolution converters, is the problem of achieving good isolation between bit cells without introducing distortion. In general this requires the use of high gain linear amplifiers between stages, although buffering with simpler techniques (e.g., source followers) appears possible in low resolution applications.

Three possible implementations of a pipeline are presented in the next two sections. The first two of these involve the divided reference algorithm of Fig. 3.16a, while the last approach uses the multiplied


Fig. 3.25 Pipeline converter method.
remainder algorithm of Fig. 3.17b. As will be demonstrated, the divided reference methods are useful in low resolution converters (although they require great care in the design), while the multiplied remainder technique is potentially useful in even high precision applications.

### 3.4.1 Divided Reference Pipeline

A straightforward implementation of a pipeline which uses the divided reference successive approximation algorithm, is shown in Fig. 3.26. In this circuit a succession of precision capacitor pairs are used to generate the necessary reference voltage fractions and the required additions and subtractions. Isolation between stages is provided by voltage followers which allow the capacitors of subsequent stages to be charged without a voltage loss. The voltage followers may be low distortion, low offset source followers (or the equivalent, see Chapter 6) or fast settling linear amplifiers connected in a unity gain configuration.

A difficulty with this circuit is that it is sensitive to parasitic capacitance on each integrating node (represented by the lumped capacitance $C_{p}$ ). The effect of this capacitance, if constant, is a gain rolloff of the summed reference fraction. Specifically, after the first summation, the voltage present on $V_{x}$ is given by

$$
\begin{equation*}
v_{x 1}=v_{i n} \pm \frac{B_{1} C v_{r e f}}{\left(A_{1}+B_{1}\right) C+C_{p}} \tag{3.32}
\end{equation*}
$$

which for proper converter operation requires that

$$
\begin{equation*}
\frac{B_{1} C}{\left(A_{1}+B_{1}\right) C+C_{p}}=1 / 2 \tag{3.33}
\end{equation*}
$$

or


| $\phi$ | $v_{x_{0}}$ | $v_{x_{1}}$ | $v_{x_{2}}$ |
| :---: | :---: | :---: | :---: |
| 1 | $v_{i n_{1}}$ | $x$ | $x$ |
| 0 | $"$ | $\left.v_{i n_{1}} \pm B_{1} c_{r} /\left(A_{1}+B_{1}\right) C+C_{p}\right)$ | $x$ |
| 1 | $v_{i n_{2}}$ | $"$ | $v_{i n_{1}} \pm B_{1} C v_{r} /\left(\left(A_{1}+B_{1}\right) C+C_{p}\right) \pm B_{2} v_{r} /\left(\left(A_{2}+B_{2}\right) C+C_{p}\right)$ |
| 0 | $"$ | $\left.v_{i n_{2}} \pm B_{1} c_{r} /\left(A_{1}+B_{1}\right) C+C_{p}\right)$ | $"$ |
| 1 | $v_{i n_{3}}$ | $"$ | $v_{i n_{2}} \pm B_{1} C v_{r} /\left(\left(A_{1}+B_{1}\right) C+C_{p}\right)+B_{2} C v_{r} /\left(\left(A_{2}+B_{2}\right) C+C_{p}\right)$ |

Fig. 3.26 Divided reference pipeline employing rationed capacitors and voltage followers. (Top) Pipeline schematic. (Bottom) Time-state definitions where time progresses downward.

$$
\begin{equation*}
A_{1}=B_{1}-C_{p} / C \tag{3.34}
\end{equation*}
$$

If it is desired that $A_{1}+B_{1}=2$, then

$$
\begin{equation*}
A_{1}=1-C_{p} / 2 C \tag{3.35}
\end{equation*}
$$

and

$$
\begin{equation*}
B_{1}=1+C_{p} / 2 C \tag{3.36}
\end{equation*}
$$

In fact, for any stage $i$, which is to generate a reference fraction of the amount $V_{\text {ref }} / 2^{i}$,

$$
\begin{equation*}
A_{i}=\frac{2^{i+1} C-2 C-C_{p}}{2^{i} C} \tag{3.37}
\end{equation*}
$$

and

$$
\begin{equation*}
B_{i}=\frac{2 C+C_{p}}{2^{i} C} \tag{3.38}
\end{equation*}
$$

A problem arises in this circuit, however, if $C_{p}$ has a voltage dependent component (as it invariably does). This voltage dependence is usually caused by the junction depletion capacitance of the switch on the capacitor top plate. As mentioned previously, this dependence may be reduced by employing a complementary top plate switch which may be designed to exhibit a small voltage dependence of capacitance over a modest voltage range. In precision applications, however, which require converter resolutions to greater than 5 or 6 bits, this procedure may not be sufficient to ensure adequate linearity. In these cases the circuit may be modified, as shown in Fig. 3.27.

In this circuit each stage of the converter is based upon a capacitively coupled multiplier/summer circuit which uses a fast settling inverting amplifier. This circuit may be used with constant sized

(b)

| $\phi$ | $V_{\times 0}$ | $V_{x_{1}}$ | $v_{x_{2}}$ |
| :---: | :---: | :---: | :---: |
| 1 | 0 | x | 0 |
| 0 | $\mathrm{V}_{\text {in, }}$ | 0 | X |
| $\stackrel{E}{=} 1$ | 0 | $v_{\text {in, }} \pm v_{r / 2}$ | 0 |
| $\cdots$ | $V_{\text {in } 2}$ | 0 | $V_{\text {in }} \pm V_{r / 2} \pm V_{r / 4}$ |
| 1 | 0 | $\mathrm{Vin}_{2} \pm \mathrm{V}_{\mathrm{r} / 2}$ | 0 |

Fig. 3.27 Divided reference pipeline using linear amplifiers. (a) Pipeline schematic. (b) Time-state definitions.
capacitors and binary-scaled reference sources (generated by an R-2R ladder for instance), or used with only a positive and negative reference and scaled capacitors. As the amplifiers effectively eliminate the effect of voltage dependent parasitic capacitance on linearity, good linearity may be obtained with even large voltage swings. In addition, the amplifiers maintain a unity gain addition or subtraction of the reference voltage, thus eliminating the need for scaling of voltages by peculiar fractions. Operation of this circuit proceeds in the manner shown in Fig. 3.27b. As is apparent from the figure, each amplifier is reset during alternate time slots. This maintains the amplifiers within their normal operating regions (even in the presence of leaky junctions) and allows each stage to operate as an isolated sample and hold. As in the previous circuit, the time required for a complete $n$-bit conversion is only $n / 2$ clock cycles, due to switches on adjacent stages being run from opposite clock phases. Addition or subtraction of the fractional reference voltage is accomplished most quickly if both positive and negative references are available. In this case the capacitors to the reference fractions are simply kept at ground during the amplifier reset cycles and switched to plus or minus $V_{r} / 2^{i}$, depending upon whether subtraction or addition is desired. This same function may be achieved with but one reference polarity if the reference input capacitor can be precharged to ground or $V_{r} / 2^{i}$ during the amplifier reset cycle, based upon the comparator result from the previous stage. This requires more time, however, and is thus not as attractive in high speed applications.

Practically speaking, the circuit of Fig. 3.27 has several problems. First, the generation of fractional reference voltages tends to make
the circuit somewhat cumbersome, whether the fractions are generated by capacitor weightings or R-2R circuits. Second, reset switch feedthrough and amplifier noise appears as an unattenuated error in the conversion remainder, which is made worse at every stage. This can easily result in poor linearity in the converter transfer characteristic. A preferred embodiment of a pipeline converter, which does not have such severe problems is discussed below.

### 3.4.2 Multiplied Remainder Pipeline

A slight modification of capacitor ratios in the circuit of Fig. 3.27, allows the multiplied remainder algorithm to be carried out. ${ }^{7}$ This circuit, which is shown in Fig. 3.28 , is in many ways preferrable to the divided reference pipeline converter in high resolution applications. As the circuit multiplies the remainder after each stage, no binary scaling of the reference voltage is necessary. Further, the remainder multiplication reduces the effect of switch feedthrough and amplifier noise by a factor of two at every stage. Thus the problem of feedthrough induced nonlinearity and amplifier related converter noise is principally limited to the first few stages. The required comparator resolution is also reduced after the first stage, possibly allowing the use of smaller and simpler comparators in the least significant bit positions.

[^6]
(b)

| $\phi$ | $v_{x_{0}}$ | $v_{x_{1}}$ | $v_{x_{2}}$ |
| :---: | :---: | :---: | :---: |
| 1 | 0 | $x$ | 0 |
| 0 | $v_{i n_{1}}$ | 0 | $x$ |
| 1 | 0 | $2 v_{i_{1}} \pm v_{r}$ | 0 |
| 0 | $v_{i n_{2}}$ | 0 | $2\left(2 v_{i n_{1}} \pm v_{r}\right) \pm v_{r}$ |
| 1 | 0 | $2 v_{i_{i_{2}}} \pm v_{r}$ | 0 |

Fig. 3.28 Multiplied remainder pipeline. (a) Pipeline schematic. (b) Time-state definitions.

### 3.4.3 Pipeline Performance Limitations

As all of the pipeline circuits presented employ some form of linear buffer circuitry, the settling time of these circuits will most directly effect the maximum conversion rate. The design of fast settling circuits which are suitable for use in pipeline converters, and their ultimate speed limitations are discussed in Chapter 5.

The ultimate resolution of the first pipeline considered (shown in Fig. 3.26), is affected by the voltage variablilty of the parasitic capacitance on the top plate and drift in this capacitance from the nominal design value (used in determining the required capacitor ratios). Circuit linearity is also affected by buffer nonlinearity and offset, (particularly if the buffer is not a linear amplifier employing feedback), and switch feedthrough. The result of all of these problems taken together is that this circuit will probably not be useful in high speed converters of even modest resolution unless great care is taken in every aspect of the design.

The resolution of the improved divided reference circuit of Fig. 3.27, will typically be dominated by switch feedthrough effects, as the myriad of parasitic capacitance and buffer nonlinearity problems will have been eliminated. In general, however, the resolution of this circuit will be less than the multiplied remainder circuit of Fig. 3.28. As the noise and feedthrough effects of this last circuit of Fig. 3.28. attenuated at every stage, the result of these nonidealities upon overall circuit linearity is greatly reduced. In fact, with regard to switch feedthrough alone, it is easy to show that $1 / 2$ LSB INL will be obtained if

$$
\begin{equation*}
\sum_{i=2}^{n} v_{f t_{i}}{ }^{(i-2)}<1 \text { LSB } \tag{3.39}
\end{equation*}
$$

where $V_{f t}$ represents the feedthrough resulting from the $i$ th stage. If the above constraint can be met or exceeded, the ultimate linearity obtainable will probably be limited by simple capacitor matching or comparator offset skews. Note that this latter problem, that of comparator offset skew, may be virtually eliminated by a variety of techniques which are discussed in Chapter 5 . This multiplied remainder pipeline scheme, appears to offer considerable promise in high resolution converters, either by itself, or in a hybrid configuration as discussed in the next section.

### 3.5 Hybrid Converters

Each of the previously discussed conversion techniques have characteristics which make their use desirable in certain systems. In general, however, every simple method for performing an A/D conversion becomes difficult to realize as either the required precision or the required speed increases. For example, a flash converter will require an inordinate die size for resolutions of greater than 6 or 7 bits while pipeline converters will increasingly suffer from switch feedthrough effects as the switch size is increased and the precision capacitor sizes are decreased in order to improve speed. In certain cases, some of these problems may be avoided by combining several converters (of the same or different types) into a single composite or hybrid converter. Several of these circuits are discussed in this section.

### 3.5.1 Two-Step Flash Converter

A fairly commonplace high-speed converter method is the 2-step or half flash technique. In this approach, a pair of flash circuits (of typically about $n / 2$ bits resolution) are used in sequence to obtain a fast $n$-bit conversion, as shown in Fig. 3.29. Although this method only allows a bandwidth of approximately half that of the straight flash approach, it can be made considerably smaller in high resolution applications. This technique has been demonstrated in a partially monolithic current-input converter fabricated in a double-level metal bipolar process [53].

The design considerations in a converter of this type are in many ways similar to those of the simple flash converter with the addition of matters relating to the D/A and differencing circuit. The resistor linearity requirement upon the first flash circuit is the same as if it were a complete n-bit converter, even though its actual resolution is only approximately $n / 2$ bits. The required comparator performance, however, is considerably reduced for the requirement of monotonic behavior. Problems relating to charge pumping within the precision ladders are also greatly reduced, as the number of comparators per flash circuit is so much smaller. Depending upon time delays present in the D/A and subtractor circuit, the half-flash circuit may have a greater need for an input sample and hold circuit than the straight flash approach. With a high frequency input signal the half-flash circuit of Fig. 3.29, if used without a sample and hold, can easily produce grossly distorted results. One method for alleviating this problem has been to combine the first flash circuit, the D/A converter and the subtractor into a single high speed circuit [54]. By minimizing the total delay through

Fig. 3.29 Two-step flash converter method.
this path, acceptable performance has been obtained with even broadband input signals.

A slight modification of the basic half-flash circuit should allow improved bandwidth to be obtained. In the circuit as just presented, the first flash converter must wait until the D/A and subtractor have fully settled and the second flash converter has finished sampling before it can proceed with subsequent conversions. If an analog sampler is incorporated into the forward differencing path, and if a latch is placed after the first flash circuit, the overall converter circuitry effectively becomes pipelined (Fig. 3.30). This allows the operation of the first flash circuit to overlap with that at the DAC/subtractor and second flash circuits, resulting in an improved throughput. Further, this technique should greatly reduce the need for a separate sample and hold circuit prior to the first flash converter.

### 3.5.2 Flash-Pipeline Converter

As dissussed in the previous sections of this chapter, the flash converter method becomes rather unwieldy as the required resolution increases above 6 or 7 bits. Similarly, for the pipeline converters discussed in Section 3.4, the effect of switch feedthrough becomes an increasingly serious problem at high resolution levels. An interesting converter approach which takes advantage of the favorable aspects of each converter type, yet which is not deleteriously affected by the problems mentioned above, is shown in Fig. 3.31. In this circuit a flash front-end is used in conjunction with a high-speed pipeline to obtain a single fast converter. Operation of this converter begins by having the input stage of the pipeline, and each of the flash comparator

Fig. 3.30 Pipelined two-step flash converter.
circuits "track" the input voltage. After the track mode, the flash converter proceeds to determine which voltage is closest but not greater than the input voltage. After this has occurred, the input capacitor to the pipeline is switched to the tap in the middle of the next higher resistor segment. This creates an amplified difference voltage which can subsequently be digitized by the pipeline circuitry.

Note that because the resistor string is inherently monotonic, the converter as a whole will display a monotonic characteristic if the pipeline circuitry does not display a nonmonotonicity by itself. Although the charging times of the pipeline input capacitor being switched to the appropriate ladder tap will probably limit the flash part verter to a relatively small number of bits, this will extend the useful range of the pipeline method by an equivalent amount. Further, because flash circuits of low resolution are small, this technique will not greatly increase the die size from that required by the pipeline circuitry alone.

## CHAPTER 4

## PRECISION ELEMENTS

Nearly all analog-digital conversion circuits require a precision analog multiplier or divider, which is at least as accurate as the converter itself. Thus a 10-bit conversion circuit will usually require precision components which match to a minimum of 10-bits (and usually more). A common means of implementing this precision multiplier/ divider circuit is with a pair (or more) of matched resistors or capacitors. Various resistor considerations including matching, voltage and temperature dependent behavior are presented in Sections 4.1-4.5, while capacitor related material is covered in Sections 4.6-4.9.

### 4.1 Resistor Matching Considerations

Depending upon the specific integrated circuit (IC) process, diffused, thin film or polysilicon resistors may be available. In certain cases, MOSFETs themselves may be used as a source of matched transresistance. Consider the general pair of matched resistors shown in Fig. 4.1. If the relative conductance is assumed homogeneous in the $y-z$ plane, then the value of either resistor may be given by:

$$
\begin{align*}
R & =\frac{L}{W \int_{0}^{x} L} \sigma(x) d x  \tag{4.1}\\
& =\frac{L}{W x_{L} \bar{\sigma}} \tag{4.2}
\end{align*}
$$

where $\bar{\sigma}$ is the average conductivity in the resistor. If an uncertainty $\delta$


Fig. 4.1 Matched resistor pair.


Fig. 4.2 Interdigitated resistor pair. Cross-coupled structure reduces the effect of linear gradients in resistor matching.
is assigned to each resistor parameter, a potential error will result in the final resistor value of the amount

$$
\begin{equation*}
\frac{\delta R}{R} \simeq\left[\left(\frac{\delta W}{W}\right)^{2}+\left(\frac{\delta L}{L}\right)^{2}+\left(\frac{\delta x_{L}}{x_{L}}\right)^{2}+\left(\frac{\delta \bar{\sigma}}{\sigma}\right)\right]^{1 / 2} \tag{4.3}
\end{equation*}
$$

where it is usually the case that the relevant uncertainty is the spread within the precision components on a single die, and not the variation across different wafers or lots. Further, it is generally true that $\delta W / W$ will dominate $\delta L / L$, as most resistors have an aspect ratio greater than one ( $L / W>1$ ). Variations in $W$ and $L$ may be due to dopant inhomogeneities and crystal defects (particularly in diffused resistors) or variations in deposition rate and uniformity (thin-film or poly resistors) or grain size (poly resistors). In addition, $\bar{\sigma}$ is quite sensitive to stress effects [55], which may seriously affect component matching after packaging. Resistor matching of from $.23 \%$ for diffused resistors to $.12 \%$ for implanted resistors [56] has commonly been reported, although one report indicates that diffused resistor matching to $.01 \%$ is at least possible [57]. Typically, resistor components which must match to greater than 8 or 9 bits are trimmed by any of a number of techniques to the required accuracy [58-60]. As a point of reference, $.1 \%$ resistor matching in $50 \mu \mathrm{~m}$ wide resistors, indicates an edge uncertainity of less than $.05 \mu \mathrm{~m}$. The effect of random edge variations may only be reduced by using larger resistor geometries, although linear gradients may be compensated by using an interdigitized structure such as that shown in Fig. 4.2. The purpose of the "dummy" resistors at each end of the array, is to maintain a uniform environment for all of the resistors during processing.

### 4.2 Characteristics of Planar Diffused Resistor Structures

Of the variety of resistor types available in an IC process, a diffused structure is often the most advantageous to use, as no process modification is necessary and the resistivity may be well controlled (at least in ion-implanted types). The planar characteristics of diffused resistors with gaussian impurity profiles are examined in this section, while additional effects, resulting from sidewall outdiffusion are considered in Section 4.4.

In diffused resistors, the effective dimensions are somewhat bias dependent, due to the junction depletion region extending a voltage variable distance into the resistor. This situation is illustrated in Fig. 4.3. The resistors value may still be determined from Eq. (4.2), however, corrections must be made for $X_{L}$.

An implanted resistor which has been diffused into the background material usually exhibits a gaussian impurity profile

$$
\begin{equation*}
N_{c}(x) \simeq \frac{Q_{I}}{\sqrt{\pi D T}} e^{-x^{2} / 4 D T} \tag{4.4}
\end{equation*}
$$

where $Q_{I}$ is the implant dose in ions $/ \mathrm{cm}^{2}, D$ is the effective impurity diffusivity over the length of the drive in, and $T$ is the drive in time. For a background doping of $N_{B}$ ions/ $\mathrm{cm}^{3}$ of an opposite impurity type, this results in a junction depth of

$$
\begin{equation*}
x_{j} \simeq\left[-4 D T \ln \left(\frac{N_{B}}{Q_{I}} \sqrt{\pi D T}\right)\right]^{1 / 2} \tag{4.5}
\end{equation*}
$$

The effective resistor depth is actually reduced from the junction depth by the portion of the depletion region on the resistor side of the metallurgical junction. Several workers have developed relatively simple


Fig. 4.3 Cross-section of a diffused resistor under bias, showing incursion of the depletion region into the diffusion.
means for evaluating the various depletion widths in certain types of diffused junctions [61-62], however, neither is as complete or as accurate as the original treatment by Lawrence and Warner [63]. Using a similar procedure (Appendix B.1), it may be shown that the electric field in the resistor (or left) depletion region, is given by

$$
\begin{align*}
E_{p}(x)= & \frac{q N_{s}}{\varepsilon_{s}}\left[\sqrt{\pi D T}\left[\operatorname{erf}\left(\frac{x}{\sqrt{4 D T}}\right)-\operatorname{erf}\left(\frac{x_{L}}{\sqrt{4 D T}}\right)\right]\right. \\
& \left.-\frac{N_{B}}{N_{s}}\left(x-x_{L}\right)\right] \tag{4.6}
\end{align*}
$$

while in the substrate (or right) side

$$
\begin{align*}
E_{2}(x)= & \frac{q N_{s}}{\varepsilon_{s}}\left[\sqrt{\pi D T}\left[\operatorname{erf}\left(\frac{x}{\sqrt{4 D T}}\right)-\operatorname{erf}\left(\frac{x_{R}}{\sqrt{4 D T}}\right)\right]\right. \\
& \left.-\frac{N_{B}}{N_{s}}\left(x-x_{R}\right)\right] \tag{4.7}
\end{align*}
$$

where $x_{L}$ and $x_{R}$ represent the left and right edges of the depletion region. For a specified $x_{R}$ (or $x_{L}$ ), the opposite side of the junctions depletion region may be determined by solving for the case $E_{p}\left(x_{j}\right)=E_{2}\left(x_{j}\right)$. The electrostatic potential on each side of the junction, is then found to be (Appendix B.1)

$$
\begin{align*}
V_{1}= & \frac{-q N_{s}}{\varepsilon_{s}}\left[\sqrt{\pi D T} x_{j}\left(\operatorname{erf}\left(\frac{x_{j}}{\sqrt{4 D T}}\right)-\operatorname{erf}\left(\frac{x_{L}}{\sqrt{4 D T}}\right)\right)\right. \\
& +2 D T\left(e^{-x_{j}{ }^{2} / 4 D T}-e^{-x_{L}^{2} / 4 D T}\right) \\
& \left.-\frac{N_{B}}{2 N_{s}}\left(x_{j}-x_{L}\right)^{2}\right] \tag{4,8}
\end{align*}
$$

and

$$
\begin{align*}
V_{2}= & \frac{q N_{s}}{\varepsilon_{s}}\left[\sqrt{\pi D T} x_{j}\left(\operatorname{erf}\left(\frac{x_{j}}{\sqrt{4 D T}}\right)-\operatorname{erf}\left(\frac{x_{R}}{\sqrt{4 D T}}\right)\right)\right. \\
& +2 D T\left(e^{-x_{j}^{2} / 4 D T}-e^{-x_{R}^{2} / 4 D T}\right) \\
& \left.-\frac{N_{B}}{2 N_{s}}\left(x_{j}-x_{R}\right)^{2}\right] \tag{4.9}
\end{align*}
$$

The total potential across the junction is then simply

$$
\begin{equation*}
v_{T}=v_{1}+v_{2} \tag{4.10}
\end{equation*}
$$

which equals the combined built-in voltage and applied reverse bias, or

$$
\begin{equation*}
V_{T}=\phi+V_{B} \tag{4.11}
\end{equation*}
$$

where

$$
\begin{equation*}
\phi=\frac{k T}{q} \quad \ln \left[\frac{N\left(x_{L}\right) N\left(x_{R}\right)}{n_{i}^{2}}\right] \tag{4.12}
\end{equation*}
$$

By iteration, the above may be solved for $X_{L}$ and $X_{R}$ given an applied $V_{B}$ (Appendix B.3).

To complete the resistivity determination, it is necessary to assume a specific form for the mobility and solve Eq. (4.1). Defining the mobility to be ${ }^{1}$

$$
\begin{equation*}
\mu_{c}(x)=K N_{c}^{m}(x) \tag{4.13}
\end{equation*}
$$

where $K$ is always positive and typically $0>m>-.5$, Eq. (4.1) then becomes

$$
\begin{equation*}
R=\frac{L}{W}\left[\int_{0}^{x} q K N_{s}^{m} e^{-m x^{2} / 4 D T}\left(N_{S} e^{-x^{2} / 4 D T}-N_{B}\right) d x\right] \tag{4.14}
\end{equation*}
$$

This is a similar expression to that used by Irwin [64] for piecewise mobility approximations; $\mu_{C}(x)=K\left[N_{C}(x)-N_{B}\right]$. Irwin's expression leads to more involved integrations, however, and is apparently less applicable when $N_{C}(x) \simeq N_{B}$.

$$
\begin{align*}
\frac{1}{R} \frac{R}{\partial V_{B}} & =\frac{L}{W R} \frac{\partial x_{L}}{\partial V_{B}} \frac{\partial}{\partial x_{L}}\left[\int_{0}^{x_{L}} \sigma(x) d x\right]^{-1}  \tag{4.22}\\
& =-\frac{L}{W R} \frac{\partial x_{L}}{\partial V_{B}} \frac{\sigma\left(x_{L}\right)}{x_{L}^{2} \sigma^{2}}  \tag{4.23}\\
& =-\rho_{S}\left(V_{B}\right) \frac{\partial x_{L}}{\partial V_{B}} \sigma\left(x_{L}\right) \tag{4.24}
\end{align*}
$$

where $\rho_{S}\left(V_{B}\right)$ is given by Eq. (4.21) $\left(\rho_{S}\left(V_{B}\right)=W /\left(L G\left(V_{B}\right)\right)\right.$ ) and $\partial x_{L} / \partial V_{B}$ must be determined numerically.

Plots of calculated $\rho_{s}$ and normalized bias voltage coefficients for several CMOS planar well resistors (which assume $W \gg x_{j}$ ) are shown in Fig. 4.4.

### 4.3 Characteristics of Diffused Resistors Due to Sidewalls

In diffused resistors where it is not the case that $W \gg x_{j}$, the effects of impurity outdiffusion from the sides and ends of the mask cut must be considered. This outdiffusion may modify both the resistivity and the voltage coefficient, in some cases dramatically. A simple approximation for this situation is to assume that the outdiffused sidewalls are cylindrical, while the four corners are spherical [65], as shown in Fig. 4.5.

For predepositions from a gaseous source, the total deposited impurity " $Q_{T}$," will usually increase, due to enhanced diffusion away from the surface near the mask edges. For implanted depositions, however, the total dose remains the same, as this is only determined by the mask opening size and implant density. This results in a lower average impurity concentration as the dose is now distributed over a larger volume (planar region plus sidewalls). This situation will be considered in detail.


Fig. 4.4 Normalized planar resistivity bias dependence (top) and bias voltage coefficient (bottom) for threp typical CMOS well ${ }_{8}$ resistors. In curve (a̧); $Q_{I}=1.75 \times 10^{\rho 2} / \mathrm{cm}$. , $D T=1.95$ $\times 10^{-8} \mathrm{~cm}$. and $\mathrm{N}_{\mathrm{B}}=5 \times 10^{4} / \mathrm{cm}$. . Multiply values by 2 for curve (b) and by 4 for curve (c).


Fig. 4.5 Geometrically simple model of impurity outdiffusion in a diffused resistor.

For a mask opening of length $L$ and width $W(L \gg W)$,

$$
\begin{equation*}
2 Q_{\text {side }}+Q_{I} L W \simeq Q_{T} L W \tag{4.25}
\end{equation*}
$$

where $Q_{I} L W$ is the total charge in the planar region after outdiffusion, $Q_{\text {side }}$ is the charge present in each cylindrical sidewall and $Q_{T}$ is the total ion dose in ions $/ \mathrm{cm}^{2}$. Note that the constraint $L \gg W$ allows elimination of the corner and endwall outdiffusion terms.

The total dose in one sidewall, is given by

$$
\begin{equation*}
Q_{\text {side }}=L \int_{0}^{\pi / 2} d \phi \int_{0}^{\infty} r N_{c}(r) d r \tag{4.26}
\end{equation*}
$$

which for a gaussian impurity distribution reduces to

$$
\begin{equation*}
Q_{\text {side }}=\pi N_{s} D T L \tag{4.27}
\end{equation*}
$$

where

$$
\begin{equation*}
N_{s}=\frac{Q_{I}}{\sqrt{\pi D T}} \tag{4.28}
\end{equation*}
$$

By combining terms,

$$
\begin{equation*}
Q_{I}=\frac{Q_{T}}{(1+2 \sqrt{\pi D T} / W)} \tag{4.28}
\end{equation*}
$$

Thus $Q_{I}$ is reduced from $Q_{T}$ by an amount which is dependent upon $W$. The junction depth will also be reduced from the planar case, and is given by

$$
\begin{equation*}
x_{j}=\left[-4 D T \ln \left[\frac{N_{B}}{Q_{T}} \sqrt{\pi D T}\left(1+\frac{2 \sqrt{\pi D T}}{W}\right)\right]\right] 1 / 2 \tag{4.29}
\end{equation*}
$$

Plots of $Q_{I} / Q_{T}$ and $X_{J}(W) / X_{J}(\infty)$ for various diffusion cases, are shown in Figs. 4.6 and 4.7.

The conductivity of a sidewall may now be determined in a manner similar to that used in the planar case. First, the depletion region

Fig. 4.6 Normalized surface concentration or effective implant dose as a function of $\sqrt{\pi D T} / W$.


Fig. 4.7 Normalized junction depth as a function of $\sqrt{\pi D T} / W$ and $p l a n a r N_{B} / N_{S}$.
thickness on the resistor side of the junction must be determined. It may be shown (Appendix B.2), that for this case the electric field on the diffusion side of the junction is given by

$$
\begin{align*}
E_{1}(r)= & -\frac{2 D T q N_{s}}{r \varepsilon_{s}}\left[e^{-r^{2} / 4 D T}-e^{-r_{L}^{2} / 4 D T}\right] \\
& -\frac{q N_{B}}{2 r \varepsilon_{s}}\left(r^{2}-r_{L}^{2}\right) \tag{4.30}
\end{align*}
$$

while on the substrate side

$$
\begin{align*}
E_{2}(r)= & -\frac{2 D T q N_{S}}{r \varepsilon_{S}}\left[e^{-r^{2} / 4 D T}-e^{-r_{R}^{2} / 4 D T}\right] \\
& -\frac{q N_{B}}{2 r \varepsilon_{S}}\left(r^{2}-r_{R}^{2}\right) \tag{4.31}
\end{align*}
$$

where $r_{L}$ and $r_{R}$ indicate the inside and outside edges of the cylindrical depletion region. For a specified $r_{R}$ (or $r_{L}$ ) the opposite side of the depletion region $r_{L}$ (or $r_{R}$ ) may be determined by solving for the case $E_{1}\left(r_{j}\right)=E_{2}\left(r_{j}\right)$. The electrostatic potential on each side of the junction is then found to be (Appendix B.2)

$$
\begin{align*}
v_{1}= & -\frac{q}{\varepsilon_{s}}\left[-\frac{N_{B}}{4}\left(r_{j}^{2}-r_{L}^{2}\right)+\frac{N_{B}}{2} r_{L}^{2} \ln \left(\frac{r_{j}}{r_{L}}\right)\right. \\
& \left.+2 D T N_{s} e^{-r_{L}{ }^{2} / 4 D T} \ln \left(\frac{r_{j}}{r_{L}}\right)-2 D T N_{s}\left[f\left(r_{j}\right)-f\left(r_{L}\right)\right]\right] \tag{4.32}
\end{align*}
$$

and

$$
\begin{align*}
v_{2} \approx \frac{q}{\varepsilon_{s}} & {\left[-\frac{N_{B}}{4}\left(r_{j}^{2}-r_{R}^{2}\right)+\frac{N_{B}}{2} r_{R}^{2} \ln \left(\frac{r_{j}}{r_{R}}\right)\right.} \\
& +2 D T N_{s} e^{-r_{R}^{2} / 4 D T} \ln \left(\frac{r_{j}}{r_{R}}\right)-2 D T N_{s}\left[f\left(r_{j}\right)-f\left(r_{R}\right)\right] \tag{4.33}
\end{align*}
$$

where

$$
\begin{equation*}
f(r)=\ln (r)+\sum_{i=1}^{N} \frac{\left(r^{2} / 4 D T\right)^{i}(-1)^{i}}{i!(2 i)} \tag{4.34}
\end{equation*}
$$

The total potential across the cylindrical junction is simply

$$
\begin{equation*}
v_{T}=v_{1}+v_{2} \tag{4.35}
\end{equation*}
$$

which equals the combined built-in voltage and applied reverse bias, or

$$
\begin{equation*}
V_{T}=\phi+V_{B} \tag{4.36}
\end{equation*}
$$

where

$$
\begin{equation*}
\phi=\frac{k T}{q} \ln \left[\frac{N\left(r_{L}\right) N\left(r_{R}\right)}{n_{i}^{2}}\right] \tag{4.37}
\end{equation*}
$$

By iteration, the above may be solved for $r_{L}$ and $r_{R}$ given an applied $V_{B}$ (Appendix B.3).

Assuming the same mobility dependence as in the planar case

$$
\begin{equation*}
\mu_{c}(r) \simeq K N_{c}^{m}(r) \tag{4.38}
\end{equation*}
$$

the sidewall conductivity then becomes

$$
\begin{align*}
G_{\text {side }}= & \frac{W_{\pi q K N_{s}^{m}}^{2 L}}{2 L}\left[N_{s} \int_{0}^{r_{L}} e^{-(m+1) r^{2} / 4 D T} r d r\right. \\
& \left.-N_{B} \int_{0}^{r_{L}} e^{-m r^{2} / 4 D T} r d r\right] \tag{4.39}
\end{align*}
$$

As

$$
\begin{equation*}
\int_{b}^{c} e^{a x^{2}} x d x=\left.\frac{e^{a x^{2}}}{2 a}\right|_{b} ^{c} \tag{4.40}
\end{equation*}
$$

then

$$
\begin{align*}
G_{\text {side }} \simeq & \frac{D T W \pi q K N_{s}^{m}}{L}\left[\frac{N_{s}}{(m+1)}\left[e^{-(m+1) r_{L}{ }^{2} / 4 D T}-1\right]\right. \\
& \left.-\frac{N_{B}}{m}\left[e^{-m r_{L}^{2} / 4 D T}-1\right]\right] \tag{4.41}
\end{align*}
$$

The voltage coefficient of the above is obtainable by differentiating

$$
\begin{align*}
\frac{\partial G_{\text {side }}}{\partial V_{B}}= & \frac{\pi q K W r_{L}}{2 L} \frac{\partial r_{L}}{\partial V_{B}} N_{S}^{m}\left[N_{S} e^{-(m+1) r_{L}^{2} / 4 D T}\right. \\
& \left.-N_{B} e^{-m r_{L}^{2} / 4 D T}\right] \tag{4.42}
\end{align*}
$$

where $\partial r_{L} / \partial V_{B}$ must be evaluated numerically.
The entire resistance and voltage coefficient of a diffused resistor with outdiffused sidewalls may now be determined. Specifically

$$
\begin{equation*}
R_{T}=\frac{R}{2 R G_{\text {side }}+1} \tag{4.43}
\end{equation*}
$$

and

$$
\begin{align*}
\frac{\partial R_{T}}{\partial V_{B}}= & \frac{1}{\left[2 R G_{\text {side }}+7\right]}\left[\frac{\partial R}{\partial V_{B}}-\frac{2 R}{\left[2 R G_{\text {side }}+1\right]}\left[G \frac{\partial R}{\partial V_{B}}\right.\right. \\
& \left.\left.+\frac{R \partial G}{\partial V_{B}}\right]\right] \tag{4.44}
\end{align*}
$$

Plots of normalized resistivity bias voltage coefficient as a function of applied bias and resistor width are shown in Fig. 4.8 for a typical CMOS well resistor.

Lastly, the voltage dependence for $R_{T}$ with respect to $V_{D}$ must be determined. Assuming

$$
\begin{equation*}
R_{T}=\frac{1}{W} \int_{0}^{L} \rho(y) d y \tag{4.45}
\end{equation*}
$$

and assuming that a linear IR drop exists along the length of the resistor

$$
\begin{equation*}
\rho(y)=\rho+\frac{\partial \rho}{\partial V} \frac{y}{L} V_{D} \tag{4.46}
\end{equation*}
$$

and as


Fig. 4.8 (a) Normalized resistivity width dependence as a function of applied bias. (b) Normalixed resistance bias voltage coefficient as a function of width and applied bias. For each figure; $Q_{T}=3.5 \times 10^{12} / \mathrm{cm}^{2}, D T=3.91 \times 10^{-8} \mathrm{~cm}^{2}, N_{B}=1 \times 10^{15} / \mathrm{cm}^{3}$ and $\rho_{S}(O V)=5.4 \mathrm{k} \Omega / \square$.

$$
\begin{equation*}
\frac{\partial \rho}{\partial V}=\frac{\partial R_{T}}{\partial V_{B}} \frac{W}{L} \tag{4.47}
\end{equation*}
$$

then

$$
R_{T}=\frac{1}{W} \int_{0}^{L}\left[\begin{array}{lll}
\rho+\frac{\partial R_{T}}{\partial V_{B}} & \frac{W y}{2} & V_{D} \tag{4.48}
\end{array}\right] d y
$$

so, for small values of $V_{D}$

$$
\begin{equation*}
\frac{\partial R_{T}}{\partial V_{D}}=\frac{1}{2} \frac{\partial R_{T}}{\partial V_{B}} \tag{4.49}
\end{equation*}
$$

This expression actually indicates the average of the resistivities in the area near $y=L$ and the area near $y=0$, because of the assumption that the I-R drop along the resistor be linear in $y$.

### 4.4 Compensation of Voltage Dependence in a Diffused Monolithic Resistor

From the discussion in the preceeding sections it is evident that voltage dependent behavior may substantially alter the matching characteristics of diffused resistors. In CMOS technologies, however, there are several techniques for reducing the size of these effects. In one method, the resistor to be matched is placed in an isolated well, which is biased at a variable voltage. Applications of this technique to a current to voltage converter is shown in Fig. 4.9. Using this technique [66], effective voltage coefficients which are on the order of those obtainable with polysilicon resistors have been achieved (~.001\%V).

Using a different technique, equivalent or lower, voltage coefficients should be obtainable. By again placing the precision resistor inside a well, but now inducing an I-R drop in the well which is similar to that in the precision resistor, the $n+$ depletion region width is kept fairly constant. The well may either be placed in parallel with the $n+$


Fig. 4.9 Variable well bias may be used to reduce the voltage coefficient of a diffused resistor. A current to voltage converter is shown [66].
resistor, as shown in Fig. 4.10a, or biased independently as shown in Fig. 4.10b. In the former case, the voltage dependence of the well resistivity will usually dominate, while in the latter, the nonlinear voltage drop along the well caused by its voltage dependence, will create a small bias dependence in the $n+$ resistivity. These effects will be analyzed in detail.

Assuming that the well resistivity may be represented as

$$
\begin{equation*}
\rho_{1}=K_{1}\left(V_{D}+V_{B}+\phi_{1}\right)^{n} \tag{4.50}
\end{equation*}
$$

where $V_{D}$ and $V_{B}$ are as defined in Fig. 4.3, then the voltage drop at any point along its length may be given by

$$
\begin{align*}
V(Y) & =\int_{0}^{Y} I \rho_{1}(t) d t \\
& =I K_{1} \int_{0}^{Y}\left(\frac{t}{L} V_{D}+V_{B}+\phi_{1}\right)^{n} d t \tag{4.51}
\end{align*}
$$

As the nominal I-R drop is simply

$$
\begin{equation*}
V_{\text {nom }}(Y)=\frac{Y}{L} V_{D} \tag{4.52}
\end{equation*}
$$

this indicates a nonlinear error voltage of the amount

$$
\begin{align*}
V_{e}(Y)= & \frac{I K_{1} L}{V_{B} W_{1}(n+1)}\left[\left(\frac{Y}{L} V_{D}+V_{B}+\phi_{1}\right)^{n+1}-\left(V_{B}+\phi\right)^{n+1}\right] \\
& -\frac{Y}{L} V_{D} \tag{4.53}
\end{align*}
$$

where the well current is $V_{D} / R_{p}(L)$. As

$$
\begin{equation*}
R_{1}(L)=\frac{1}{W_{1}} \int_{0}^{L} K_{1}\left(\frac{t}{L} V_{D}+V_{B}+\phi_{1}\right)^{n} d t \tag{4.54}
\end{equation*}
$$

this indicates that


Fig. 4.10a Low voltage coefficient resistor.


Fig. 4.10b Improved low voltage coefficient resistor configuration in a current to voltage converter.

$$
\begin{equation*}
I=\frac{V_{D}^{2} W_{1}(n+1)}{L K_{1}\left[\left(V_{D}+V_{B}+\phi_{1}\right)^{n+1}-\left(V_{B}+\phi_{1}\right)^{n+1}\right]} \tag{4.55}
\end{equation*}
$$

Assuming that the error voltage is small the $n+$ resistance may be given by

$$
\begin{align*}
R_{2}(Y)= & \frac{1}{W_{2}} \int_{0}^{Y} \rho_{2}+\frac{\partial \rho_{2}}{\partial V} V_{e}(t) d t  \tag{4.56}\\
\sim & \frac{Y \rho_{2}}{W_{2}}+\frac{1}{W_{2}} \frac{\partial \rho_{2}}{\partial V}\left[\frac { I K _ { 1 } L } { V _ { D } W _ { 1 } ( n + 1 ) } \left(\frac { L } { V _ { D } ( n + 2 ) } \left(\left(\frac{Y V_{D}}{L}+V_{B}+\phi_{1}\right)^{n+2}\right.\right.\right. \\
& \left.\left.\left.-\left(V_{B}+\phi_{1}\right)^{n+2}\right)-Y\left(V_{B}+\phi_{1}\right)^{n+1}\right)-\frac{1}{2} \frac{Y^{2} V_{D}}{L}\right] \tag{4.57}
\end{align*}
$$

and

$$
\begin{align*}
R_{2}(L)= & \frac{L \rho_{2}}{W_{2}}+\frac{L}{W_{2}} \frac{\partial \rho_{2}}{\partial V}\left[\frac { I K _ { 1 } L } { V _ { D } W _ { 1 } ( n + 1 ) } \left(\frac { 1 } { V _ { D } ^ { ( n + 2 ) } } \left(\left(V_{D}+V_{B}+\phi_{1}\right)^{n+2}\right.\right.\right. \\
& \left.\left.\left.-\left(V_{B}+\phi_{1}\right)^{n+2}\right)-\left(V_{B}+\phi_{1}\right)^{n+1}\right)-\frac{V_{D}}{2}\right] \tag{4.58}
\end{align*}
$$

In the first case considered (Fig. 4.10a), the total resistance is the parallel combination of $R_{1}(L)$ and $R_{2}(L)$, while in the second case (Fig. 4.10b), the total impedance is simply $R_{2}(L)$. Plots of absolute resistor error for each of these cases when applied to fairly conventional $\mathrm{n}+$ and p -well resistors are shown in Fig. 4.11.

### 4.5 Temperature Coefficient of Resistors

Because electron and hole drift mobilities in silicon are temperature as well as concentration dependent, silicon resistors of any type will display temperature dependent behavior. This is also true of thinfilm metallic resistors, although often to a smaller degree. A plot of electron and hole mobilities in silicon as a function of dopant concentration


Fig. 4.11 Resistor error as a function of voltage drop for the resistor configurations shown in Fig. 4.10. The type I resistor is for $R_{1}$ (well resistor) in parallel with $R_{2}$ ( $n+$ resistor), while in the type II resistor they are electrically isolated.
and temperature is presented in Fig. 4.12 [67]. For deposited resistors of uniform doping, the temperature dependence may be estimated directly from this figure. For diffused resistors a slight modification to the previous resistivity analysis, will allow resistance determination at any temperature. By assuming that the previously defined mobility coefficients $K$ and $m$ are not constant in a given region, but are in fact temperature dependent, all of the previous analysis will still remain valid. Thus it is now assumed that

$$
\begin{equation*}
\mu\left(N_{c}, T\right)=K(T) N_{c}^{m(T)} \tag{4.59}
\end{equation*}
$$

where as before $K$ and $m$ are defined for a region of values of $N_{c}$. Note that $K$ and $m$ are easily obtainable from Fig. 4.12, by determining the best fit of the equation
$\ln \mu=m \ln N_{c}+\ln K$
in the temperature and doping range of interest.

### 4.6 Matching Consideration in Precision Capacitors

An alternative to the precision resistor, is the precision MOS capacitor, which has recently gained considerable acceptance. Depending upon the specific type of IC process employed, capacitors with metal or poly top plates and poly or diffusion bottom plates may be available, with usually a fairly thin $\mathrm{SiO}_{2}$ layer acting as the intermediate dielectric. In certain processes, however, no precision capacitors are directly available (as in a single-poly self-aligned process for instance) which forces the use of precision resistors.

Consider the general pair of matched capacitors shown in Fig. 4.13. If the oxide thickness is assumed to be fairly uniform, and the top and


Fig. 4.12 Electron mobility as a function of doping concentration and temperature [67].
든

Fig. 4.13 A matched pair of MOS capacitors.
bottom plates are assumed to be either metallic or degenerately doped, the value of either capacitor may be given by

$$
\begin{align*}
C & =\varepsilon_{o x} \int \frac{1}{t_{o x}} d A \\
& =\frac{A \varepsilon_{o x}}{\overline{t_{o x}}} \tag{4.6}
\end{align*}
$$

where $t_{o x}$ is the average oxide thickness in the capacitor, and $A$ is the total overlap area. If an uncertainty $\delta$ is assigned to each capacitor parameter, a potential error will result in the final capacitor value of the amount

$$
\begin{equation*}
\frac{\delta C}{C}=\left[\left(\frac{\delta A}{A}\right)^{2}+\left(\frac{\delta \overline{t_{o x}}}{\overline{t_{o x}}}\right)^{2}\right]^{1 / 2} \tag{4.62}
\end{equation*}
$$

For approximately rectalinear capacitor structures, this becomes

$$
\begin{equation*}
\frac{\delta C}{C} \simeq\left[\left(\frac{\delta W}{W}\right)^{2}+\left(\frac{\delta L}{L}\right)^{2}+\left(\frac{\delta \overline{t_{0 x}}}{\overline{t_{0 x}}}\right)^{2}\right]^{1 / 2} \tag{4.63}
\end{equation*}
$$

As in the resistor case, the relevant uncertainties in the above, usually indicate the spread within the precision components on a single die, and not the variation across different wafers or lots. Unlike the situation with resistors, the aspect ratio of capacitors is usually near unity ( $L / W \simeq 1$ ), thus $\delta W / W$ and $\delta L / L$ will normally contribute about equally to overall capacitance error. Variations in $\overline{t_{o x}}$ may be due to thermal gradients, dopant inhomogeneities or crystal defects which are present during processing (if thermally grown), or silane concentration gradients (if deposited). Capacitor matching of better than $.5 \%$ in even moderate ratio situations [68] is commonly accepted, and one report indicates that an untrimmed $.1 \%$ matching is achievable with high yield [69]. Typically, capacitors which must match to greater than 8 or 9 bits and/or
which must be made quite small, are trimmed in quantum amounts by either electrical [70] or fusable link techniques [71].

As was the case with precision resistors, truely random edge variations may only be compensated for by using larger device geometries, although the effect of linear gradients may be reduced by using an interdigitized or common centroid structure (Fig. 4.14 a and 4.14 b repectively).

### 4.7 Capacitance of Nonplanar Structures

Depending upon the specifics of an IC process, completed capacitors may not resemble the planar structures illustrated in Fig. 4.13. This is particularly true of capacitor types which are principally defined by a thin oxide mask cut, such as those shown in Fig. 4.15. In cases such as these, the total overlap capacitance is divided into 10 different areas of 4 basic types, as illustrated in Fig. 4.16a. The region of largest capacitance is usually one of planar thin oxide as before, however now, this area is surrounded by slanted sidewall and corner regions and a planar field oxide area. In some circuit applications the contributions of these areas to the total cell capacitance will be unimportant, as circuit operation may depend only upon the absolute ratio of capacitance, and all capacitors are fabricated from unit cells of the same size [72]. In other applications, however, which for reasons of speed must attempt to minimize overall capacitance [73], or in circuits which must use noninteger capacitance ratios (e.g., filters), all of these regions must be taken into account.

A cross section of a thin oxide cut in a nonplanar capacitor, is shown in Fig. 4.16b. As is apparent in the figure, some undercutting from the original mask opening will usually have taken place, and the sidewalls are typically slanted (affords good step coverage). In terms


Fig. 4.14a Interdigitating of capacitors to compensate for linear oxide gradients $\left(C_{A} \simeq C_{B}\right)$.


Fig. 4.14b Common centroid geometry ( $C_{A}>C_{B}$ ).


Fig. 4.15 Nonplanar capacitor structures. (a) Metal on poly top plate and diffused bottom plate structure is common in metal gate and slightly modified silicon gate processes. (b) Metal top plate and poly bottom plate is available in some modified silicon gate processes.

(Field $0 x$ )
(b)


Fig. 4.16 Generalized nonplanar capacitor. (a) Top view showing division of capacitance into 10 different areas of 4 basic types. (b) Capacitor cross-section.
of the various dimensions noted in the figure, this results in a planar thin oxide region of 1 inear dimension $B$, where

$$
\begin{equation*}
B=L+2 T_{1}-2(D-T) \operatorname{Tan} \phi \tag{4.64}
\end{equation*}
$$

and a total planar capacitance of the amount

$$
\begin{equation*}
C_{p t}=\frac{\varepsilon_{o x}}{T}\left[L+2 T_{1}-2(D-T) \operatorname{Tan} \phi\right]^{2} \tag{4.65}
\end{equation*}
$$

The capacitance of a sidewall region may be determined by analyzing the simplified cross section shown in Fig. 4.17a. Assuming that the sidewall is linear as shown, a differential capacitance may be defined of the amount

$$
\begin{equation*}
\delta C=\left(\frac{\varepsilon_{0 x}}{y(x)}\right) \delta x \delta y \tag{4.66}
\end{equation*}
$$

so, the capacitance of a sidewall of length $B$ and width $T_{2}$ is given by

$$
\begin{align*}
C_{s} & =\int_{0}^{B} \int_{0}^{T_{2}}\left(\frac{\varepsilon_{0 x}}{D+\frac{(T-D)}{T_{2}} x}\right) d x d y \\
& =\varepsilon_{0 x} \frac{B T_{2}}{(T-D)} \ln \left(\frac{T}{D}\right) \tag{4.67}
\end{align*}
$$

where $T_{2}$ may be defined in terms of the sidewall angle $\phi$, by

$$
\begin{equation*}
T_{2}=(D-T) \operatorname{Tan} \phi \tag{4.68}
\end{equation*}
$$

The capacitance of a corner section may be determined in a similar fashion to the sidewall, except that now, the section of length $L$ actually becomes an effective length of $T_{2}-x$ (see Fig. 4.17b) so

$$
\begin{aligned}
c_{c} & =\int_{0}^{T_{2}-x} \int_{0}^{T_{2}}\left(\frac{\varepsilon_{0 x}}{D+\frac{(T-D)}{T_{2}} x}\right) d x d y \\
& =\int_{0}^{T_{2}}\left(T_{2}-x\right)\left(\frac{\varepsilon_{0 x}}{D+\frac{(T-D)}{T_{2}} x}\right) d x
\end{aligned}
$$



Fig. 4.17a Cross-section of one sloped sidewall.


Fig. 4.17b Diagram of corner region.


Fig. 4.17c Top plate overlap of field region.

$$
\begin{equation*}
=\varepsilon_{0 x}\left[(T-D)-\frac{D T_{2}^{2}}{(T-D)} \ln \left(\frac{T}{D}\right)\right] \tag{4.69}
\end{equation*}
$$

Lastly the planar capacitance over the field region must be determined. Defining the top plate overlap to be that shown in Fig. 4.17c, the field capacitance is given by

$$
\begin{equation*}
c_{p f}=\frac{\varepsilon_{0 x}}{0}\left[4 T_{3}\left(L+2 T_{1}\right)+4 T_{3}^{2}\right] \tag{4.70}
\end{equation*}
$$

where the capacitance due to interconnect tabs has not been included.
Thus the total capacitance of a structure of this type may be given by

$$
C_{T}=C_{p t}+4 C_{s}+4 C_{c}+C_{p f}+C_{t a b}
$$

where $C_{\text {tab }}$ is any interconnect capacitance, and all the other variables are as previously defined.

### 4.8 Voltage Coefficient of MOS Capacitors

For capacitors which employ a doped semiconductor top or bottom plate, the preceding capacitance calculations have been slightly simplified. For calculation of voltage and temperature dependence, a number of additional effects must be considered. Because a doped semiconductor does not behave as a metallic material, it will exhibit a finite surface capacitance. That is, a semiconductor which functions as a plate of a capacitor will be unable to supply charge exactly at the oxide interface, but only some distance from it. For a capacitor which employs one metal, and one semiconductor plate, this results in an effective total capacitance of

$$
\begin{equation*}
C_{T}=\frac{C_{0 x} C_{S}}{C_{0 x}+C_{S}} \tag{4.71}
\end{equation*}
$$

where $C_{0 x}$ is the oxide capacitance and $C_{s}$ is the effective silicon sur-
face capacitance. A number of techniques have been developed for the evaluation of $C_{s}$ under different conditions [74]: In general, however,
it is simplest to assume that the silicon surface is non-degenerate and
fully-ionized [75], which enables the use of Maxwell-Boltzman statis-

$$
Q_{s}=\frac{-U_{S}}{\left|U_{S}\right|}\left[2 \varepsilon_{s i} k T N_{D}\left(e^{U_{S}}-1-U_{S}\right)\right]^{1 / 2}
$$

$(\triangleright L \cdot \downarrow)$

$$
(G \angle \cdot b)
$$

$$
\left(\varepsilon L^{\circ} \triangleright\right)
$$ tics.

Kq uən!6 s! lof!כedes SOW ue ssouse po!ldde a6eflon lezof ayl

plate, this latter quantity can be expressed as [76]

$$
\begin{aligned}
& \text { total voltage coefficient may now be determined. Specifically } \\
& \qquad \alpha=\frac{1}{C_{T}} \frac{\partial C_{T}}{\partial V_{T}} \\
& \text { or eventually } \\
& \qquad \alpha \simeq \frac{C_{0 x}}{C_{S}^{2}} \frac{\partial C_{S}}{\partial U_{S}}\left[\frac{\partial V_{T}}{\partial U_{S}}\right]-1
\end{aligned}
$$

$$
\begin{aligned}
& V_{T}=V_{F B}+\phi_{S}-\frac{Q_{S}}{C_{O X}} \\
& (Z L \cdot \downarrow)
\end{aligned}
$$

where $\partial C_{S} / \partial U_{S}$ may be found from Eq. (4.74), and $\partial V_{T} / \partial U_{S}$ from Eqs. (4.72) and (4.73),

The total voltage coefficient may then be found to be [78]

$$
\begin{equation*}
\alpha=\frac{c_{o x}{ }^{2}}{\varepsilon_{s i}{ }^{q N_{D}}}\left[\frac{e^{2 U_{s}}-2 U_{s} e^{U_{s}}-1}{\left(e^{U_{s}}-1\right)^{3}}\right] \tag{4.78}
\end{equation*}
$$

or near $V=V_{F B}\left(U_{S} \rightarrow 0\right)$

$$
\begin{equation*}
\alpha=\frac{c_{o x}{ }^{2}}{3 q \varepsilon_{s i} N_{D}} \tag{4.79}
\end{equation*}
$$

As is apparent from the above, higher doping levels should result in a nearly linear improvement in voltage coefficient. As the impurity concentration approaches very high levels ( $N_{D} \geq 2 \times 10^{19}$ ); however, the assumption regarding non-degeneracy is no longer valid. In this region a more involved capacitance model must be used, which assumed only partial donor ionization and which includes impurity band broadening effects (IB). The predicted results of both of the above models, and a third which also assumed Fermi-Dirac statistics but with complete ionization (FI) are shown with experimental data in Fig. 4.18 [79].

Note that although the above analysis using Maxwell-Boltzman statistics is rather simplified, it still provides reasonable agreement with the experimental data in even degenerately doped cases. Of perhaps greatest interest in this figure, is the very low voltage coefficient of MOS capacitors with even a modestly doped semiconductor plate. These voltage coefficient figures compare favorably with those of thin-film resistors or diffused resistors which are used with a bias compensation technique. These figures are even more impressive when coupled with temperature characteristics which are described in the next section.


Fig. 4.18 Measured and predicted voltage coefficient of capacitance for different bottom plate impurity concentrations [79]. See text for label definitions.

### 4.9 Temperature Coefficient of MOS Capacitors

The temperature dependence of an MOS capacitor may be determined by using similar techniques to those used in the previous section.

The normalized temperature coefficient of capacitance may be given by

$$
\begin{equation*}
\beta=\frac{1}{A C_{T}} \frac{\partial A C_{T}}{\partial T} \tag{4.80}
\end{equation*}
$$

in which $C_{T}$ is the total capacitance per unit area, and where $A$ is the total capacitor area. The expression for the temperature coefficient may be expanded to give

$$
\begin{align*}
\beta & =\frac{1}{A} \frac{\partial A}{\partial T}+\frac{1}{C} \frac{\partial C}{\partial T} \\
& =\left[\frac{1}{A} \frac{\partial A}{\partial T}-\frac{1}{t_{o x}} \frac{\partial t_{o x}}{\partial T}\right]+\frac{C_{o x}}{C_{s}{ }^{2}} \frac{C_{s}}{\partial T}+\frac{1}{\varepsilon_{0 x}} \frac{\partial \varepsilon_{0 x}}{\partial T}  \tag{4.81}\\
& =\beta_{t h}+\beta_{s C}+\beta_{\varepsilon_{o x}} \tag{4.82}
\end{align*}
$$

where the first term corresponds to physical modification of the relevant capacitor dimensions with temperature, the second indicates modification of the effective silicon surface capacitance, while the last indicates temperature dependence of the oxide dielectric constant. McCreary has shown that the first of these terms for a circular capacitor may be expressed by [79]

$$
\begin{equation*}
\beta_{t h}=\frac{2 \alpha_{s i}-\left(1+v_{o x}\right) \alpha_{o x}}{\left(1-v_{o x}\right)} \tag{4.83}
\end{equation*}
$$

where $\alpha_{s i}$ and $\alpha_{o x}$ are the linear thermal expansion coefficients of silicon dioxide and $v_{o x}$ is the Poisson's ratio for $\mathrm{SiO}_{2}$.

The temperature dependence of the silicon surface capacitance may be obtained by differentiating Eq. (4.75). This differentiation is made considerably easier by first expanding $e^{U_{S}}$ about zero and ignoring all but first order terms. This approximation is valid about the flatband voltage, and results in a space-charge temperature dependence of

$$
\begin{equation*}
\beta_{s c} \simeq \frac{C_{o x} k}{2 q\left(\varepsilon_{s i} N_{D} k T\right)^{1 / 2}}\left[1-T \frac{1}{\varepsilon_{s i}} \frac{\partial \varepsilon_{s i}}{\partial T}\right] \tag{4.84}
\end{equation*}
$$

The value of $\beta_{\varepsilon_{0 x}}$ may be determined from previously reported data, which has been given as between $+15 \mathrm{ppm} /{ }^{\circ} \mathrm{C}$ [80] and $+21 \mathrm{ppm} /{ }^{\circ} \mathrm{C}$ [79]. Assuming values of $2.8 \mathrm{ppm} /{ }^{\circ} \mathrm{C}$ and $.5 \mathrm{ppm} /{ }^{\circ} \mathrm{C}$ for the linear expansion coefficients of Si and $\mathrm{SiO}_{2}$ [81], a value of .163 for $\mathrm{v}_{0 \mathrm{x}}$ [82], and a value of $+250 \mathrm{ppm} /{ }^{\circ} \mathrm{C}$ for $\frac{1}{\varepsilon_{\mathrm{si}}} \frac{\partial \varepsilon_{\mathrm{si}}}{\partial T}$, this indicates $\mathrm{a} \cdot \beta_{\mathrm{th}}$ of $+6.0 \mathrm{ppm} /{ }^{\circ} \mathrm{C}$ and a $\beta_{\text {sc }}$ of $-1.5 \mathrm{ppm} /{ }^{\circ} \mathrm{C}$ in a capacitor with $1000 \AA$ of oxide and a surface doping of $1.6 \times 10^{20}$ [79]. This then indicates a total $\beta$ of between 20 and $26 \mathrm{ppm} /{ }^{\circ} \mathrm{C}$. This approximate value is verified by the experimental results shown in Fig. 4.19 [79]. Also included on this plot are the predicted results from a more complete model which includes degeneracy effects [83].


Fig. 4.19 Measured and predicted temperature coefficient of capacitance for different bottom plate impurity concentrations [79].

## CHAPTER 5

ANALOG CIRCUIT BUILDING BLOCKS

In this chapter, a family of general purpose analog building blocks is described. These circuits, using the precision elements of Chapter 3 as ratio standards, allow any of the previously discussed converters to be realized. Sections 5.1-5.3 examine source followers, and single and two stage transconductance amplifiers, while Section 5.4 discusses various design considerations in integratcrs. Section 5.5 covers output stages, while comparators are examined in Section 5.6. Noise in analog circuits is discussed in Section 5.7.

For the designs in this chapter, a simple MOS model has been adopted for ease of calculation [84]. More rigorous modeling information should be included in computer circuit simulations for optimization of results (for example $[85,86])$. It is assumed here, that for an NMOS device,

$$
\begin{equation*}
I_{D} \simeq \frac{\beta\left(V_{G S}-V_{T}\right)^{2}}{2\left(1-\lambda V_{D S}\right)}, V_{D S}>V_{G S}-V_{T} \tag{5.1}
\end{equation*}
$$

and

$$
\begin{equation*}
I_{D} \simeq \frac{B\left[\left(V_{G S}-V_{T}\right) V_{D S}-V_{D S}^{2} / 2\right]}{\left(1-\lambda V_{D S}\right)}, V_{D S}<V_{G S}-V_{T} \tag{5.2}
\end{equation*}
$$

where $\beta=\mu C_{o x} W / L, \lambda$ is the channel length modulation parameter and $V_{g s}, V_{d s}$ and $V_{T}$ indicate the gate to source, drain to source and threshold voltages respectively. For a PMOS device the same equations apply, but where now $\lambda$ is negative, and a positive current indicates current flow from (rather than to) the MOSFET.

### 5.1 Source Followers

Source followers are commonly employed as a level shifter between high gain stages, or as a buffer between a high impedance gain stage and a low impedance load. With slight modifications to reduce distortion, they are also suitable for use as moderate precision voltage followers, which are useful in pipeline converters and input sample and holds.

### 5.1.1 Frequency Response and Gain

In its simplest configuration, the source follower takes the form shown in Fig. 5.la. For this circuit, the first order small signal gain is given by

$$
\begin{equation*}
\frac{v_{o u t}(s)}{V_{i n}(s)}=\frac{\left(s C_{1}+g_{m_{1}}\right)}{\left(g_{m_{1}}+g_{m b}+1 / R_{L}+s\left(C_{1}+C_{L}\right)\right)} \tag{5.3}
\end{equation*}
$$

where $g_{\mathrm{mb}}$ is the back-bias transconductance of $M_{1}$ and $R_{L}$ is the parallel impedance of the nonideal current source and the output load. At low frequencies, this expression reduces

$$
\begin{equation*}
\frac{v_{\text {out }}}{V_{\text {in }}}=\frac{g_{m_{1}}}{\left(g_{m_{1}}+g_{m b}+1 / R_{L}\right)} \tag{5.4}
\end{equation*}
$$

which for small values of $g_{m b}$ and $1 / R_{L}$ can be made very near unity.
As is evident from Eq. (5.3), the source follower exhibits a singlepole single-zero frequency response. By placing the pole and zero at the same frequency, this circuit may be made very broadband. This is seen to occur, where

$$
\begin{equation*}
\frac{C_{L}}{C_{1}}=\frac{g_{m b}+l / R_{L}}{g_{m_{1}}} \tag{5.5}
\end{equation*}
$$



Fig. 5.1 Source follower configurations in CMOS. (a) Basic circuit. (b) Input device in an isolated well. (c) Input derived variable cascode for reduced distortion. (d) Output derived variable cascode.

For source followers which must exhibit a gain approaching unity, this condition can rarely be satisfied. Broadband circuits which drive low impedance loads (on the order of $1 / g_{m_{1}}$ ) are easily designed, however, for operation over at least a small dynamic range.

### 5.1.2 Transfer Characteristic Nonl inearity

A common problem with the source follower circuit as previously shown, is nonlinearity of the transfer function over large output swings. This problem occurs because of nonzero drain output conductances in each of the MOS devices and backbias modulation of the threshold voltage of $M_{1}$. The output conductance of the current source may be made high by employing a long geometry device and/or by using series feedback. These techniques are not generally applicable to $M_{p}$, however, as they will directly reduce circuit bandwidth. The nonlinearity produced by the device output conductances and the threshold modulation of $M_{1}$ in a typical circuit, is demonstrated in Fig. 5.2 (curve a). This nonlinearity may be reduced by placing $M_{1}$ in an isolated well which is tied to its source, as shown in Fig. 5.1b. This procedure, which is only possible in a CMOS or epitaxial process, eliminates first order threshold modulation effects in $M_{1}$ and reduces circuit nonlinearity to that shown in Fig. 5.2 curve b. As is evident from the slope of this curve, the drain conductance of $M_{1}$ contributes heavily to the overall circuit nonlinearity. Required is a circuit technique which will reduce this dependence, as $M_{1}$ should ideally have a short channel length in order to provide good bandwidth. A circuit which at least reduces this problem is shown in Fig. 5.1c. Here, a cascode device $\left(M_{3}\right)$ has been added above $M_{1}$ to maintain it at an approximately constant $V_{d s}$ value. This is accomplished by biasing $M_{3}$ at a voltage which itself tracks the input voltage (via the source follower


Fig. 5.2 Maximum nonlinearity for each of the source followers of Fig. 5.1. Assumes $V_{\text {out }}$ may vary between $\pm 3 \mathrm{~V}, V_{D D}=5 \mathrm{~V}, V_{S S}=-5 \mathrm{~V}$, $\sqrt{2 \varepsilon q N_{A}} / C_{0 X}=.7$ (for NMOS), $\sqrt{2 \varepsilon q N_{D}} / C_{O X}=.35$ (for PMOS), $\lambda_{2}=.002$, $\lambda_{3}=.04, \lambda_{4}=-.04,\left(V_{g s}-V_{T}\right)_{A L L} \simeq 1 V$.
$M_{4}$ ). As demonstrated by curve $c$ in Fig. 5.2, this technique greatly reduces the contributions of $M_{1}$ to the overall nonlinearity, although back-bias effects in $M_{4}$ now become the limiting factor. A further modification of the circuit eliminates this problem. By biasing the cascode circuit from the output node, rather than from the input, the use of the PMOS source follower may be avoided.

This circuit, shown in Fig. 5.2d provides the most 1 inear performance of any of the circuits considered, as demonstrated by curve $d$ of Fig. 5.2. In this configuration, the voltage source $V_{B}$ may be any output independent impedance or voltage drop, such as a pair of diode connected NMOS devices placed in $M_{1}$ 's well. A practical problem with this circuit, lies in the realization of the current source $I_{2}$. As this source is in parallel with the output node, it may contribute a gain roll-off and nonlinearity if its output impedance is not kept sufficiently high. This almost necessitates the use of a series feedback circuit, which regretfully, will further limit the allowable circuit output swing. Thus, low distortion has ultimately been achieved at the expense of dynamic range.

### 5.1.3 Elimination of Intrinsic Offset

The usefulness of a source follower circuit is limited in many applications by the intrinsic voltage difference (offset voltage) between input and output. Although this is not normally important in AC coupled circuits, it is a severe problem in sample and hold and voltage follower circuits of the type discussed in Chapter 3. In these circuits an intrinsic offset may directly contribute to the creation of an effective converter offset, or worse yet, a nonlinearity (in pipeline converters).

Two circuit configurations which may be used to reduce this problem are shown in Fig. 5.3. Both of these circuits employ depletion input


Fig. 5.3 Use of depletion source followers with variable bias conditions allows elimination of intrinsic offset. (a) Variable current. (b) Variable backbias.
follower devices which are biased to a $V_{g s}$ of approximately zero volts. The first circuit, that shown in Fig. 5.3a, employs an adjustable follower bias current, while the second, in Fig. 5.3b, uses an adjustable input device back bias technique. The bias generators for each of these circuits monitor the behavior of a "dummy follower" enabling good offset cancellation over large variations in process parameters and operating conditions. Further, one bias generator may be shared over several followers, making this technique fairly area efficient. Note that care must be exercised in the choice of linear amplifiers for these circuits to ensure that the bias generators are stable over all operating conditions. For optimal performance, this technique may be combined with the low distortion methods discussed above.

### 5.2 Single Gain Stage Linear Amplifiers

In many circuit applications, the source followers described in the previous section may not be used, as the application may require a gain above unity, very low distortion or differential operation. In these cases some form of linear amplifier must be used. In many instances the amplifier may only have to drive capacitive loads, while in others (usually those driving off the chip), a low impedance output is required. In this section single-high gain stage transconductance amplifiers are described, which are only suitable for use with high impedance loads. Subsequent sections will discuss two-stage transconductance amplifiers and output buffering techniques.

### 5.2.1 Class-A Amplifiers

A simple common source CMOS Class-A amplifier circuit, is shown in Fig. 5.4. In this figure, $r_{0}$ represents the total effective output loading


Fig. 5.4 CMOS Class-A gain stage.
from the parallel output impedances of $M_{1}$ and $M_{2}$ and the output load (if any), while $C_{L}$ represents the total output node capacitance. The impedance of the bias generator for current source $M_{2}$, is represented by $r_{x}$. It is straightforward to show for this circuit, that the small signal transfer function may be given by

$$
\begin{equation*}
\frac{V_{o u t}(s)}{V_{i n}(s)}=\frac{-g_{m_{1}} r_{0}\left(1-s C_{g d_{1}} / g_{m_{1}}\right)\left(1+s r_{x} C_{g d_{2}}\right)}{1+B s+A s^{2}} \tag{5.6}
\end{equation*}
$$

where

$$
A=\left(C_{g d_{1}}+C_{L}\right) C_{g d_{2}} r_{0} r_{x}
$$

and

$$
\begin{equation*}
B=\left(C_{g d_{1}}+C_{L}\right) r_{0}+C_{g d_{2}}\left(r_{x}+r_{0}-r_{0} r_{x} g_{m_{2}}\right) \tag{5.7}
\end{equation*}
$$

The poles and zeros of Eq. (5.6) may be given by

$$
\begin{aligned}
& z_{1}=+g_{m_{1}} / C_{g d_{1}} \\
& z_{2}=-1 /\left(r_{x} C_{g d_{2}}\right) \\
& p_{1} \simeq-1 / B
\end{aligned}
$$

and

$$
\begin{equation*}
p_{2} \simeq-B / A \tag{5.8}
\end{equation*}
$$

while the gain at low frequencies is simply,

$$
\begin{equation*}
A_{V}=\frac{V_{\text {out }}}{V_{\text {in }}}=-g_{m_{1}} r_{0} \tag{5.9}
\end{equation*}
$$

By way of the previously presented MOS models, the values for $r_{0}$, $g_{m_{1}}$ and $g_{m_{2}}$ may be determined from the device sizes and bias conditions. Specifically, as

$$
\begin{equation*}
I_{1} \approx \frac{B_{1}}{2\left(1-\lambda_{1} V_{D S_{1}}\right)}\left(V_{i n}-V_{T_{1}}\right)^{2} \tag{5.10}
\end{equation*}
$$

then

$$
\begin{equation*}
r_{0_{1}} \simeq\left[\frac{\partial I_{1}}{\partial V_{D S_{1}}}\right]^{-1} \simeq \frac{\left(1-\lambda_{1} V_{D S_{1}}\right)}{I_{1} \lambda_{1}} \tag{5.11}
\end{equation*}
$$

Similarly,

$$
\begin{equation*}
r_{0_{2}} \simeq \frac{\left(1-\lambda_{2} V_{D S_{2}}\right)}{I_{1} \lambda_{2}} \tag{5.12}
\end{equation*}
$$

where $r_{0}=r_{0} \| r_{0_{2}}$. The device transconductance values may be given by
and

$$
\begin{equation*}
g_{m_{1}} \simeq \frac{\partial I_{1}}{\partial V_{i n}} \simeq \sqrt{\frac{2 B_{1} I_{1}}{\left(1-\lambda_{1} V_{D S_{1}}\right)}} \tag{5.13}
\end{equation*}
$$

$$
\begin{equation*}
g_{m_{2}} \simeq \frac{\partial \mathrm{I}_{1}}{\partial \mathrm{~V}_{\mathrm{bias}}} \simeq \sqrt{\frac{2 \mathrm{~B}_{2} \mathrm{I}_{2}}{\left(1-\lambda_{2} \mathrm{~V}_{\mathrm{DS}}{ }_{2}\right.}} \tag{5.14}
\end{equation*}
$$

Note that the transconductance is proportional to $I_{1}^{1 / 2}$ (in the first order model) while $r_{0}$ is proportional to $I^{-1}$. This implies that the low frequency gain goes as roughtly $I_{1}^{-1 / 2}$. Hence, the gain is usually highest when operating at the lowest possible current levels. ${ }^{l}$ As the bandwidth of this circuit is also dependent on the current levels, however, adjustment of the current to achieve a required gain is often not possible. In fact, it is often the case that this simple circuit will display insufficient gain for many applications. In these situations
${ }^{1}$ At low current densities and high drain voltages, channel avalanching may dominate output impedance behavior [87,88]. This may substantially reduce amplifier gain.
the gain may be improved (at the expense of output swing), by employing a cascode circuit above one or both MOS devices, as shown in Fig. 5.5a. This configuration modifies the effective output impedances to

$$
\left.r_{0_{\psi}} \simeq r_{0_{3}} i ? \div g_{m_{3}} r_{0}\right)
$$

and

$$
\begin{equation*}
r_{0_{\uparrow}} \simeq r_{0_{4}}\left(1+g_{m_{4}} r_{0_{2}}\right) \tag{5.16}
\end{equation*}
$$

which raises the stage gain by a factor of $r_{0} \| r_{0_{t}} / r_{0}$. The frequency response of the cascode is usually improved over that of the simple common source amplifier, in that the Miller capacitance around $M_{1}\left(C_{g d}\right)$ now looks into a low impedance node. This greatly increases the frequency of the right half plane zero. The doublet created by overlap capacitance in the current source also creates less problems in the cascode configuration, in that it is normally much more compressed.

The circuit of Fig. 5.5a, is still not useful in many applications, however, in that the circuit displays a large intrinsic offset. This problem may usuaily je oliminated bv incorooratire an offset correcting input stage as shown in Fig. 5.5b. By adjusting the reference current, a known offset may be developed across the input device, which may be used to cancel any offset in the gain stage. This single ended circuit is generally sufficient for most capacitively coupled circuits, if an offset cancellation cycle may periodically be used to ensure that the amplifier operates in its linear region. The low frequency gain of this circuit is simply

$$
\begin{equation*}
\left.A_{V}=\frac{V_{\text {out }}}{V_{\text {in }}}=\frac{-g_{m_{5}} g_{m_{1}}}{\left(g_{m_{5}}+g_{m_{6}}\right.}\right)\left(r_{0_{\uparrow}} \| r_{0_{\psi}}\right) \tag{5.17}
\end{equation*}
$$

while the bandwidth (if $\left|p_{1} A_{V}\right|<\left|p_{2}\right|$ ) is given by


Fig. 5.5 Class-A gain stages using cascode devices. (a) Simple circuit. (b) Circuit with level shift at input for reduction of intrinsic offset.

$$
\begin{equation*}
\omega_{0} \simeq \frac{g_{m_{5}} g_{m_{1}}}{\left(g_{m_{5}}+g_{m_{6}}\right) C_{L}} \tag{5.18}
\end{equation*}
$$

Note that this bandwidth may be reduced if any other poles are near $P_{j}\left|A_{V}\right|$. As a rule, the load capacitance $C_{L}$ should be made sufficiently large that an approximate single-pole response is obtained even in the presence of the nondominant poles.

To allow use of this high-gain circuit in applications requiring differential inputs, as well as in circuits which cannot tolerate use of a variable bias current for coarse offset cancellation, a differential input stage may be used as shown in Fig. 5.6. Note that in this circuit, a Wilson current mirror has been substituted for the cascode/currentmirror circuit used previously.

### 5.2.2 Class-B Operation

In high-speed amplifiers, and in amplifiers which must use power sparingly, a Class-B design is often favored over a Class-A approach. In circuits where the useful bandwidth is limited by the maximum slew rate, end in applications where ine output current requirements vary widely, a Class-B circuit may provide substantially improved performance. An example of a Class-B single-ended amplifier is shown in Fig. 5.7. In this circuit the bias sources $V_{B}^{+}$and $V_{B}^{-}$are included to allow adjustment for zero intrinsic offset, as no constant current sources are now available. The modifications shown in this circuit result in an effective amplifier transconductance, of

$$
\begin{equation*}
G_{m}=\frac{g_{m_{5}}{ }^{g_{m_{6}}}}{g_{m_{5}}+g_{m_{6}}} \cdot \frac{I_{1}}{I_{2}}+\frac{g_{m_{7}} g_{m_{8}}}{g_{m_{7}}+g_{m_{8}}} \cdot \frac{I_{1}}{I_{2}} \tag{5.19}
\end{equation*}
$$

and a bandwidth (assuming a single pole response) of simply


Fig. 5.6 Differential input high gain amplifier.


Fig. 5.7 Class-B single-ended amplifier.

$$
\begin{equation*}
\omega_{0}=G_{m} / C_{L} \tag{5.20}
\end{equation*}
$$

This circuit may be made with differential inputs by modifying the circuit as shown in Fig. 5.8. While this circuit allows full Class-B operation with differential inputs, it will in some applications present difficulties due to the fairly small common-mode input range. This problem may be reduced by employing the circuit configuration shown in Fig. 5.9. In this circuit a Class-A differential stage is used to provide an increase in common-mode range, while a level shifter is used to maintain push-pull operation at the output. As opposed to the previously described fully Class-B circuit, the circuit in Fig. 5.9, will still be subject to slew rate limiting, but at twice the value of a Class-A circuit with the same gain-stage current.

### 5.2.3 Methods for Increasing Bandwidth

For a specified load capacitance, circuits similar to those just described will exhibit an improved bandwidth if the overlap and gate capacitances are minimized, and the circuit is operated at relatively high current levels. Reducing the thickness of the gate oxide will help to a point, although at a constant current level this will actually reduce the frequency of some nondominant poles. An alternative broadbanding technique, which is only possible in some CMOS processes, is to use high transconductance bipolar devices at critical nodes. This technique is illustrated in Fig. 5.10. In this circuit, triple-diffused isolated NPN transistors are used to improve the frequency response of the current mirror on the negative supply, as well as to increase the frequency of the current source doublet. This technique is perhaps most useful in moderate or large linewidth processes ( $L_{\min }>6 \mu$ ) where the useful MOSFET


Fig. 5.8 Class-B differential amplifier.


Fig. 5.9 Differential amplifier with improved common-mode range but reduced slew rate.


Fig. 5.10 Use of isolated bipolar devices at critical nodes allows for improved bandwidth. This technique is only possible in specially tailored CMOS processes.
frequency range is low relative to that of a bipolar transistor. In advanced processes, however, the utility of this technique is reduced as the collector resistance of the triple-diffused bipolar devices becomes a speed-limiting factor. This amplifier and the CMOS/bipolar process discussed above, have been described elsewhere by the author [ 89,90$]$ and will not be expanded upon here.

### 5.3 Two Gain-Stage Linear Amplifiers

In the single gain-stage circuits just presented, it is usually necessary to employ cascode devices (or the equivalent) in order to achieve high gain. Although these additional devices do improve the stage gain, they also reduce circuit output swing. Further, as will be shown in a subsequent section, the level shifters needed to provide a low offset with a single-stage circuit will greatly increase amplifier equivalent input noise. These problems may often be eliminated by employing an amplifier with two high-gain stages. In these circuits the gain requirements of each stage are greatly reduced, as only the product of the individual gains need exceed the total gain requirement. Further, the equivalent input noise may usually be kept small by employing a small number of devices in a high gain input stage.

An example of a two gain-stage differential input Class-A amplifier is shown in Fig. 5.11. In this circuit, the block labeled "compensation circuitry" is necessary if stable amplifier operation is to be achieved while employing feedback. Several types of compensation circuits are discussed below.


Fig. 5.11 Two gain-stage differential input amplifier.

### 5.3.1 Compensation Techniques

A. Simple Pole-Splitting

A problem with two-stage amplifiers, which is typically more complicated than with single gain-stage types, is that of frequency compensation. While a number of techniques exist for achieving stable feedback operation, the most useful involve some form of pole-splitting. This method in simple form, is shown in Fig. 5.12a. In this figure, the effective output impedance of each stage is represented by resistors $r_{1}$ and $r_{2}$, while the effective capacitive loads are represented by $C_{1}$ and $C_{L}$. As has been shown for this circuit [91], the small signal transfer function may be given by

$$
\begin{equation*}
\frac{v_{\text {out }}(s)}{v_{\text {in }}(s)}=\frac{-g_{m_{1}} g_{m_{2}} r_{1} r_{2}\left(1-s C_{f} / g_{m_{2}}\right)}{1+B s+A s^{2}} \tag{5.21}
\end{equation*}
$$

where

$$
A=r_{1} r_{2}\left(C_{1} C_{L}+C_{f} C_{L}+C_{f} C_{1}\right)
$$

and

$$
\begin{equation*}
B=g_{m_{2}} r_{1} r_{2} C_{f}+r_{1}\left(c_{f}+C_{L}\right)+r_{2}\left(C_{f}+c_{L}\right) \tag{5.22}
\end{equation*}
$$

the poles and zeros of Eq. (5.21) may be given by

$$
\begin{align*}
& z_{1}=+g_{m_{2}} / C_{f} \\
& p_{1} \simeq-1 /\left(g_{m_{2}} r_{1} r_{2} C_{f}\right) \\
& p_{2} \simeq-g_{m_{2}} C_{f} /\left(C_{1} C_{L}+C_{f}\left(C_{1}+C_{L}\right)\right) \tag{5.23}
\end{align*}
$$

while the gain at low frequencies is simply,


Fig. 5.12 Compensation techniques for two gain-stage amplifiers. (a) Simple pole-splitting. (b) Pole-splitting with feedforward blocking. (c) Pole-splitting with R-C control of zero.

$$
\begin{equation*}
A_{V}=\frac{V_{\text {out }}}{V_{\text {in }}}=-g_{m_{1}} g_{m_{2}} r_{1} r_{2} \tag{5.24}
\end{equation*}
$$

In implementations of this circuit which employ bipolar transistors, the zero indicated above is usually at a high frequency, leaving a basically two pole response. For a $45^{\circ}$ phase margin $\left(\left|p_{2}\right| \simeq\left|A_{V} p_{2}\right|\right)$, it is straightforward to show, that $\mathrm{C}_{\mathrm{f}}$ must be sufficiently large, that

$$
\begin{equation*}
c_{f} \geqslant \frac{g_{m_{1}}\left(c_{1}+c_{L}\right)+\sqrt{g_{m_{1}}{ }^{2}\left(c_{1}+c_{L}\right)^{2}+4 g_{m_{1}} g_{m_{2}} c_{1} c_{L}}}{2 g_{m_{2}}} \tag{5.25}
\end{equation*}
$$

which for $C_{L} \gg C_{1}$, reduces to

$$
\begin{equation*}
c_{f} \geqslant \frac{g_{m_{1}} c_{L}}{g_{m_{2}}} \tag{5.26}
\end{equation*}
$$

In MOS circuits, however, the zero at $g_{m_{2}} / C_{f}$ is rarely above $p_{2}$. In fact, in many circuits it is at a sufficiently low frequency that use of this compensation technique will lead to marginal closed loop stability. In these situations, modifications of this basic scheme must be considered. Several of these are described below.

## B. Pole-Splitting with Feedforward Blocking

A commonly used technique for eliminating the right-half plane zero of Eq. (5.21), is to prevent feedforward through the compensation capacitor, as shown in Fig. 5.12b. In this circuit, a voltage follower is used to block signal injection forward, but still allow the polesplitting feedback. This modifies the small signal transfer function, to [92]

$$
\begin{equation*}
\frac{v_{\text {out }}(s)}{v_{\text {in }}(s)}=\frac{-g_{m_{1}} g_{m_{2}}{ }^{r_{1} r_{2}}}{1+B s+A s^{2}} \tag{5.27}
\end{equation*}
$$

where

$$
A=r_{1} r_{2}\left(C_{1} C_{L}+C_{f} C_{L}\right)
$$

and

$$
\begin{equation*}
B=g_{m_{2}} r_{1} r_{2} C_{f}+r_{1}\left(C_{f}+C_{L}\right)+r_{2} C_{L} \tag{5.28}
\end{equation*}
$$

The poles of Eq. (5.27) are given by

$$
p_{1} \simeq-1 /\left(g_{m_{2}} r_{1} r_{2} c_{f}\right)
$$

and

$$
\begin{equation*}
p_{2} \simeq-g_{m_{2}} c_{f} /\left(c_{1} c_{L}+c_{f} c_{L}\right) \tag{5.29}
\end{equation*}
$$

and are quite similar to those presented previously. Most importantly, however, note that the right half plane zero has been eliminated.

This technique is routinely employed with a source follower acting as a buffer circuit. It should be noted that although a source follower will eliminate the right half plane zero, it will contribute a left half plane doublet. Specifically, the pole and zero which normally occur in a source follower circuit will appear as a zero and a pole in the amplifier transfer function. This doublet is potentially useful in obtaining a broadband amplifier, if the zero may be made to cancel the second amplifier pole. In other cases, however, it may lead to a poor settling response as the doublet may be at low frequency and not well compressed.

## C. R-C Compensation Method

A third technique for achieving pole-split amplifier compensation, is shown in Fig. 5.12c [93]. It may be shown for this circuit, (given
enough time), that the poles and zeros are given by

$$
\begin{aligned}
& p_{1} \simeq-1 /\left(g_{m_{2}} r_{2} r_{1} c_{f}\right) \\
& p_{2} \simeq-g_{m_{2}} c_{f} /\left(c_{1} c_{L}+c_{f}\left(c_{1}+c_{L}\right)\right) \\
& p_{3} \simeq-1 / r_{f} c_{1}
\end{aligned}
$$

and

$$
\begin{equation*}
z_{1}=-1 /\left(c_{f}\left(r_{f}-1 / g_{m_{2}}\right)\right) \tag{5.30}
\end{equation*}
$$

Note that when $r_{f}=1 / g_{m_{2}}$, the zero is at infinity, leaving a basically two pole response (assuming $C_{L} \gg C_{L}$ ). In this case the requirement upon the compensation capacitance is, as given previously,

$$
\begin{equation*}
c_{f} \geqslant \frac{g_{m_{1}}}{g_{m_{2}}} c_{L} \tag{5.31}
\end{equation*}
$$

Further, the maximum amplifier bandwidth is given by

$$
\begin{equation*}
\omega_{0} \simeq\left|p_{2}\right| \simeq g_{m_{2}} / C_{L} \tag{5.32}
\end{equation*}
$$

It would be desirable if a technique could be developed which would allow the bandwidth to be increased and/or the compensation capacitance to be made smaller without modifying the basic amplifier and load parameters. This is seen to occur, if the zero is brought in from infinity and placed atop the second pole. ${ }^{2}$ This zero frequency will result when

$$
\begin{equation*}
r_{f} \simeq\left(C_{L}+C_{f}\right) /\left(g_{m_{2}} C_{f}\right) \tag{5.33}
\end{equation*}
$$

providing again a two pole response, but where now the stability

[^7]requirement is given by
\[

$$
\begin{equation*}
\left|p_{3}\right| \geqslant\left|A_{V} P_{j}\right| \tag{5.34}
\end{equation*}
$$

\]

or, in terms of the compensation capacitance

$$
\begin{equation*}
c_{f} \geqslant \sqrt{g_{m_{1}} / g_{m_{2}} \cdot c_{1} c_{L}} \tag{5.35}
\end{equation*}
$$

This is a substantially smaller capacitor than that required previously, allowing a considerable increase in bandwidth. In fact, the maximum amplifier bandwidth (neglecting additional high frequency poles) is now given by

$$
\begin{equation*}
\omega_{0} \simeq\left|p_{3}\right| \simeq \frac{g_{m_{2}} c_{f}}{\left(C_{L}+C_{f}\right) C_{1}} \tag{5.36}
\end{equation*}
$$

The difficulty with this technique is in maintaining the proper resistance $r_{f}$ over large process, temperature and supply variations. If this is not done, the $P_{2} z_{1}$ doublet will separate, resulting in an increased settling time. A circuit which alleviates this problem is shown in Fig. 5.13, where a tracking resistance scheme is used. In this circuit, the required series resistance is generated by the drain-tosource impedance of device $M_{x}$, whose gate (and hence, resistance) is biased by device $M_{z}$, voltage source $v_{o s 2}$, and current source $I$. The second gain stage is represented by driver device $M_{y}$ and current source $K \cdot I$, while the input voltage of this stage is represented by $v_{i n 2}$. The input voltage of the second stage during quiescent periods is represented by $V_{\text {os } 2}$, while capacitances $C_{f}$ and $C_{L}$ are the compensation and load capacitances as before.


Fig. 5.13 Tracking $R-C$ compensation circuit. $R_{d s}$ value of $M_{x}$ tracks the transconductance of $M_{y}$, enabling optimal compensation to be maintained over large process, temperature and current variations.

Using the MOS models presented previously, it is easy to show that if

$$
\begin{equation*}
\left(\frac{W}{L}\right)_{x} \simeq\left[\left(\frac{W}{L}\right)_{y} \cdot\left(\frac{W}{L}\right)_{z} \cdot K\right]^{1 / 2} \cdot \frac{C_{f}}{\left(C_{f}+C_{L}\right)} \tag{5.37}
\end{equation*}
$$

then the resistance condition is satisfied, or

$$
\begin{equation*}
R_{d s_{x}} \simeq \frac{\left(C_{f}+C_{L}\right)}{g_{m_{2}} C_{f}} \simeq R_{f}(\text { nom }) \tag{5.38}
\end{equation*}
$$

Note that Eq. (5.37) is totally independent of process, temperature and supply variations, and is only a function of relative device sizes, which may be well-matched and easily specified. This technique has been employed in the amplifiers in a high order monolithic filter [94], allowing the use of a relatively small compensation capacitor with even a large input stage transconductance and large capacitive load (which were required for reasons of noise and power supply rejection). Similarly, this technique may be employed in applications with small capacitive loads to achieve a very high bandwidth and a fast settling time. This technique is generally most useful in circuits employing poly/poly or metal/poly capacitors, as the optimal resistance value for $r_{f}$ is dependent upon the output capacitive loading. Implementations in a process employing a diffused bottom plate will typically exhibit a sufficiently large voltage dependence of capacitance, that very good pole-zero cancellation may be difficult to achieve over the entire output voltage range.

### 5.4 Integrators

Nearly all of the amplifier applications discussed in Chapter 3 use some form of integrator configuration. As such, it is important that
the amplifier requirements for this type of circuit be considered. In this section, amplifier bandwidth and compensation requirements are discussed, with regard to integrator frequency and step response. Design techniques for minimizing power supply coupling are also considered in detail.

### 5.4.1 Dynamic Characteristics

A simple integrator employing a single-pole transconductance amplifier, is shown in Fig. 5.14. In this figure, $r_{0}$ indicates the total resistive output loading, while $C_{I}, C_{L}$ and $C_{P}$ indicate the integrating, output load and input parasitic capacitances respectively. It is straightforward to show for this circuit, that

$$
\begin{equation*}
\frac{v_{\text {out }}}{i_{\text {in }}}=\frac{-g_{m} r_{0}\left(1-s C_{I} / g_{m}\right)}{s C_{I}\left(g_{m} r_{0}+1+C_{p} / C_{I}\right)\left(1+\frac{s r_{0}\left(C_{L}\left(1+C_{p} / C_{I}\right)+C_{p}\right)}{\left(g_{m} r_{0}+1+C_{P} / C_{I}\right)}\right)} \tag{5.39}
\end{equation*}
$$

which at low frequencies reduces to

$$
\begin{equation*}
\frac{v_{\text {out }}}{i_{\text {in }}} \simeq \frac{-g_{m} r_{0}}{s C_{I}\left(g_{m} r_{0}+1+C_{P} / C_{I}\right)} \tag{5.40}
\end{equation*}
$$

The poles and zeros of Eq. (5.39) are given by

$$
\begin{aligned}
& z_{1}=+g_{m} / C_{I} \\
& p_{1}=0
\end{aligned}
$$

and

$$
\begin{equation*}
p_{2}=\frac{-\left(g_{m} r_{0}+1+C_{p} / C_{I}\right)}{r_{0}\left(C_{L}\left(1+C_{p} / C_{I}\right)+C_{p}\right)} \tag{5.41}
\end{equation*}
$$

As is evident from these equations, this circuit functions as a lossy integrator at low frequencies, but exhibits non-ideal gain and phase


Fig. 5.14 Small-signal model of a simple integrator.
behavior at high frequencies. The right half plane zero occurs because of feedforward through $C_{1}$, while $P_{2}$ is due to amplifier gain rolloff.

In most of the amplifier applications considered here, the step response of this circuit is of critical importance. In pipeline converters, and in sample and holds, the integrator's settling time will directly affect the maximum circuit operating rate. Applying the method of partial fractions to Eq. (5.39), and by performing an impulse analysis via the Laplace transform method, the step response of this circuit may be seen to be,

$$
\begin{equation*}
v_{\text {out }}(t)=v_{\text {step }}\left[1+e^{-p_{2} t}\left(p_{2} / z_{2}-1\right)\right] \tag{5.42}
\end{equation*}
$$

where $V_{\text {step }}$ is the nominal output step size, $V_{2}(0)$ is assumed to be zero, and $p_{2}$ and $z_{1}$ are as indicated above. For a maximum permissible error, a total settling time is seen to be required of the amount

$$
\begin{equation*}
t_{\min }=-\frac{1}{p_{2}} \ln \left[\frac{-\left|\varepsilon_{\max }\right|}{\left(\frac{p_{2}}{z_{1}}-1\right)}\right] \tag{5.43}
\end{equation*}
$$

where

$$
\begin{equation*}
\varepsilon=\frac{V_{\text {out }}(t)-V_{\text {step }}}{V_{\text {step }}} \tag{5.44}
\end{equation*}
$$

For the integrators employed in a pipeline converter or in a sample and hold, it will almost always be the case that the circuit is used as an $A C$ coupled voltage multiplier. In this configuration, the input voltage is fed through the bottom plate of $C_{p}$, providing a forward voltage gain at low frequencies of

$$
\begin{equation*}
A_{x} \simeq \frac{C_{p}}{C_{I}} \frac{g_{m} r_{0}}{\left(g_{m} r_{0}+1+C_{p} / C_{I}\right)} \tag{5.45}
\end{equation*}
$$

This places the high frequency pole at

$$
\begin{equation*}
p_{2} \simeq \frac{-g_{m}}{\left(C_{L}\left(1+A_{x}\right)+C_{p}\right)} \tag{5.46}
\end{equation*}
$$

and the ratio of the pole and zero locations at

$$
\begin{equation*}
\frac{P_{2}}{z_{1}} \simeq \frac{C_{p}}{A_{x}\left(C_{L}\left(1+A_{x}\right)+C_{p}\right)} \tag{5.47}
\end{equation*}
$$

In most practical situations, the zero will not seriously affect the settling response unless the integrator output is fed back to its input through additional circuitry. While this is not the case here, this problem may be alleviated in general, by placing a resistor in series with the integrating capacitance. This is an identical situation to that discussed in Section 5.3 for the R-C compensation technique, which is applied to the integrating second gain stage of an amplifier.

While the preceeding analysis has assumed the use of a single stage transconductance amplifier, it is often the case that a multiple gainstage circuit will be used. If it is assumed that a multiple stage amplifier exhibits a low frequency gain of $a$, and a dominant pole at $P$, it is easy to show that when used as an integrator, it will display a new pole frequency at

$$
\begin{equation*}
p^{\prime} \simeq \frac{2 a P}{1+A_{x}} \tag{5.48}
\end{equation*}
$$

where $A_{x}$ is as previously defined. Note that in both single and multiple stage circuits, care must be exercised to avoid poor settling response due to the presence of non-dominant poles. These poles may be
due to capacitive loading on the amplifier output (in multiple stage amplifiers), or due to internal poles from current mirrors or level shifters. As a general rule, all nondominant poles should be kept at least as high as the dominant pole position after the application of feedback.

Note that because the dominant pole frequency decreases as the forward gain increases, the amplifier need not be heavily compensated for cases where $A_{x}$ is large. Advantage can be taken of this fact in AC-coupled multipliers with gain greater than one, by reducing the amplifier compensation capacitance, and hence increasing the circuit bandwidth. This is not always practical, however, as most AC-coupled multipliers of this type must periodically be offset cancelled to eliminate drift from leakage currents. This offset cancellation cycle usually entails shorting the inverting input and output nodes together, requiring that the amplifier be unity gain stable. In applications which need only be reset occasionally, such as between lines in a video system, it would seem plausible to electrically switch between several compensation capacitors depending upon whether the circuit was in a reset mode or in a mode requiring gain. Further, if different gains are programmable, optimal bandwidth performance may be obtained by programming the total compensation capacitance as well as the components which determine the forward gain.

### 5.4.2 Power Supply Rejection

A. General Case

The sources of power-supply coupling in integrators are considerably different than those which affect circuits with DC feedback. In most
monolithic implementations of this circuit, supply coupling via parasitic capacitance on the integrating node will dominate PSRR behavior. This capacitance may be switch or layout related, or associated with the amplifier used in the integrator. The parasitic capacitances which are most important in terms of supply rejection, are shown in Fig. 5.15. In this figure, device $M_{1}$ represents the input device of the amplifier, while impedance $r_{1}$ represents the impedance from any input stage load $\left(1 / g_{m_{10 a d}}\right.$ if a current mirror). The gain block represents the remainder of the amplifier, while capacitances $C_{P_{1}}$ and $C_{P_{2}}$ are switch or layout parasitics. Note that this circuit may be used to model either single or two stage amplifiers with single-ended or differential inputs, by modifying the various circuit component values. It is easy to show for this circuit, that the low frequency supply rejection may be given by

$$
\begin{equation*}
\frac{\partial V_{\text {out }}}{\partial V_{S S}} \approx \frac{1}{C_{I}}\left[C_{g S}\left(\frac{\partial I}{\partial V_{S S}} \cdot \frac{1}{g_{m_{1}}}+\frac{\partial V_{T_{1}}}{\partial V_{S S}}\right)+C_{P_{2}}\right] \tag{5.49}
\end{equation*}
$$

and

$$
\begin{equation*}
\frac{\partial V_{\text {out }}}{\partial V_{D D}}=\frac{-1}{C_{I}}\left[c_{g d}\left(1-\frac{\partial I}{\partial V_{D D}} \cdot r_{1}\right)+c_{P_{1}}\right] \tag{5.50}
\end{equation*}
$$

The first of these two equations is typically the more troublesome of the two, in that $C_{g s}$ is typically fairly large. This is particularly true in amplifiers which must use a high transconductance or a low noise input device. The first of the terms in Eq. (5.49) may be eliminated by employing a supply independent current source. Note, however, that this is not possible in some of the single-ended amplifiers presented in Section 5.2. The second term in Eq. (5.49), may be reduced by using


Fig. 5.15 Sources of power supply coupling in a real integrator.
input devices which are on a very lightly doped substrate, or eliminated by placing the input devices in an isolated well. In most CMOS processes, this latter technique is usually preferrable. If a supply independent current source is used, the transfer function from the positive rail becomes simply $-\left(C_{g d}+C_{P_{1}}\right) / C_{I}$. This may result in poor supply rejection for large width input devices, even in a self-aligned process. To eliminate this problem altogether, a cascode circuit may be used to buffer the drain of the input devices from positive supply variations [95].

Although the above techniques will reduce the amplifiers contribution to the total supply coupling, the switch and layout parasitics have yet to be considered. The simplest means of reducing feedthrough from these sources, is to minimize the switch sizes and use a large integrating capacitance. Regretfully, this will directly reduced the circuit speed as well. Alternately, driving the switch gates between supplyindependent levels, and the use of an on-chip supply regulator may allow improved performance [96].

## B. Effect of Offset Cancellation Cycle

In many integrator applications, an offset cancellation cycle is used to eliminate the effect of leakage currents and amplifier drift. In input sample and holds this is typically performed once per conversion, while in pipeline converters it is performed on alternate clock cycles. As power supply coupling may be viewed as an apparent drift in amplifier offset, it will tend to be reduced by the reset procedure. In fact, assuming delta function sampling, the supply transfer function to the output of an integrator becomes modified, to

$$
\begin{equation*}
H_{s}(f) \simeq H_{c}(f)\left[1-\frac{\sin \left(\pi T_{s} f\right)}{\pi T_{s} f}\right] \tag{5.51}
\end{equation*}
$$

where $H_{c}(f)$ is the continuous (unclocked) transfer function, $H_{s}(f)$ is the new transfer function and $\mathrm{T}_{\mathrm{s}}$ is the reset frequency.

As is apparent from this equation, the supply rejection will be substantially improved at low frequencies. This reset procedure will be of little help, however, in eliminating power supply noise at, or above the clock frequency. For good rejection at these frequencies, a differ-ential-in, differential-out integrator must usually be used which is well matched on each side. As power supply coupling will appear as a commonmode signal, it will be attenuated (at even high frequencies) by differencing between sides. This technique has been used to improve supply rejection in a monolithic filter [97] and should be useful in converter applications as well.

## - 5.5 Output Stages

The circuitry discussed to this point is useful in on-chip applications with capacitive loads, but is not generally applicable to off-chip drivers or on-chip circuitry which must drive resistors. In these situations, circuitry with a lower output impedance and a higher current drive is usually required. Typically, this can assume the form of a buffer circuit which is used in conjunction with the transconductance amplifiers previously discussed.

The simplest form of output circuitry for an MOS amplifier, is the source follower (Section 5.1). While this type of circuit does provide relatively good bandwidth, it is fundamentally limited in terms of output current capability, by the constant pull-down current source. Although this current may be designed to be as large as necessary, the efficiency of this circuit is extremely low. A preferred embodiment of
a buffer circuit is shown in Fig. 5.16. In this figure, the Class-A source follower has been replaced by a Class-B output driver which is typically merged with a Class-A gain stage. This circuit is usually preferable to the simple source follower, in that it is much more efficient under resistive loading, and is easily designed to offer symmetrical output drive.

A problem with this technique in MOS implementations, however, is that the dynamic range of the circuit is reduced because of body-effect related threshold increases at large output voltages. These threshold increases can substantially reduce the maximum output swing. In a conventional CMOS process, the swing towards the positive rail may be improved by placing each of the NMOS devices in a separate p-well, which is shorted to the respective device's source. This prevents any NMOS threshold increases from occurring, regardless of the output voltage. Maintaining a good swing in the negative direction, however, is more difficult. One approach is to replace the PMOS output source follower with an NMOS pull-down circuit which functionally resembles the PMOS device, but which is not affected by threshold changes at negative output values. A circuit of this type which is commonly employed in NMOS circuits, is shown in Fig. 5.17 [98,99]. In this stage, $M_{24}$ acts as a simple source follower while $M_{20}$ and $M_{21}$ act as a differencing circuit between $V_{\text {in }}$ and $V_{\text {out }}$. Depending upon the values of $V_{i n}$ and $V_{\text {out }}$, a voltage is sent to $M_{25}$ which determines the magnitude of the total pull-down. The output swing of this circuit in the negative direction is a strong function of output loading and device geometries, but in its simplest form is just $V_{S S}+V_{g S_{21}}$ $+V_{\mathrm{gs}_{25}}$. Depending upon the specifics of the process, and the circuit, this may or may not be an improvement over the negative swing of the Class-B


Fig. 5.16 Class $A-B$ gain and output stage,


Fig. 5.17 Push-pull output stage commonly employed in NMOS circuits.
source follower. A problem with this circuit, however, is that the quiescent current through the output stage is a strong function of output voltage. This actually reduces the positive output swing, and makes the output pole frequency level dependent. This can lead to level dependent oscillatory behavior, or a poor settling response with certain output signal amplitudes. These problems may be reduced by employing the CMOS modification of this circuit, shown in Fig. 5.18. In this circuit, $M_{20}$ and $M_{24}$ are source followers while $M_{25}$ is again the pull-down device. In this case, however, $\mathrm{M}_{25}$ is biased by a CMOS differential stage which maintains $V_{\text {out }}$ at approximately the same potential as the source of $M_{20}$. Because of this feedback, the quiescent current through the output stage is kept at a constant level regardless of the output voltage $\left(I_{3} \simeq I_{1} B_{24} / B_{20}\right)$. Further, the swing in the negative direction is very good as no cascoding of NMOS devices occurs between the output node and the gate of $M_{25}$. Thus the major problems of the previous circuit have been alleviated.

Under certain circumstances, however, the circuit of Fig. 5.18 will still present some problems. Specifically, because local feedback is taken from the output, peaking will usually occur with even moderate capacitive loads. Note that this will also be the case, in the circuit of Fig. 5.17. In these situations, the most straightforward approach is to eliminate the local feedback from the output entirely and attempt to modify the simple Class-B follower for improved negative swing. One possible modification, is to use a depletion MOSFET for both the diode connected and source follower PMOS devices. As a depletion PMOS device will exhibit a low threshold even with a large body bias, the maximum output swing towards the negative rail will be greatly improved.


Fig. 5.18 CMOS push-pull output stage.

Depletion devices however, if used in this circuit, may allow widely varying currents to flow through the output devices. This will occur because the diode-connected PMOS device may not always be in the saturation region. This problem may be alleviated by using the output stage shown in Fig. 5.19. In this circuit, constant quiescent current is maintained, regardless of the output level, by sensing the current through the "dummy" output pair $M_{20}$ and $M_{21}$ by way of the PMOS current mirror $M_{22}$ and $M_{23}$. The current being pulled by $M_{23}$ forces the gates of depletion devices $M_{21}$ and $M_{25}$ to a point which will keep the current through $M_{20}$ and $M_{21}$ constant, regardless of the depletion PMOS threshold voltage. As $M_{24}$ and $M_{25}$ are in parallel with $M_{20}$ and $M_{21}$, the current through these devices is also stablized, although at a potentially higher level depending upon the device geometries. This output stage has been used in a high order monolithic filter circuit, to obtain excellent stability, output swing and drive efficiency in the presence of even 1000 pF and/or $300 \Omega$ loads [100]. Further, this performance has been obtained with a quiescent power of only a few milliwatts.

### 5.6 Voltage Comparators

Every type of quantization circuit requires some means of thresholding or comparing. In the analog/digital conversion circuits described here, this typically entails performing a number of rapid voltage comparisons between an unknown input voltage and a number of known reference standards. As demonstrated in Chapter 3, these comparisons may be performed: in parallel by a number of distinct comparators, sequentially by a single comparator; or both. In many of these techniques, the ultimate conversion rate is often determined by the comparator's speed, while


Fig. 5.19 Wide dynamic range Class-B output stage.
in some cases the comparators limit the conversion accuracy as well.

### 5.6.1 Single Ended Comparators

The simplest form of comparator, is just a single-ended amplifier with very high gain. In this type of circuit, the voltage comparison is between the present input voltage (and it's recent history) and the amplifier offset point. In order to eliminate potential output ambiguities, the amplifier is typically used in conjunction with a regenerative stage as shown in Fig. 5.20a. In this configuration, the effective comparator offset may be given by (ignoring gain nonlinearities and regenerator hysteresis)

$$
\begin{equation*}
V_{o s_{T}} \simeq V_{o s_{1}}+V_{o s_{2}} / a \tag{5.52}
\end{equation*}
$$

where $V_{\mathrm{OS}_{1}}$ is the offset of the gain stage, $V_{\mathrm{OS}_{2}}$ is the $i-0$ trip point of the regenerator, and $a$ is the gain of the amplifier. As discussed in previous sections, the intrinsic offset of the amplifier may usually be designed to be small. Invariably, however, process and device variabilities will tend to make offsets distributed about their mean (intrinsic) value. As a result, a converter which uses these circuits will also display an offset, or in some cases (most notably in pipelines) a comparator induced nonlinearity. Although the offset distribution may be made small by making the initial gain contributing devices of the amplifier large and interdigitized, this invariably impacts the overall die size and may effect the comparator's speed. A preferable method is shown in Fig. 5.20b. In this circuit the comparator is capacitively coupled to the input voltage, allowing the comparators offset to be stored on the input capacitor. This is accomplished by shorting the input and

(a)


Fig. 5.20 Single-ended comparators. (a) Simple gain stage and regenerator cascade. (b) A-C coupled comparator with first-order offset cancellation. (c) Same as in "b" but with input differencing.
output nodes of the amplifier together, and at the same time grounding the input side of the coupling capacitor. After this reset cycle, the input capacitor is again switched to the input voltage, allowing comparisons to be performed as before. This has the effect of modifying the total input offset voltage, to

$$
\begin{equation*}
v_{\mathrm{os}_{\mathrm{T}}} \simeq\left(\mathrm{~V}_{\mathrm{os}_{1}}+\mathrm{V}_{\mathrm{os}_{2}}\right) / \mathrm{a}+\mathrm{V}_{\mathrm{ft}}^{1} \tag{5.53}
\end{equation*}
$$

where $V_{\mathrm{ft}_{1}}$ is the feedthrough resulting from the amplifier shorting switch. This feedthrough results from both gate overlap capacitance and MOSFET channel charge pumping [101] as the shorting switch turns off. For circuits employing a small shorting switch and a large input capacitor this latter term may be kept small, although in most CMOS comparators which use a high gain input amplifier, the feedthrough term will usually dominate the total circuit offset.

Although the above circuit must be periodically reset for offset cancellation to occur, this need not happen before every comparison. Typically, the reset frequency will be the same as the converter sampling rate, as leakage currents will (usually) create only a sub-LSB drift over one conversion cycle. In fact, in converters employing a binary weighted capacitor technique, the comparator during its reset cycle will usually function as the input sample and hold. Note that this off-set-cancelled comparator may be operated in a differential mode between two independent input signals, as shown in Fig. 5.20c.

In high precision converters, this simple offset reduction method may not provide a sufficiently small offset. This will be particularly true in circuits which have employed large reset switches for increased speed. Although a feedthrough cancellation device or capacitor may be
used to provide some reduction in switch related offset, the degree to which cancellation occurs will typically be process dependent. Fortunately, the offset reduction technique described above can be modified to reduce first order switch feedthrough effects [102]. This technique is shown in Fig. 5.21. In this circuit, several offset cancelled stages are cascoded together, but with slightly different reset phasing. By continuing to offset cancel the second stage even after the first stage reset switch turns off, the feedthrough from the first stage is effectively eliminated. Further, the effective switch feedthrough from the second stage is attenuated by the gain of the first. In fact, the total offset is now given by

$$
\begin{equation*}
v_{o s_{T}}=\left(\left(v_{\mathrm{os}_{2}}+v_{\mathrm{os}_{3}}\right) / \mathrm{a}_{2}+v_{\mathrm{ft}_{2}}\right) / \mathrm{a}_{1} \tag{5.54}
\end{equation*}
$$

This cascoding procedure may be continued indefinitely (at least to the thermal noise limit), until the required offset is obtained. Note that care must be exercised in circuits of this type, to ensure that the feedthrough of any stage when multiplied by the stage gain does not result in gain stage clipping. This may require that individual stage gains be kept fairly low, or that feedthrough compensating devices or capacitors be used to keep the overall feedthrough at a tolerable level.

### 5.6.2 Differential Comparators

In many situations, the single-ended comparators just presented will be unsatisfactory. This will certainly be the case in any application which requires a differential comparison between two rapidly changing signal levels, or in applications which require good common mode rejection at high frequencies. In these situations, the comparetor


Fig. 5.21 Very low offset comparator employs a cascade of offset cancelled amplifier stages.
configurations are modified, as shown in Fig. 5.22. The first of these figures (Fig. 5.22a) shows a direct coupled differential amplifier used with (again) a regenerator. Note that in this case, however, the flipflop is used to differentially discriminate between two signal levels, rather than between a single input and the trip-point. As before, both stages will contribute to a net comparator offset, as given by Eq. (5.52). By capacitively coupling the inputs of the gain stage to the differential input lines, an offset cancellation may again be made to occur, as indicated by Eq. (5.53) and as shown in Fig. 5.22b. As before, this technique may be expanded to include multiple AC-coupled gain stages, as shown in Fig. 5.22c.

### 5.6.3 Bandwidth Considerations

In the circuits must presented, there are two distinct types of comparator operations. The first occurs only in circuits employing offset cancellation cycles, and is when one or more stages are being reset, by shorting their input(s) and output(s) together. The second type of operation is that normally associated with a comparator; open loop differencing between an input voltage (absolute or differential) and the comparator offset point. These two cases are analyzed below.

## A. Reset Cycle

As the reset cycle usually only occurs once per conversion, and possibly less, the effect of this operation upon the maximum conversion rate is usually small. ${ }^{3}$ This operation is extremely important, however,

[^8]

Fig. 5.22 Differential comparators. (a) Direct-coupled gain stage and regenerator. (b) A-C coupled comparator with one stage of offset cancellation. (c) Multiple stage offset cancellation.
if the input stage of the comparator is used as the input sample and hold. In these situations, the first comparator stage will generally determine the converters distortion characteristics for high frequency input signals.

A small signal model of an inverting gain stage during a reset cycle, is shown in Fig. 5.23. In this figure, $C_{L}$ and $r_{0}$ represent the output capacitance and equivalent load impedance, while $r_{f}$ is the feedback switch impedance, $C_{i n}$ is the input capacitance and $R_{i n}$ is any series input switch resistance. It may be shown for this circuit, that

$$
\begin{equation*}
\frac{v_{1}}{V_{i n}} \simeq \frac{s\left(c_{i n} / g_{m}\right)\left(1+s r_{f} C_{L}\right)}{1+A s+B s^{2}+C s^{3}} \tag{5.55}
\end{equation*}
$$

where

$$
\begin{align*}
& A=C_{i n}\left(1 / g_{m}+r_{i n}\right)+c_{L} / g_{m} \\
& B=\frac{c_{i n} C_{L}}{g_{m}}\left(1 / g_{m}+r_{i n}\right) \\
& C=\frac{C_{i n}{ }^{2} C_{L} r_{i n}}{g_{m}}\left(1 / g_{m}-r_{f}\right) \tag{5.56}
\end{align*}
$$

For cases where $1 / g_{m} \simeq r_{f}$ and $C_{i n}>C_{L}$, this reduces to

$$
\begin{equation*}
\frac{v_{1}}{v_{i n}} \simeq \frac{s\left(c_{i n} / g_{m}\right)}{\left(1+s c_{i n}\left(1 / g_{m}+r_{i n}\right)\right)} \tag{5.57}
\end{equation*}
$$

This equation indicates that the input to the comparator may not be properly reset for high frequency input signals. In fact, at the end of the sampling period, the comparator may exhibit an input dependent offset, which will substantially reduce the signal to noise ratio of high frequency input signals. In practice, this will require that the comparator -3 dB frequency be 50-100 times higher than the maximum full-


Fig. 5.23 Small signal model of the input gain stage of a comparator during a reset cycle.
scale input frequency for a 40 dB signal to noise ratio (see Chapter 6). In addition to introducing distortion, the comparator will also attenuate the sampled value. The voltage across the sampling capacitor may be seen to be

$$
\begin{equation*}
v_{\text {sam }} \simeq v_{x}-v_{1} \simeq \frac{v_{i n}}{\left(1+s C_{i n} r_{i n}\right)}\left[1-\frac{s c_{i n} / g_{m}\left(1+s r_{f} C_{L}\right)}{1+A s+B s^{2}+c s^{3}}\right] \tag{5.58}
\end{equation*}
$$

which, for $g_{m} \simeq 1 / r_{f}$ and $C_{i n}>C_{L}$, may be approximated by

$$
\begin{equation*}
v_{s a m} \simeq \frac{v_{i n} s C_{i n}\left(r_{i n}+1 / g_{m}\right)}{\left(1+s C_{i n} r_{i n}\right)\left(1+s C_{i n}\left(r_{i n}+1 / g_{m}\right)\right)} \tag{5.59}
\end{equation*}
$$

In cases where $1 / g_{m} \simeq r_{i n}$, $V_{\text {sam }}$ will only be attenuated from $V_{i n}$ by -9.5 dB at a frequency of approximately $g_{m} / C_{i n}$. In general, this is not as severe a problem as the distortion case considered above.

## B. Compare Cycle

For comparators which always operate in the linear region, the relevant time constant for each comparator stage is simply $g_{m} / C_{L}$. Thus for an n-stage cascaded comparator, a minimum comparison time will be required of the amount

$$
\begin{equation*}
\tau_{c m p} \simeq \sum_{i=1}^{n} \frac{g_{m_{i}} \ln \varepsilon_{\max }}{C_{L_{i}}} \tag{5.60}
\end{equation*}
$$

where $\varepsilon_{\max }$ is the allowable relative error. In practice, however, few comparators behave linearly over a large input range. This is particularly true for circuits which employ Class-A gain stages, where a significant slewing delay may be encountered. In these circuits an additional
time must be added to each term of Eq. (5.60), which indicates this slewing delay at each stage.

In comparators employing a two stage amplifier, the slew rate may well be determined by stability requirements for the comparator reset period (if one is used). Specifically, the requirement that the amplifier be stable under unity gain conditions may force the use of a large compensation capacitor, which will considerably degrade amplifier slew rate. In these situations, the compensation capacitor may actually be switched out during the compare interval, as no feedback is used.

### 5.7 Noise in Analog Circuits

Nonideal behavior in $A / D$ converters is most often due to a mismatch in precision components, or a limited circuit bandwidth. In high precision converters, it may also be due to device noise. According to most MOS models [703,104], the drain noise current of a MOSFET in saturation over a bandwidth $\Delta f$, is given by

$$
\begin{equation*}
\overline{i_{d}{ }^{2}}=4 k T\left(\frac{2}{3} g_{m}\right) \Delta f+\frac{K I_{d}^{a}}{f^{b}} \Delta f \tag{5.61}
\end{equation*}
$$

where $k$ is Boltzman's constant, $T$ is the absolute temperature and $K$, $a$ and $b$ are device constants. The first term in Eq. (5.61) is thermal noise resulting from a nonzero channel resistance, while the second term is flicker noise created by (apparently) surface carrier trapping. This drain noise current may be expressed in terms of an equivalent gate noise voltage, by dividing through by $g_{m}{ }^{2}$, hence

$$
\begin{align*}
e_{n}^{2} & =\frac{8 k T}{3 g_{m}} \Delta f+\frac{k I_{d}{ }^{a}}{g_{m}^{2} f^{b}} \Delta f \\
& =e_{t}^{2}+e_{f}^{2} \tag{5.62}
\end{align*}
$$

where $e_{f}$ may be expressed as [705]

$$
\begin{equation*}
e_{f} \simeq \sqrt{\frac{a_{f}}{W L}} \tag{5.63}
\end{equation*}
$$

In this equation, $W$ is the device channel width and $L$ is the length, while $a_{f}$ is a single parameter which incorporates all of the previous parameters above. Using Eq. (5.62), an apparent input noise voltage may be found for any circuit, by rms summing the noise contributions of each device at the circuit output, and dividing by the circuit's transfer function. Specifically, if $H_{i}$ is the transfer function from the gate of the ith device to the output, and $H_{p}$ is the principal transfer function, then the equivalent circuit input noise is given by,

$$
\begin{equation*}
e_{n}^{i n} \simeq \frac{\left(\sum_{a 11 i} e_{n_{i}}^{2} H_{i}^{2}\right)^{1 / 2}}{H_{p}} \tag{5.64}
\end{equation*}
$$

If circuit frequency dependence is included in this equation, it becomes fairly involved for even simple circuits. For these situations computer analysis will usually be necessary. At low frequencies, however, where flicker noise dominates and circuit behavior is similar to that at DC, most circuits are easily analyzed. The results for three simple circuits are given below [106];
a) Source follower (Fig. 5.24a)

$$
\begin{equation*}
e_{n}^{i n} \simeq\left[\frac{a_{f}}{W_{1} L_{1}}\left[1+\left(\frac{L_{1}}{L_{2}}\right)^{2}\right]\right]^{1 / 2} \tag{5.65}
\end{equation*}
$$

b) Common source amplifier (Fig. 5.24b)

$$
\begin{equation*}
e_{n}^{i n} \simeq\left[\frac{a_{f}}{W_{1} L_{1}}\left[1+\frac{\mu_{n} a_{f_{2}}}{\mu_{p} a_{f}}\left(\frac{L_{2}}{L_{2}}\right)^{2}\right]\right] 1 / 2 \tag{5.66}
\end{equation*}
$$

c) Differential input amplifier (Fig. 5.24c)

$$
\begin{equation*}
e_{n}^{i n} \simeq\left[\frac{{ }^{2} a_{f_{1}}}{W_{1} L_{1}}\left[1+\frac{\mu_{n} a_{f_{2}}}{\mu_{p} a_{f_{1}}}\left(\frac{L_{1}}{L_{2}}\right)^{2}\right]^{1 / 2}\right. \tag{5.67}
\end{equation*}
$$

As is evident from all of these equations, noise performance at low frequency is improved by lowering the load transconductance relative to that of the driver. In practice, this is easier to achieve with a PMOS load and an NMOS driver (in gain stages), where advantage may be taken of the ratio of PMOS to NMOS mobilities.


Fig. 5.24 Noise models for Eqs. (5.65)-(5.67).

## CHAPTER 6

## TIME INTERLEAVED CONVERTER ARRAYS

As discussed in Chapters 1 and 2, there is considerable motivation for reducing the production costs of high-speed $A / D$ converters. Present day high-speed converters use techniques which require large die sizes and/or fairly exotic fabrication processes, which makes them quite expensive. Furthermore, these $A / D$ techniques will be exceedingly difficult to integrate along with a VLSI digital signal processor.

In the method presented here, the ultimate cost of a high-speed converter may be considerably reduced by employing the converter-array technique shown in Fig. 6.1. As will be shown, this technique may be used to achieve a substantial reduction in die size and power dissipation over other circuits, yet without sacrificing bandwidth or signal-to-noise ratio. This type of converter configuration is easily implemented in a MOS technology, allowing on-chip compatibility with dense digital signal processors. Furthermore, this technique may be used to achieve faster sampling rates than are possible with any single converter.

### 6.1 Comparison with Other Methods

The various characteristics of several common high-speed converter types have been discussed at length in the previous chapters. A summary of the more relevant characteristics of these methods is presented in Table 6.1. In this table, it has been assumed that all converters are implemented in the same MOS process. Relative values may vary by perhaps $30 \%$ depending upon specific assumptions. Note that the conversion rate


Fig. 6.1 Converter array technique uses a number of converters with interleaved sampling times to achieve a fast effective conversion rate. An n-bit, m-way array is shown.

TABLE 6.1

| Method | Relative Die Size D | Time Slots per Conversion T $(\mathrm{n}=7)$ | Throughput per Area $1 /(D \cdot T), n=7$ $\times 10^{-3}$ |
| :---: | :---: | :---: | :---: |
| - I-S tep Parallel (Flash) | $2^{n}$ | 2 | 4 |
| - 2-Step Parallel (I/2-Flash) | $3 \cdot 2^{n / 2}$ | 3-4 | 7-9 |
| - $n$-Step Pipeline (Succ. Approx.) | $4.5 n$ | 4-6* | 5-8* |
| - n-Step Sequential (Succ. Approx.) | $1.5 n$ | $n+1,(8)$ | 12 |
| - 4-Way Successive Approx. Array | 4(1.5n) | $(\mathrm{n}+1) / 4,(2)$ | 12 |

Summary of A/D converter characteristics.
per time slot for the pipeline converter is actually half to a third of that indicated in Chapter 3 where converters of this type were discussed. This is because the minimum time slot for a pipeline converter is 2 to 3 times longer than that of the other converters listed (in a given process), due to amplifier settling requirements. It has been assumed for the successive approximation converters, that binary-weighted capacitors, or their equivalent, are used in the respective designs.

A figure of merit which is rarely cited in converter literature, is the relative conversion rate (samples per second) per unit of integrated circuit chip area. This is listed for each of the converter types in the last column of Table 6.1. As is evident, the converters which display the best area efficiency are in fact those with the lowest conversion rate. Thus, although the successive approximation technique is slower than any of the other converters listed, it usually will use its area more efficiently. It would be desirable, if a technique could be developed which would allow the throughput of parallel circuits to be obtained, yet with the area efficiency of the successive approximation method. As is indicated in the last row of Table 6.1, an array of interleaved successive approximation converters may be used to achieve this desired end. Note that for the 7-bit case, 4 parallel converters are needed to achieve a throughput equal to that of the flash approach, but with only a third of the required die-size. Further, this area savings becomes even more pronounced at higher resolution levels, as shown in Fig. 6.2.

This array technique may also be used to achieve significantly higher bandwidths than are possible with any single converter. By interleaving more converters, conversion rates limited only by the sample and hold aperture times may be obtained. Thus, at the expense of die area,


Fig. 6.2 Relative area comparison between flash and array converter methods as a function of resolution level. Small numbers along graph indicate size of array needed to equal flash conversion rate.
even a slow technology may be used to achieve fast conversion rates. Similarly, a fast technology may be used to achieve bandwidths that were previously impossible to obtain.

As was shown in Table 6.1, a total of $n+1$ time slots are needed for an n-bit successive approximation converter. Thus 9 different time states will be nneded for an 8-bit converter. In order to reduce reconstruction or singal processing difficulties, it is generally desirable to stagger the sampling periods by constant time intervals, as shown in Fig. 6.3a. If the smallest unit of time displacement between converters is one time slot, then only certain array sizes may be realized for a given resolution converter. This is demonstrated in Fig. 6.3b. For example, in the 8-bit case just considered, only a three or nine way array is possible. If more time slots are added, however, new configurations may be realized. These additional time slots may be used to provide more charging time in the sample mode or MSB bit times ${ }^{1}$ or even thrown away. Thus in the eight bit case, adding one time slot would allow 2, 5 or 10 way arrays to be used. Note that in most cases, allowing fractional time slot displacements will increase the number of allowable arrays. In particular, delays of $1 / 2$ period are quite simple to generate if the input clock duty cycle is exactly $50 \%$. Extreme care must be taken if this is done, however, as andesirable sampling time skew may develop between converters which trigger on the positive and negative clock edges. As discussed in the next section, this may lead to a considerable degradation in performance.

[^9]$T_{1}$

| $S$ | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | A/D | $\# 1$ |
| :--- | :--- | :--- | :--- | :--- | :--- | :--- | :--- | :--- | :--- | :--- |
| 6 | 7 | 8 | $S$ | 1 | 2 | 3 | 4 | 5 |  | \#2 |
| 3 | 4 | 5 | 6 | 7 | 8 | $S$ | 1 | 2 |  |  |

Fig. 6.3a Staggering of time slot assignments maintains a uniform time interval between sampling periods. A 3-way 8-bit array is shown.

| Total time slots | 7 | 8 | 9 | 10 | 11 | 12 |
| :--- | :--- | :--- | :--- | :--- | :--- | :--- |
|  | 7 | 2 | 3 | 2 | 11 | 2 |
| l time slot <br> min. delay |  | 8 |  | 10 |  | 4 |
|  |  |  |  |  |  | 6 |

Fig. 6.3b Allowable number of converters in an array for a constant period between sampling times.

### 6.2 Error Analysis

This time interleaved array technique may be used with any type of A/D converter. As will be shown, however, a variation of converter quantizing characteristics within an array may introduce additional noise or distortion. This will bend to make some converter types more suitable for use in arrays than others, and may result in a slightly increased component matching requirement for a specified performance level.

### 6.2.1 Noise and Distortion Error Power

As discussed in Chapter 3, the quantizing error of an $A / D$ converter at an input voltage $x$, is given by

$$
\begin{equation*}
\varepsilon(x)=\delta(x) \tag{6.1}
\end{equation*}
$$

where

$$
\begin{equation*}
\delta(x)=\hat{x}-(a x+b) \tag{6.2}
\end{equation*}
$$

in which $\hat{x}$ is the quantized value, and $a$ and $b$ are best fit gain and offset terms which minimize the toal error power,

$$
\begin{equation*}
E_{p}=\left\langle\delta^{2}\right\rangle \tag{6.3}
\end{equation*}
$$

If instead of a single converter, an array of interleaved converters is employed, the error analysis becomes more involved. This situation is depicted in Fig. 6.4, where the transfer functions of two converters in an array are shown, each of which exhibits different gain, offset and nonlinearity. For this case, the error in the ith converter is given by

$$
\begin{equation*}
\varepsilon_{\mathfrak{i}}(x)=\delta_{\mathfrak{j}}(x)+\Delta_{\mathfrak{i}}(x) \tag{6.4}
\end{equation*}
$$

where

$$
\begin{equation*}
\delta_{i}(x)=\hat{x}_{i}-\left(a_{i} x+b_{i}\right) \tag{6.5}
\end{equation*}
$$



Fig. 6.4 Quantization and nonlinearity error in a converter array.
and

$$
\begin{equation*}
\Delta_{i}(x)=\left(a_{i}-a_{\min }\right) x+b_{i}-b_{\min } \tag{6.6}
\end{equation*}
$$

In these equations, $\delta_{i}(x)$ indicates the error that would exist if the ith converter were used alone, while $\Delta_{i}(x)$ indicates an additional error which exists when the ith converter is used as part of an array. The best fit gain and offset of the ith converter is represented by $a_{i}$ and $b_{i}$, while the best fit gain and offset of the array as a whole is given by $a_{\min }$ and $b_{\min }$. This results in an average error power for an m-way array, of

$$
\begin{equation*}
E_{p}=\frac{1}{m} \sum_{i=1}^{m}\left\langle\left(\delta_{i}+\Delta_{i}\right)^{2}\right\rangle \tag{6.7}
\end{equation*}
$$

or, expanding in terms of the above definitions,

$$
\begin{align*}
E_{p}= & \frac{1}{m} \sum_{i=1}^{m}\left[\left(a_{i}-a_{\min }\right)^{2}\left\langle x^{2}\right\rangle+\left(b_{i}-b_{\min }\right)^{2}\right. \\
& +\left\langle\delta_{i}^{2}\right\rangle+2\left(a_{i}-a_{\min }\right)\left(b_{i}-b_{\min }\right)\langle x\rangle \\
& \left.+2\left(a_{i}-a_{\min }\right)\left\langle\delta_{i} x\right\rangle+2\left(b_{i}-b_{\min }\right)\left\langle\delta_{i}\right\rangle\right] \tag{6.8}
\end{align*}
$$

For input signals which are asynchronous with the sampling clock, the last two terms of this equation are near zero, even for badly matched arrays. This results in the approximation,

$$
\begin{equation*}
E_{p} \simeq \frac{1}{m} \sum_{i=1}^{m}\left[\left\langle\delta_{i}^{2}\right\rangle+\left\langle\Delta_{i}^{2}\right\rangle\right] \tag{6.9}
\end{equation*}
$$

The minimizing values of $a_{\min }$ and $b_{\min }$ are obtained by differentiating Eq. (6.8) with respect to both variables and solving for the zero of each derivative. This occurs at simply $a_{\min }=\bar{a}_{i}$ and $b_{\min }=\bar{b}_{i}$, resulting
in an error power of

$$
\begin{equation*}
E_{p}=\sigma_{a}^{2}\left\langle x^{2}\right\rangle+\sigma_{b}^{2}+\left\langle\bar{\delta}_{i}^{2}\right\rangle \tag{6.10}
\end{equation*}
$$

in signals for which $\langle x\rangle \simeq 0$, where

$$
\begin{equation*}
\sigma_{x}^{2}=\overline{x^{2}}-\bar{x}^{2} \tag{6.11}
\end{equation*}
$$

These equations imply that converter gain and offset variations within an array will result in increased error power. These variations may be due to mismatches in gain or offset establishing components, or from differences in converter nonlinearities, which may result in a slightly different best fit gain and offset for each converter. In converters where these variations are well controlled, however, the array error power will be approximately the average of the individual error powers due to quantization and nonlinearity.

In both single converters and converter arrays, there is one additional source of error power; phase skew or jitter of converter sampling. Specifically, if the aperture timing of a comparator's sample and hold circuit exhibits any variance in period, the overall error power will increase. Phase skew is particularly troublesome in converter arrays, as it may result from even a slight difference in timing line layout or loading between converters.

The average error power for an m-way array of interleaved samplers may be given by

$$
\begin{equation*}
E_{p}=\frac{1}{m} \sum_{i=1}^{m}\left\langle\left(\hat{x}_{i}-x\right)^{2}\right\rangle \tag{6.12}
\end{equation*}
$$

where $\hat{x}_{i}$ is the sampled value of $x$ for the $i$ th sample and hold. For $a$ sinusoidal input waveform, and for distortionless samplers which exhibit
a timing skew,

$$
\begin{equation*}
x=A \sin (\omega t) \tag{6.13}
\end{equation*}
$$

and

$$
\begin{equation*}
\hat{x}=A \sin \left(\omega\left(t+t_{i}-t_{\min }\right)\right) \tag{6.14}
\end{equation*}
$$

where $t_{i}$ is the absolute delay in the $i$ th sampler and $t_{\text {min }}$ is a minimizing sampling delay of the array as a whole. This results in an average error power given by

$$
\begin{equation*}
E_{p}=\frac{A^{2}}{m} \sum_{i=1}^{m}\left\langle\left(\sin \left(\omega\left(t+t_{i}-t_{\min }\right)\right)-\sin (\omega t)\right)^{2}\right\rangle \tag{6.15}
\end{equation*}
$$

in which $t_{\min }$ is defined to be the value which minizes $E_{p}$. The above expression may be expanded to give

$$
\begin{align*}
E_{p}= & \frac{A^{2}}{m} \sum_{i=1}^{m}\left[\left\langle\sin ^{2}\left(\omega t+\phi_{i}\right)\right\rangle\right. \\
& \left.-\left\langle 2 \sin \left(\omega t+\phi_{i}\right) \sin (\omega t)\right\rangle+\left\langle\sin ^{2}(\omega t)\right\rangle\right]  \tag{6.16}\\
= & \frac{A^{2}}{m} \sum_{i=1}^{m}\left(1-\cos \left(\omega\left(t_{\min }-t_{i}\right)\right)\right) \tag{6.17}
\end{align*}
$$

The value of $t_{\text {min }}$ may be obtained by differentiating the above and equating to zero, giving an error minimizing delay of simply $t_{\min }=\overline{t_{i}}$. The expression for $E_{p}$ may now be expanded, resulting in

$$
\begin{equation*}
E_{p}=\frac{1}{m} \sum_{i=1}^{m}\left[\frac{\omega^{2}\left(t_{i}-\overline{t_{0}}\right)^{2}}{2!}-\frac{\omega^{4}\left(t_{i}-\overline{t_{0}}\right)^{4}}{4!}+\ldots\right] \tag{6.18}
\end{equation*}
$$

which for small values of time skew may be approximated by

$$
\begin{equation*}
E_{p} \simeq \frac{A^{2} \omega^{2} \sigma_{t}^{2}}{2} \tag{6.19}
\end{equation*}
$$

The amount of error power resulting from gain mismatch, offset mismatch and sampling skew is summarized in Table 6.2. To determine the overall signal-to-noise ratio of a converter array, the error powers listed in the second column need only be added together, along with the average error power due to individual nonlinearities, and compared with the original signal power ( $V_{p p}{ }^{2} / 8$ ). The relative sensitivities of the error power to different non-idealities are shown in the third column. In this comparison, the indicated variance of each of the three parameters, is that which produces a degradation of $S / N$ ratio to 40 dB when all other noise sources are zero. Note that the overall error power is least sensitive to gain variance and most sensitive to phase skew and jitter. As a point of reference, however, the gain matching requirement is actually twice as severe as the most significant bit component matching requirement of a single converter, for a nonlinearity limited $S / N$ ratio of 40 dB . This implies that arrays of converters which have a specific gain determining element, such as $\mathrm{R}-2 \mathrm{R}$ ladders, will have a more severe component matching requirement when used in an array than when used individually (for a specified $S / N$ ratio). Similarly, multiple flash circuits [108] may suffer from a reduced S/N ratio, as "bow" errors will modify the effective converter gains and offsets. All types of converter arrays, however, will display reduced performance levels in the presence of random component variations. The $S / N$ ratio performance of 7-bit flash and weighted capacitor converter arrays which employ precision components with Gaussian error distributions, has been simulated via the program listed in Appendix A. The results are summarized in Figs. 6.56.7. As is apparent from these figures, both the flash and bipolar input weighted-capacitor converter methods display a degradation in S/N ratio

TABLE 6.2

| Noise <br> Source | Sinusoid <br> Error Power | Requirement <br> for Sinusoid <br> $S / N=40 \mathrm{~dB}$ |
| :---: | :---: | :---: |
| Gain (a) <br> Mismatch | $\frac{V_{p-p}^{2}}{8} \sigma_{a}^{2}$ | $\frac{\sigma_{a}}{\bar{\alpha}}=1 \%$ |
| Offset (b) <br> Mismatch | $\sigma_{b}^{2}$ | $\frac{\sigma_{b}}{V_{p-p}}=.71 \%$ |
| Phase ( $t$ ) <br> Skew | $\frac{V_{p-p}^{2} \omega^{2} \sigma_{t}^{2}}{8}$ | $\frac{\omega \sigma_{t}}{2 \pi}=.16 \%$ |
| $\sigma_{X}^{2}=\overline{X^{2}}-\bar{X}$ |  |  |

Magnitude and sensitivity of error power to mismatch in a converter array. Values in table are for an input sinusoid centered about zero.


Fig. 6.5 Simulated SNR performance of 100 flash converters when configured as 1,2,4 and 8-way arrays, as a function of resistor mismatch. Ladder resistors are assumed normally distributed about their nominal value.


Fig. 6.5 Simulated SNR performance of 100 unipolar input weightedcapacitor $A / D$ converters as a function of array size and unit capacitor matching.



The $k=0$ case in Eq. (6.20) corresponds to the original signal component, while the remaining terms are a form of aliasing distortion. This distortion is demonstrated for the case of a sinusoidal input signal and a 4-way array in Fig. 6.8b, as compared with the ideal case in Fig. 6.8a. As is evident from the figure, sidebands develop about fractions of the sampling rate, which are identical in spectral content to the original input signal. The relative magnitude of each sideband, which except for the $k=m / 2$ case exist in pairs, is determined by the appropriate order of discrete transform as defined above. Note that for nonzero DC levels, gain errors will produce noise at each multiples of $f_{s} / m$.

Jnitike :he gain ariasions discussed aoove, a mismatsh of converter offsets will produce a constant noise, which is independent of the input frequency spectrum. The reconstructed noise waveform for an offset mismatched converter array that is otherwise ideal, may be given by

$$
\begin{equation*}
\hat{x}(t)=\sum_{k=-\infty}^{\infty} \frac{1}{m} \sum_{i=1}^{m} b_{i} h(t-(m k+i-1) T) \tag{6.22}
\end{equation*}
$$

Whers 0 ; is the offise of the ith convorter, $T$ is the sampling pericd and $h(t)$ is the impulse response of the reconstruction filter. The above noise expression has a spectral distribution given by

$$
\begin{align*}
z\{\hat{x}(t)\} & =\int_{-\infty}^{\infty} \hat{x}(t) e^{-j \omega t} d t  \tag{6.23}\\
& =\int_{-\infty}^{\infty} \sum_{k=-\infty}^{\infty} \frac{1}{m} \sum_{i=1}^{m} b_{i} h(t-(m k+i-1) T) e^{-j \omega t} d t \tag{6.24}
\end{align*}
$$

which may be expressea as

$$
\begin{equation*}
F\{\hat{x}(t)\}=\sum_{k=-\infty}^{\infty} e^{-j \omega T m k} \frac{1}{m} \sum_{i=1}^{m} b_{i} e^{-j \omega(i-1) T} H(\omega) \tag{6,25}
\end{equation*}
$$

performance of typically 1 to 2 dB when used in arrays. Note that the unipolar input converter, however, maintains a tight performance distribution over all of the array sizes simulated. The reason for the improved feriormance of this converter civer the bipolar input tipe, is principally due to the smaller effect of MSB component mismatch on best fit gain and offset. Note that even for the flash and bipolar input converters, however, a constant performance level may be obtained in large arrays by doubling the component matching requirement over that required for a single converter. This is equivalent (in matching requirement) to adding 1 bit of resolution to each of the converters.

### 6.2.2 Spectrai Distribution of Voise and Distortion

Of equal importance to the error power magnitude is the noise and distortion spectral distribution. Analysis of the power spectrum resulting from a gain mismatched array is similar to that performed by Messerschmitt for the case of induced gain errors by "bit robbing" in digital channel banks [109]. For an m-way array with gain mismatch, an irput signai of onver soectrum $\varsigma_{A}^{\prime}$ ' will result in a reconstructed output power spectrum (assuming impulse sampling) given by

$$
\begin{equation*}
S_{\dot{x}}(\omega)=\left|H(\omega) f_{s}\right|^{2} \sum_{k=-\infty}^{\infty}\left|G_{k}\right|^{2} S_{x}\left(\omega-k 2 \pi f_{s} / m\right) \tag{6.20}
\end{equation*}
$$

where $H(\omega)$ is the transfer function of the reconstruction filter, $f_{s}$ is the sampling frequency and $G_{k}$ is the discrete Fourier transform of gain in the array

$$
\begin{equation*}
G_{k}=\frac{1}{m} \sum_{i=1}^{m} a_{i} e^{-j 2 \pi k(i-1) / m} \tag{6.21}
\end{equation*}
$$

(a) Perfectly Matched Array
(b) Gain Mismatch and Phase Skew


Fig. 6.8 Spectrum of a reconstructed sinusoid for a 4-way converter array.

This reduces to

$$
\begin{equation*}
F\{\hat{x}(t)\}=\sum_{k=-\infty}^{\infty} \delta(\omega-2 \pi k / m T) \frac{1}{m} \sum_{i=1}^{m} b_{i} e^{-j 2(i-1) k / m} H(\omega) \tag{6.26}
\end{equation*}
$$

which leads to an output noise power spectrum of

$$
\begin{equation*}
N_{\dot{x}}(\omega)=\left|H(\omega) f_{s}\right|^{2} \sum_{k=-\infty}^{\infty}\left|B_{k}\right|^{2} \delta\left(\omega-k f_{s} / m\right) \tag{6.27}
\end{equation*}
$$

where $B_{k}$ is the discrete Fourier transform of best fit offsets in the array. This noise is illustrated in Fig. 6.8c for a 4-way array, along with the original signal spectrum. As is evident from the above analysis, offset related noise appears at multiples of $f_{s} / m$, just as gain errors do for nonzero DC input levels.

The noise due to sampling time skew in a converter array is most easily analyzed with respect to an input sinusoid, rather than for an arbitrary distribution as was done in the gain error case. If the ith converter is assumed to exhibit a sampling time error of $\delta t_{i}$ and is otherwise assumed to be ideal, the sampled value may be given by

$$
\begin{equation*}
\hat{x}_{i}(t)=A \sin \left(\xi\left(t+\delta t_{i}\right)\right) \tag{6.28}
\end{equation*}
$$

This may be expanded, to give

$$
\begin{align*}
\hat{x}_{i}(t)= & A[\sin (\xi t)+\xi \delta t \cos (\xi t) \\
& \left.-\xi^{2} \delta t_{i}{ }^{2} \frac{\sin (\xi t)}{2!}-\xi^{3} \delta t_{i}{ }^{3} \frac{\cos (\xi t)}{3!}+\ldots\right] \tag{6.29}
\end{align*}
$$

which allows the converter error

$$
\begin{equation*}
\varepsilon_{i}(t)=A \sin \left(\xi\left(t+\delta t_{i}\right)\right)-A \sin (\xi t) \tag{6.30}
\end{equation*}
$$

to be approximated by.

$$
\begin{equation*}
\varepsilon_{i}(t)=A \xi \delta t_{i} \cos (\xi t) \tag{6.31}
\end{equation*}
$$

for small values of $\delta t_{i}$. For an array of interleaved converters, this results in a reconstructed output error waveform given by

$$
\begin{equation*}
\hat{x}(t)=\sum_{k=-\infty}^{\infty} \frac{1}{m} h(t-(m k+i-1) T) \xi \delta t_{i} \cos (\xi t) \tag{6.32}
\end{equation*}
$$

Note that this function is identical to that generated from a gain mismatched array for an input signal of $\cos (\xi t)$ and with effective gains of $\mathrm{a}_{\mathbf{i}}=\xi \delta \mathrm{t}_{\mathrm{i}}$. Thus the output distortion power spectrum for the phase skewed situation is approximately the same as the total output power spectrum for this latter case. Thus

$$
\begin{equation*}
N_{\dot{x}}(\omega) \simeq\left|A H(\omega) f_{s}\right|^{2} \sum_{k=-\infty}^{\infty}\left|\Phi_{k}\right|^{2} \Pi\left(\frac{\omega-k f_{s} / m}{2 \xi}\right) \tag{6.33}
\end{equation*}
$$

where $\Phi_{k}$ is the discrete transform of $\xi \delta t_{i}$ and $\Pi$ is given by

$$
\begin{equation*}
\Pi(x)=\frac{1}{2} \delta(x+1 / 2)+\frac{1}{2} \delta(x-1 / 2) \tag{6.34}
\end{equation*}
$$

Note that when $\delta t_{\mathbf{i}}$ is defined relative to the average sampling period, the $k=0$ term in the above is identically zero.

As is apparent from this analysis, and Fig. 6.8b, output distortion due to phase skew is similar in spectral content to that of converter gain error. The principal difference is that the magnitude of the distortion sidelobes in the phase skew case are frequency dependent, while in the gain error case they are not.

The previous spectral analysis has not specifically included the effects of quantization and nonlinearity. In converters with asynchronous sources, these effects usually appear as broadband noise and simple or aliased harmonic distortion. This is also the situation with converter arrays if the converter nonlinearities are similar. If the nonlinearities are different, however, a given input level may be encoded differently by different converters, resulting in a "dithered" output signal. This is ultimately equivalent to a gain and offset mismatch, as the different nonlinearities will result in different best fit converter parameters. In real converter arrays, which do not otherwise exhibit a gain or offset mismatch, this may result in an overall $\mathrm{S} / \mathrm{N}$ degradation of perhaps a few $d B$ in the reconstruction of a full scale input signal (see experimental results). This noise may be objectionable in some systems, particularly during low level or idle input conditions. In video systems it is often desirable, however, as it reduces the appearance of sharp quantization boundaries in picture areas where the luminance changes slowly [ ]. If necessary, the dithering at a given voltage may be eliminated by altering the effective converter offsets (by either analog or digital means) so as to place a given voltage within a common quantization level.

Of final note in the overall error analysis is the effect of the previously considered noise and distortion sources on the $S / N$ ratio of a bandlimited input signal. As significant portions of the noise and distortion products are clustered at, or near the Nyquist rate, the actual inband $S / N$ ratio of an array, may be larger than what the power figures of Table 6.2 may indicate. More specifically, for a system whose signal bandwidth is limited to $\mathrm{f}_{\mathrm{s}} / k(k \geq 2)$, up to $k / 2$ badly
matched converters can be used in an-array with no degradation of inband signal-to-noise ratio.

### 6.3 A Monolithic Time-Interleaved Converter Array

### 6.3.1 Chip Description

To demonstrate the characteristics of this array-technique, a 4channel CMOS test-chip has been fabricated in a 10 micron metal-gate CMOS process [113118. This chip consists of 4 time-interleaved 7-bit weighted-capacitor bipolar input $A / D$ converters, associated timing and control logic and an implanted MOS voltage reference. Also included on this chip is a 4-way input multiplexor, which enables some, or all of the individual $A / D$ converters to be connected with a specific input line. This allows each of the converters to be operated as part of an array (common inputs), or independently. A block diagram of this test chip is shown in Fig. 6.9. In this figure, the converters are seen to exist as a matrix of bit cells, where each row corresponds to a separate A/D converter, and where each column represents the bit-cells which are active in a given time-state. For example, during the first time-state, the first $A / D$ converter is sampling, the second $A / D$ converter is determining bit 6 of its conversion sequence and so on. A total of 8 timestates are required for a complete 7-bit conversion, which, because there are 4 converters per chip, results in an effective throughput of one conversion every two clock cycles. Note that this throughput may be doubled by interleaving two chips for a total of an 8-way converterarray.

This entire chip is defined in terms of general purpose cells, which were placed and interconnected via an automated layout program. Although


Fig. 6.9 Block diagram of converter array chip.
this approach resulted in a considerable increase in die size, it greatly simplified the required layout effort and provided a library cell-base from which other converter configurations can be designed. In theory, other converters may be realized by simply modifying the cell interconnect list, and rerunning the layout program.

An example of one of the cells used in this chip, a bit slice of a successive approximation register, is shown in cell form in Fig. 6.10. As is evident from the figure, the cells are defined as constant height, variable width units, where most of the interconnect occurs along one of the cell sides (bottom of cell in Fig. 6.10a). Typically, the cells exist in back to back rows, where the $V_{\text {SS }}$ line can be shared by top and bottom cells (the $V_{D D}$ connection is made through the substrate). Summing node and analog ground lines are shared by cells which are side by side, while some timing and data information may be passed vertically between cell rows where necessary. A layout of the complete chip, in cellular form, is shown in Fig. 6.11. Each of the four cell rows along the middle of the chip is a separate $A / D$ converter, while timing logic exists in the cells at the left and control and buffer circuitry are placed at the right.

A metal-gate CMOS process was chosen for this circuit, for reasons of simplicity and ease of fabrication at Sandia Laboratories. A more advanced process would have been desirable from the standpoint of performance, but was unnecessary for the demonstration of the concept, and not immediately available. A layout of a small inverter in this process which demonstrates most of the minimum design rule dimensions, is shown in Fig. 6.12. As is apparent from this figure, this process required that diffused $n+$ or $p+$ guard-bands (which are not self-aligned)



Fig. 6.11 Layout of converter array chip in cell form.


Fig. 6.12 Minimum geometry CMOS inverter in this process with symmetrical pull-up/pull-down. Die size is about $18.5 \mathrm{mil}^{2}$.
be placed around each transistor. Although this ultimately reduced many of the various leakage currents, and certainly helped reduce latchup (which was never observed), it greatly increased the circuit die size. As may be deduced from Fig. 6.12, a minimum geometry inverter with equal pull-up and pull-down capability occupies about $18.5 \mathrm{mil}^{2}$. A complete schematic of the SAR bit cell, which was previously shown in cell form, is presented in Fig. 6.13. This cell, or one quite similar to it, is used in each of the 28 bit cells present in the converter array. The state of this cell, is established by the voltage present upon the floating node "Q̄." This node is pulled high or low at the appropriate bit time, by the complementary devices shown on the right of the cell. For example, during the sampling interval it is pulled down to $V_{S S}$, while during the hold interval it is brought to $V_{D D}$. In turn, this node allows the associated precision capacitor to be charged to the appropriate value, by way of a pair of CMOS transmission gates. The state of the cell is read during the associated converters final bit time, by passing its value onto a tri-state data line which is common to all of the cells in a given time-state. Note that this cell is totally dynamic, in that no method for maintaining $\bar{Q}$ at a constant potential in the presence of leakage currents exists.

Several of the bit positions use slight modifications of this cell, for example, the first bit position uses the cell shown in Fig. 6.14. This cell has been modified in order to allow bipolar input operation, as discussed in Chapter 3. The principle difference is that this bit position doesn't require either $\mathrm{Sel}_{\boldsymbol{i}}$ or Hold timing lines. Similarly, the cells for the 6th and 7th bit positions are shown in Figs. 6.15 and 6.16. These cells have been simplified by eliminating the data read


Fig. 6.14 Schematic of cell used in first bit position. Circuit has been modified from
that shown previously to allow for bipolar intput operation.


Fig. 6.15 Schmatic of cell used in sixth bit position.


Fig. 6.16 Schematic of cell used in seventh bit position.
circuitry from each, and by eliminating the "next time state" logic from the 7th bit cell. For these bit locations, the state of each cell is transferred directly from the comparator to the output buffer (rather than through the cell), so that the data read circuitry wasn't required. Similarly, as the conversion ends after bit 7 , there is no reason to store the result of bit 7 within the bit cell. If this were done, a timing conflict would actually occur as the next sample interval immediately follows the 7th bit time.

The various cells presented above provide the necessary circuitry for 7 of the binary-weighted capacitors. As discussed in Chapter 3, however, one additional capacitor exists which is equal in size to the smallest capacitor in the binary-weighted array. Circuitry for driving this capacitor is shown in Fig. 6.17a. This circuitry connects the capacitor's bottom plate to $V_{i n}$ during the sample mode and to ground thereafter while switches exist to line "Vy" which are phased in the reverse of this order. This extra line is used with two additional cells to provide a programmable offset voltage, as shown in Fig. 6.17b. Depending upon the state of two digital offset control pins, the offset of each $A / D$ converter may be adjusted from +0 to $+3 / 4$ LSB in $1 / 4$ LSB increments. Although many converters employ circuitry to provide $+1 / 2$ LSB of offset, the programable feature used here was added to allow a simple evaluation of noise resulting from an offset mismatched array. This circuitry would be neither necessary, or particularly desirable in a chip designed for actual use in a system. The complete schematics for these two programmable offset cells are shown in Fig. 6.18. Note that when the programmable input pins are left floating, the first cell connects it's capacitor between $V_{y}$ and the $\sum$ node (providing 1/2 LSB offset)


Fig. 6.17a Circuitry for "last" capacitor.


Fig. 6.17b Interconnect of cell shown in (a) with cells shown on next page for control of converter offset.


Fig. 6.18a 1/2 LSB offset control cell.


Fig. 6.18b $1 / 4$ LSB offset control cell.
while the second cell does not.
Timing information for each of the various bit cells, is generated by a self starting dynamic ripple converter. This circuit, which is shown schematically in Fig. 6.19, consists of an 8-stage shift register and a 7-input dynamic "OR" gate. This "OR" gate uses a "wired OR" function from the first 7 stages of the shift register to determine whether a "1" or a "0" should be inserted into the input of the shift register Both $Q$ and $\bar{Q}$ outputs from each stage are buffered by large geometry gates and sent to the bit cells as the lines "Sel ${ }_{i}$ " and " $\overline{\mathrm{Sel}_{i}}$," where $\boldsymbol{i}$ indicates the corresponding time state (1-8).

Another cell, the voltage comparator, is shown in Fig. 6.20. This circuit is single-ended and is used with its input node tied directly to the top plate of a capacitor array. During a given converter's "sample" interval, the associated comparator cell is offset cancelled, by shorting the input inverter's output and input together. During subsequent bit times, the input inverter is used as a linear amplifier, where the output reflects the value of the input voltage with respect to its offset point. The output of this inverter is then regenerated and buffered, by the flip-flop which follows it. The large geometry devices of the input inverter allow a relatively high transconductance to be obtained. Using the simple MOS models presented earlier, the effective $G_{m}$ of the input stage may be shown to be over 2 m Mhos with a lOV power supply. Similarly, the shorting switch possesses an "ON" resistance of between 500 and $600 \Omega s$. As discussed in Chapter 5, where comparators of this type were considered, this results in an effective settling time constant of,


Fig. 6.19 Self-starting 8-stage dynamic ripple counter. (a) General configuration. (b) Schematic of shift register stage. (c) Input logic.


Fig. 6.20a Schematic of voltage comparator.


Fig. 6.20b Die photograph of comparator.
$\tau_{a q} \simeq \frac{1}{C_{i n}\left(R_{\text {in }}+500 \Omega\right)}$
where $R_{i n}$ is the impedance of any additional series switches or devices (in this case the impedance of the $V_{i n} / V_{\text {ref }}$ bus selector) and $C_{i n}$ is the effective input capacitance. ${ }^{2}$ For the switches and capacitors used in this circuit, $C_{i n} \simeq 7 p F$ and $R_{i n} \simeq 600 \Omega$, so $\tau_{a q} \simeq 8.4 \mathrm{nsec}$. This results in a comparator -3 dB bandwidth of about 19 MHz . Note that, although this comparator is small and fairly fast, a much more sophisticated comparator could be used, without substantially impacting the overall die size. Modifications of the design to improve resolution or powersupply rejection, for instance, would be much easier to incorporate in this array method than with a flash technique, as the number of required comparators is so much smaller.

A timing diagram of a complete converter cycle for one of the converters on the chip (\#1), is shown in Fig. 6.21. The timing diagrams of the other converters are nearly identical but are shifted in position relative to the converter shown.

The precision capacitors used in this chip are defined by thin oxide cuts over $n+$ diffusion regions and consist of about 10 pF of total capacitance for each of the 4 converters. Although the converters in this chip employ binary-weighted capacitors, higher resolution converters could be realized with little or no increase in array capacitance, by using a "split array" technique as discussed in Chapter 3.
${ }^{2}$ For bipolar input converters of the type used here, the value of $C_{\text {in }}$
is less than the array capacitance, as the MSB capacitor is not
charged to $V_{i n}$.


Fig. 6.21 Timing diagram for one conversion cycle of one A/D.

In order to allow easier converter testing, and to demonstrate a concept which would be useful in a commercial device, an input multiplexor was used between the 4 signal input pins and each of the $4 \mathrm{~A} / \mathrm{D}$ converters. This multiplexor configuration is shown in Fig. 6.22. As is evident from this figure, each of the $A / D$ converters may be tied to either of two signal input lines, based upon the state of an input select control pin (1 per converter). This allows the converters to be easily configured into various array configurations (common inputs), or operated independently. Circuitry for performing this multiplexing function is combined with the " $\mathrm{V}_{\mathrm{in}} / V_{\text {ref }}$ " bus logic, as shown in Fig. 6.23. In this circuitry, ratioed CMOS logic is used to provide a rapid output of $V_{i n}$ during the sample interval, with a slow recovery to $V_{\text {ref }}$ thereafter. This slow recovery is not detrimental to conversion time or accuracy, as no bit cells use the " $V_{i n} / V_{\text {ref }}$ " line until two time states after sampling. The two ratioed "AND" gates used in this circuit, are shown in Fig. 6.24.

The on-chip voltage reference uses a closed loop CMOS amplifier which has an implant-induced offset voltage of approximately .8 V . A schematic of this amplifier is shown in Fig. 6.25, while measured amplifier characteristics are presented in Table 6.3. In order to provide improved transient settling response and an increase in reference voltage, as well as to allow a controllable amount of "dither" to be added to the quantized signals, this amplifier is used in the clocked circuit configuration shown in Fig. 6.26a. Operation of this circuit under DC loading is demonstrated by the oscilloscope photograph in Fig. 6.26b. A die photograph of the complete converter array chip, is shown in Fig. 6.27.

Fig. 6.22 Input multiplexor configuration. Each A/D converter may be programmed to select either of two input lines.



Fig. 6.24 Ratioed "AND" gates of previous figure. Skewing of W/L ratios from normal values allows faster transmission of $V_{i n}$ to bus upon receipt of "sample" signal.


Fig. 6.25 CMOS op-amp with implant shifted offset voltage is used as a voltage reference.

## TABLE 6.3

| Gain | $10,000 \mathrm{~V} / \mathrm{V}$ |
| :--- | :---: |
| Bandwidth | 1.5 MHz |
| Slew Rate | $>1 \mathrm{~V} / \mathrm{\mu sec}$ |
| CMRR | 60 dB |
| Power Dis. | 1.5 mW |
| Die Size | $300 \mathrm{mi}{ }^{2}$ |

Measured op-amp characteristics.










Fig. 6.27 Die photograph of monolithic converter array.

### 6.3.2 Experimental Results

Reconstructed ramp and sinewave input signals, which were digitized by this array chip, are shown in Fig.6.28. Low frequency linearity is maintained to approximately a 2.5 MHz conversion rate, with operation at reduced linearity ( $5-6$ bits) continuing to 4 Hz . The low frequency linearity is dictated by binary-weighted capacitor matching, which in this case, appears to have been limited by consistent pattern-generator and mask run-out errors. These effects are usually pronounced in high speed circuits, which by necessity, must employ small capacitors. The capacitor errors are quite consistent from wafer to wafer, however, implying that a simple mask adjustment (or e-beam generated masks) should allow consistently good linearity in even higher resolution applications. High frequency conversion linearity appears to be dominated by signal-related power line noise affecting the single-ended comparator operation. This effect could be substantially reduced by using differential comparators, which as mentioned previously, could be more easily employed in this technique than with a flash method, as only a few comparators are used.

A spectrograph of a reconstructed 100 kHz sinewave after sampling at a 2 MHz rate with an entire 4-way array, is shown in Fig. 6.29a. Similarly, a spectrograph of the reconstructed output from a single converter within an array for the same input signal, is shown in Fig. 6.29. Power readings from several typical samples, which demonstrate 100 kHz SNR performance in both single converter and array configurations, are shown in Table 6.4. As is evident from the figures in these tables, a degradation of 1 or $2 d B$ occurs in the SNR in going from a single converter to a 4-way array. This difference is principally due to independent nonlinearities affecting the best fit converter gains and offsets,


Fig. 6.28 Original and reconstructed waveforms after sampling with converter array chip (5V/div. on vertical scale, lower trace of each pain inverted). (a) Single converter within an array at a 625 kHz conversion rate. (b) Four-way array at 3.5 MHz conversion rate (points shown correspond).

(b)


Fig. 6.29 Spectrum of reconstructed 100 kHz sinewave after sampling with array chip. (a) Four way array at 2 MHz sampling rate. (b) Single converter within an array at same clock rate as (a).

## TABLE 6.4

| 1 Converter |  | 2 Converters |  | 4 Converters |  |
| :---: | :---: | :---: | :---: | :---: | :---: |
| Converter | SNR | Converters | SNR | Converters | SNR |
| 1 | 40.3 |  |  |  |  |
| 2 | 41.9 | 1 and 3 | 39.4 | $1-4$ | 38.7 |
| 3 | 39.15 | 2 and 4 | 40.3 |  |  |
| 4 | 40.0 |  |  | 38.7 |  |
| Avg | 40.3 | Avg | 39.85 |  |  |

W1-2 | Converter |  | 2 Converters |  | 4 Converters |  |
| :---: | :---: | :---: | :---: | :---: | :---: |
| Converter | SNR | Converters | SNR | Converters | SNR |
| 1 | 41.2 |  |  |  |  |
| 2 | 42.2 | 1 and 3 | 39.4 | $1-4$ | 39.4 |
| 3 | 41.5 | 2 and 4 | 40.9 |  |  |
| 4 | 41.2 |  |  |  |  |
| Avg | 41.5 | Avg | 40.15 |  | 39.4 |

Measured 100 kHz SNR performance of two converter array chips at a 4 MHz clock rate ( 2 MHz sampling rate for a 4 -way array).
and a slight variance in feedthrough from the comparator reset switch. Signal-to-noise ratio performance as a function of input signal frequency is shown in Fig. 6.30. The degradation of performance at high frequencies is principally due to limited comparator bandwidth during the sample mode as discussed in Chapter 5, although some degradation due to sampling skew is evident as well. Channel isolation between converters is sufficiently high that no component of a full scale 100 kHz input signal to one converter is observable in the reconstructed quiet channel output of any other. A summary of the characteristics of this test converter array chip is given in Table 6.5.

The previous analysis regarding noise and distortion from nonideal arrays, was verified using this test chip. As mentioned previously, circuitry has been included on-chip for modifying the intrinsic offset of each $A / D$ converter by fixed fractional LSB amounts. Similarly, the apparent gain of each converter may be modified by simply resistively attenuating the input signal at the input to each converter (off-chip). Spectral plots of reconstructed sinewaves after artificially inducing gain and offset variations in this manner are shown in Fig. 6.31. Note that noise spikes are at the frequencies indicated in Fig. 6.8, while the magnitude of the additional error power is approximately that indicated by Table 6.2. The effect of an induced offset mismatch on the quitechannel noise output spectrum is shown in Fig. 6.32b, as compared to the nominal output noise spectrum in Fig. 6.32a. Note that the noise floor in each of these figures is that of the spectrum analyzer, and not broadband noise from the $A / D$ or $D / A$ converters.


Fig. 6.30 Measured SNR performance as a function of input frequency for a 4 -way array operating at a 2 MHz sampling rate.
TABLE 6.5

| NUMBER OF MOS DEVICES | 1250 |
| :--- | ---: |
| TOTAL POWER CONSUMPTION | 250 mW |
| MAXIMUM SAMPLING RATE PER CHIP | 4 MHz |
| DIE SIZE | $208 \times 278 \mathrm{MLLS}$ |
| RESOLUTION (2.5 MHz SAMPLE RATE) | 7 BITS |
| LINEARITY | $1 / 2 \mathrm{LSB}$ |
| S/N RATIO, FULL SCALE INPUT " | 39 db |
| PHASE SKEW AND JITTER | $<4 \mathrm{nsec}$ |
| REFERENCE VOLTAGE DRIFT, $0-70^{\circ} \mathrm{C}$ | $<1 \%$ |



Fig. 6.31 Noise and distortion due to induced converter mismatch. Spikes shown are caused by mismatch indicated. (a) Gain variation ( $\sigma_{a} / \bar{a}$ ) of approximately $7 \%$. (b) Offset variation $\left(\sigma_{b}\right)$ of approximately $0.5 \%$ at full-scale range.


### 6.3.3 Anticipated Characteristics of Implementations in An Advanced Process

If this basic design, which used a 10 micron, fully guard-banded CMOS process and program interconnected cells, were to be implemented in a modern 4-5 micron silicon-gate process using more area efficient techniques, a substantial improvement in die-size and performance would result. The die size would be reduced by a factor of four by simply scaling the major lithographic dimensions to half of their present size (shrinking 10 micron gates to 5 microns for instance). Further reductions in die size would occur if the field isolation regions were implanted or self-aligned, rather than consisting of diffused guard bands, and the computer generated cell-based layout was replaced with one drawn by hand. In fact, an overall reduction in die size by a factor of 6 appears to be conservative if the above improvements were incorporated into a new design.

As the maximum conversion rate is directly related to the capacitance which must be driven by internal nodes, the reductions in die size considered above, would all result in higher speed. Further improvements in conversion rate could be obtained by using optimized device sizes in all of the internal logic, rather than a standard cell approach, and by incorporating a lower resistivity interconnect than the $\mathrm{P}+$ diffusions which were available here. These various improvements would allow an improvement in conversion rate by at least a factor of four. This would imply that an 8-bit video bandwidth ( $10-15 \mathrm{MHz}$ ) converter could be fabricated in a die size of less than $10,000 \mathrm{mi}^{2}$.

### 6.4 Application to a Digital Signal Processor

Monolithic digital signal processors which include on-chip $A / D$ and D/A conversion circuits, would be useful in many applications if the circuits could be made inexpensively and with sufficiently high resolution. As high speed converters have tended to consume large die areas, and because very fast digital filters are difficult to implement in an MOS technology, however, single-chip processors have thus far been constrained to operate at near audio bandwidths [113]. Application of this array technique to a digital signal processor may allow higher frequency processors to become practical.

As this array technique will allow a fast converter to be realized in a smaller area than with other methods, more chip area is available for a digital processor within a maximum allowable die size. This would allow expanded capabilities in a single or few chip digital processor of the type illustrated in Fig. 6.33, although the digital processing section may still be difficult to implement in very fast systems. An alternative architecture, which takes advantage of the structure of a converter array, is shown in Fig. 6.34. In this case, the outputs from separate converters are processed independently and are only combined just before reconstruction. This architecture has the advantage of greatly reduced bandwidth in each digital processor and very simple expandability. This multiple-path technique has been employed with a variety of analog filter paths [14-16], for the realization of various bandpass and comb filtering functions, but has thus far not been applied to a digital processor. It would appear to be ideally suited to MOS implementations, as it trades several slower but potentially more area efficient circuits for a single high speed device (as was demonstrated above for $A / D$ converters), and


Fig. 6.33 Analog in/analog out digital signal processor employing a converter array front-


Fig. 6.34 Digital signal processor employing multiple path techniques throughout.
is easily expandable by simply interleaving additional processor paths or chips.

## CHAPTER 7

## CONCLUSIONS

Three basic high speed analog to digital conversion techniques have been analyzed with regard to implementation in a standard CMOS process. They are; the flash, successive approximation and pipeline converter methods. A new method for using $A / D$ converters has also been analyzed and demonstrated; the time-interleaved converter array. This method, which employs a number of converters with synchronized sampling times to achieve a fast effective conversion rate, has been shown to be useful in achieving higher bandwidths, or smaller die sizes at a given bandwidth than are possible with any single converter. Although this method will, in some cases, result in increased noise or distortion, these effects are both predictable and consistent and may be minimized in the design of the array. This technique has been demonstrated in a 4-way successive approximation array chip, which has been used to verify the analysis of array characteristics and to show the feasibility of the method. Extensions of this work into implementations in an advanced process or as part of a digital processing system have also been discussed. This technique should be useful in reducing the cost of both high speed monolithic converter circuits and single- or few-chip digital signal processors which must operate on high frequency input signals.

## REFERENCES

[1] D.N. Gregory, J.L. Bliss, I.D.B. Millar and C.J. Allen, "Color picture synchronization by the natlock system," J. SMPTE, 78, 611-614, August 1969.
[2] R.J. Bulter, "Remote color genlock," J. SMPTE, 78, 615-618, August 1969.
[3] H. Schmid, "Synchronization of remote program sources for color TV broadcasting," J. SMPTE, 78, 619-620, August 1969.
[4] D.E. Acker and R.H. McLean, "Digital time-base correction for video signal processing," J. SMPTE, 85, 146-160, March 1976.
[5] K. Kano, T. Jmai, M. Inaba, A. Sugimoto, Y. Itoh, Y. Inoue, and J. Shimizu, "Television frame synchronizer," J. SMPTE, 84, 129134, March 1975.
[6] J.B. Mately, "A digital framestone synchronizer," J. SMPTE, 85, 385-388, June 1976.
[7] A.A. Goldberg, "PCM NTSC television characteristics." J. SMPTE, 85, 141-145, March 1976.
[8] R. Burkhardt and J. Wasser, "Digital television transmission with 34 ubits/s," J. SMPTE, 89, 244-248, April 1980.
[9] K. Yokoyama, S. Nakagawa, and H. Katayama, "An experimental digital videotape recorder," J. SMPTE, 89, 173-180, March 1980.
[10] J.P. Rossi, "Sub-nyquist encoded PCM NTSC color television," J. SMPTE, 85, 1-6, January 1976.
[11] V.G. Devereux, "Differential coding of PAL video signals using intra-field prediction," Proc. of IEEE, 124, 1139-1147, December 1977.
[12] P. Rainger and P.A. Ratliff, "Low bit-rate system for digital coding of the television picture," J. SMPTE, 88, 550-554, August 1979.
[13] J.P. Rossi, "Digital television image enhancement, " J. SMPTE, 84, 545-551, July 1975.
[14] A. Iga et al., "A consumer PCM audio unit connectable to home-use video tape recorder," presented at the Conference of the Acoustical Society of Japan, October 1977, paper 3-2-9.
[15] Y. Tsuchiya et al., "Professional PCM audio unit using error correcting code," presented at the Conference of the Acoustical Society of Japan, October 1977, paper 3-2-10.
[16] B.A. Blesser, "Digitization of audio," J. Audio Engineering Soc., Vol. 26, No. 10, pp. 750-771, October 1978.
[17] T. Doi et al., "On several standards for converting PCM signals into video signals," J. Audio Engineering Soc., Vol. 26, No. 9, pp. 641-649, September 1978.
[18] J.A. McCracken, "A high-performance digital audio recorder," J. Audio Engineering Soc., Vol. 26, No. 7/8, pp. 560-561, July/ August 1978.
[19] R.W. Brodersen, P.R. Gray, and D.A. Hodges, "MOS switched-capacitor filters," Proceedings of the IEEE, pp. 61-71, January 1979.
[20] W.C. Black, Jr., D.A. Allstot, S.H. Patel, and J.B. Wieser, "CMOS PCM channel filters," ISSCC Dig. of Technical Papers, pp. 84-85, February 1980.
[21] T. Doi, et al.
[22] G.W. McNally, "Microprocessor mixing and processing of digital audio signals," J. Audio Engineering Soc., Vol. 27, No. 10, pp. 793-830, October 1979.
[23] Y. Kawakami et al., "A Single-Chip Digital Signal Processor for Voiceband Applications," ISSCC Dig. of Technical Papers, pp. 4041, February 1980.
[24] National Semiconductor Data Acquisition Handbook, Chapter 15, 1978.
[25] T. Redfern, National Semiconductor Corp., private communication.
[26] J.R. Naylor, "Testing digital/analog and analog/digital converters," IEEE Trans. on Circuits and Systems, Vol. CAS-25, No. 7, pp. 526538, July 1978.
[27] J.G. Peterson, "A monolithic video A/D converter," IEEE J. SolidState Circuits, Vol. SC-14, No. 6, pp. 932-937, December 1979.
[28] M.O. Felix, "Differential phase and gain measurements in digitized video signals," SMPTE Journal, Vol. 85, pp. 76-79, February 1976.
[29] W.A. Kester, "Characterizing and testing $A / D$ and D/A converters for color video applications," IEEE Trans. on Circuits and Systems, Vol. CAS-25, No. 7, July 1978.
[30] R.A. Nordstrom, "High-speed integrated A/D converter," ISSCC Dig. of Technical Papers, pp. 150-151, February 1976.
[31] J.G. Peterson, op.cit. [27].
[32] A.G.F. Dingwall, "Monolithic expandable 6 bit 20 MHz CMOS/SOS A/D converter," IEEE J. Solid-State Circuits, Vol. SC-14, pp. 926-932, December 1979.
[33] J.G. Graeme, G.E. Tobey, and L.P. Huelsman, Operational Amplifiers, McGraw-Hill, New York, 1971.
[34] Y. Nishikawa and J.E. Solomon, "A general purpose wideband operational amplifier," ISSCC Dig. of Technical Papers, pp. 144-145, February 1973.
[35] K.E. Kuijk, "A fast integrated comparator," IEEE J. Solid-State Circuits, Vol. SC-8, pp. 458-462, 1973.
[36] J.G. Peterson, op.cit. [27].
[37] M.A. Polinsky, O.H. Schade Jr., and J.P. Keller, "CMOS/bipolar monolithic integrated circuit technology," Proc. Inter. Electron Dev. Meeting, p. 229, December 1973.
[38] W.C. Black, Jr., R.H. McCharles and D.A. Hodges, "CMOS (and bipolar) processes for high performance analog LSI," Proc. Inter. Electron Dev. Meeting, pp. 331-334, December 1976.
[39] J.G. Peterson, op.cit. [27].
[40] A.G.F. Dingwall, op.cit. [32].
[41] R. Kanachero. and P.R. Gray, University of California, Berkeley, private communication.
[42] J.G. Peterson, op.cit. [27].
[43] R.M. McCharles, Hewlett-Packard Laboratories, private communication.
[44] R.M. McCharles, V.A. Saletore, W.C. Black, Jr., and D.A. Hodges, "An algorithmic analog to digital converter," ISSCC Dig. of Technical Papers, pp. 96-97, February 1977.
[45] D.R. Breuer and J.D. Hyde, "10-bit 5 -megasample/second monolithic A/D converter," ISSCC Dig. of Technical Papers, pp. 152-153, February 1976.
[46] P.H. Saul, P.J. Ward, and A.J. Fryers, "A 5 ns monolithic D/A subsystem," ISSCC Dig. of Technical Papers, pp. 18-19, February 1980.
[47] J.L. McCreary and P.R. Gray, "All-MOS charge-redistribution A/D conversion techniques," IEEE J. Solid-State Circuits, Vol. SC-10, pp. 371-379, December 1975.
[48] J.L. McCreary, "Successive approximation analog-to-digital conversion techniques in MOS integrated circuits," Ph.D. dissertation, University of California, Berkeley, 1975.
[49] W.C. Black, Jr. and D.A. Hodges, "Time interleaved converter arrays technique," ISSCC Dig. of Technical Papers, pp. 14-15, February 1980.
[50] B.M. Gordon, "Linear electronic analog/digital conversion architectures, their origins, parameters, limitations and applications," IEEE Trans. on Circuits and Systems, Vol. CAS-25, No. 7, pp. 391418, July 1978.
[51] F.D. Waldhauer, U.S. Patent \#3,187,325, June 1, 1965.
[52] R.M. Jepperson, U.S. Patent \#3,119,105, January 21, 1964.
[53] R.J. Van de Plassche and R.E.J. Van der Grift, "A high-speed 7-bit A/D converter," IEEE J. Solid-State Circuits, Vol. SC-14, No. 6, pp. 938-943, December 1979.
[54] ibid.
[55] N.F. Foster, "Piezoelectric and piezoeresistive properties of films," a chapter of 'Handbook of Thin Film Technology,' McGraw-Hill Book, Co., New York, 1968.
[56] G. Kelson, H.H. Stellrecht and D.S. Perloff, A monolithic 10-b digital-to-analog converter using ion implantation," IEEE J. SolidState Circuits, Vol. SC-6, pp. 396-403, December 1973.
[57] J.A. Schoeff, "An inherently monotonic 12-bit DAC," IEEE J. SolidState Circuits, Vol. SC-14, pp. 904-911, December 1979.
[58] G. Erdi, "A precision trim technique for monolithic analog circuits," ISSCC Digest of Technical Papers, pp. 193-192; February 1975.
[59] D.T. Comer, "A monolithic 12-bit D/A Converter," ISSCC Digest of Technical Papers, pp. 104-105, February 1977.
[60] A.P. Brokaw, "A monolithic 10 -bit $A / D$ using $I^{2} L$ and LWT thin-film resistors," IEEE J. Solid-State Circuits, Vol. SC-13, pp. 736-745, December 1978.
[61] F. van de Wiele, R. van Overstraeten, and M. de Man, "Graphical method for the determination of junction parameters and of multiplication parameters," Solid State Electron., Vol. 13, pp. 25-36, 1970.
[62] E.H. Stevens, "Depletion region thickness in diffused junctions," IEEE Trans. on Elect. Dev., Vol. ED-20, No. 7, pp. 659-660, July 1973.
[63] H. Lawrence and R.M. Warner, Jr., "Diffused junction depletion layer calculations," Bell Syst. Tech. J., Vol. 39, pp. 389-403, 1960.
[64] J.C. Irvin, "Resistivity of bulk silicon and of diffused layers in silicon," Bell Syst. Tech. J., Vol. 41, pp. 387-401, 1962.
[65] J.P. Lee and S.M. Sze, "Depletion layer capacitance of cylindrical and spherical P-N junctions, " Solid State Electron., 10, p. 1105, 1967.
[66] Y. Ameniya and K. Kato, "Compensation for voltage dependence of diffused feedback resistors in operational amplifiers," IEEE J. Solid-State Circuits, Vol. SC-14, pp. 1118-1120, December 1979.
[67] W.H. Gartner, Transistors, Principles, Design and Applications, Van Nostrand Co., Inc., Princeton, 1960.
[68] K. Tan and P.R. Gray, "Fully integrated analog filters using bipolarJFET technology," IEEE J. Solid-State Circuits, Vol, SC-13, pp. 814-821, December 1978.
[69] J.L. McCreary and P.R. Gray,-"All-MOS charge redistribution analog-to-digital conversion techniques -- Part I," IEEE J. Solid-State Circuits, Vol. SC-10, pp. 371-379, December 1975.
[70] W.C. Black, Jr. et al., op.cit. [20].
[71] T. Redfern, National Semiconductor Corp., Santa Clara, Ca., re. U.S. Patent Application.
[72] G.F. Landsburg and G. Smarandoiu, "A two-chip CMOS CODEC," ISSCC Digest of Technical Papers, February 1978.
[73] W.C. Black, et al., op.cit. [49].
[74] J.L. McCreary, A.M. Voshchenkov, (unpublished).
[75] J.L. McCreary, private communication.
[76] S.M. Sze, Physics of Semiconductor Devices, J. Wiley, New York, 1969.
[77] A. Many, et al., Semiconductor Surfaces, J. Wiley, New York, 1965.
[78] J.L. McCreary, A.M. Voshchenkov (unpublished).
[79] ibid.
[80] A.L. Grebene, Analog Integrated Circuit Design, Van Nostrand Reinhold, New York, 1972.
[81] American Institute of Physics Handbook, 2nd Ed., McGraw-Hill, New York, 1963.
[82] P. Andreatch and H.J. McSkimin, J. Applied Physics, Vol. 47, pp. 1299-1301, 1976.
[83] J.L. McCreary, op.cit. [74].
[84] H. Schichman and D.A. Hodges, "Modeling and simulation of insulatedgate field-effect transistor switching circuits," IEEE J. Solid-State Circuits, Vol. SC-3, pp. 285-289, September 1968.
[85] D. Frohman-Bentchkowski and A.S. Grove, "Conductance of MOS transistors in saturation," IEEE Trans. Electron Devices, Vol. ED-16, pp. 108-113, January 1969.
[86] Y.A. El-Mansy and A.R. Boothroyd, "A simple two-dimensional model for IGFET operation in the saturation region," IEEE Trans. Electron Devices, Vol. ED-24, pp. 254-261, March 1977.
[87] R.R. Troutman, "Low-level avalanche multiplication in IGFETs," Proc. Inter. Electron Dev. Meeting, pp. 43-46, December 1978.
[88] W.C. Black, Jr., "High voltage metal gate CMOS process for large scale analog circuit applications," M.S. Thesis, University of California, Berkeley, June 1977.
[89] ibid.
[90] W.C. Black, Jr. et al., op cit [38].
[91] J.E. Soloman, "The monolithic op amp: A tutorial study," IEEE J. Solid State Circuits, Vol. SC-9, pp.314-332, December 1974.
[92] B.J. Hosticka, "MOS sampled data recursive filters using state variable techniques," Ph.D. dissertation, University of California, Berkeley, Memorandum No. ERL-M77/65, August 1977.
[93] D. Senderowicz, D.A. Hodges, and P.R. Gray, "High-performance NMOS operational amplifier," IEEE J. Solid State Circuits, Vol. SC-13, pp. 760-766, December 1978.
[94] W.C. Black, Jr., D.J. Allstot, and R.A. Reed, "A high-performance low power CMOS channel filter," IEEE J. Solid State Circuits, Cember 1980.
[95] ibid.
[96] H. Ohara, W. Baxter, C. Rahim, J. McCreary and P.R. Gray, "A new second generation low power MOS switched capacitor PCM channel Filter," Proc. Inter. Conf. on Communications, June 1980.
[97] K.C. Hseuh, University of California, private communication.
[98] Y.P. Tsividis and P.R. Gray, "An integrated NMOS operational amplifier with internal compensation," IEEE J. Solid State Circuits, Vol. SC-11, pp. 748-753, December 1976.
[99] D. Senderowicz, et al, op cit [93].
[100] W.C. Black, Jr., et al, op cit [94].
[101] J.S. Brugler and P.G.A. Jespers, "Charge pumping in MOS devices," IEEE Trans. on Elect. Dev., Vol. ED-16, pp. 297-302, March 1969.
[102] Y.S. Yee, L.M. Terman and L.G. Heller, "A 1 mV MOS comparator," IEEE J. Solid State Circuits, Vol. SC-13, pp. 294-297, June 1978.
[103] M.B. Das and J.M. Moore, "Measurement and interpretation of lowfrequency noise in FET's," IEEE Trans. Electron Dev., Vol. ED-21, pp. 247-257, April 1974.
[104] F.M. Klaassen, "Characterization of low 1/f noise in MOS transistors," IEEE Trans. Electron Dev., Vol. ED-18, pp. 887-891, October 1971.
[105] J. Bertails, "Low frequency noise considerations for MOS amplifier design," IEEE J. Solid State Cricuits, Vol. SC-14, pp. 773-776, August 1979.
[106] ibid.
[107] F. Breole, Sandia Laboratories, private communication.
[108] A.G.F. Dingwall, op cit. [32].
[109] D.G. Messerschmitt, "Effects of bit-robbing in tandem digital switches," IEEE Trans. on Communications, Vol. COM-27, pp. 10251028, July 1979.
[110] W.W. Goodall, "Television by pulse code modulation," Bell System Technical Journal, Vol. 30, No. 1, pp. 33-49, January 1951.
[111] W.C. Black, Jr. et al, op cit. [49].
[112] W.C. Black, Jr. and D.A. Hodges, "Time interleaved converter arrays," IEEE J. Solid State Circuits, December 1980.
[113] M. Townsend, M.E. Hoff, Jr. and R.E. Holm, "An NMOS microprocessor for analog signal processing," IEEE J. Solid State Circuits, Vol. SC-15, No. 1, pp. 33-38, February 1980.
[114] L.E. Franks and J.W. Sandberg, "An alternative approach to the realization of network transfer functions: The N-path filter," Bell Syst. Technical Journal, Vol. 39, pp. 1321-1350, September 1960.
[115] W.E. Heinlein, K.H. Moehrmann and W.H. Holmes, "Double-turned $N$-path bandpass filters using a single gyrator," IEEE Trans. on Circuit Theory, pp. 728-729, November 1971.
[116] D.J. Allstot and K.S. Tan, "A switched capacitor N-path filter," Proc. Inter. Circuits and Systems Conference, March 1980.

APPENDIX A

SIGNAL TO NOISE RATIO SIMULATION PROGRAM
(Flash Simulator Shown)

Initial variable definitions

$$
\begin{aligned}
N= & \# \text { of bits } \\
L= & \# \text { of converters } \\
E= & \text { relative component } \\
& \text { bution routine }) \\
V 1= & \text { Reference voltage } \\
A 9= & V_{\text {in (peak) }} / V_{\max }
\end{aligned}
$$

$E=$ relative component error ( $\sigma_{R} / \bar{R}$ if using normal distri-

Intermediate and output variable definitions
$R(I, J)=$ Jth resistor of converter $i$
$A \phi(I)=$ Best fit offset of Ith converter
Al(I) = Best fit gain of Ith converter
A $\mathrm{MA}=$ Average array offset
AlMA $=$ Average array gain
EPI $=\frac{1}{L} \sum_{i=1}^{L}\left\langle\delta_{i}{ }^{2}\right\rangle=\left\langle\overline{\delta_{i}{ }^{2}}\right\rangle$
$E P 2=\frac{1}{L} \sum_{i=1}^{L}\left\langle x \delta_{i}\right\rangle\left(a_{i}-\bar{a}\right)$
EP3 $=\frac{1}{L} \sum_{i=1}^{L}\left(\delta_{i}\right)\left(b_{i}-\bar{b}\right)$
$E P 4=\frac{1}{L} \sum_{i=1}^{L}(x)\left(a_{i}-\bar{a}\right)\left(b_{i}-\bar{b}\right)$
$E P 5=\frac{1}{L} \sum_{i=1}^{L}\left\langle x^{2}\right\rangle\left(a_{i}-\bar{a}\right)^{2}$

$$
\begin{aligned}
& \text { EP6 }=\frac{1}{L} \sum_{i=1}^{L}\left(b_{i}-\bar{b}\right)^{2} \\
& E P X=\sum_{i=1}^{6} E P i
\end{aligned}
$$



```
    IIMENSIOM R (10, 1 OE 4) , T ( 10
    IIMEHSIOH SIG(10), FIGC10), F1 (10), EF ( 6,10 )
    IIDIELE PFECISIDN F1, S. S1, SE, S., 54. S5,56
    FEFL . II, JE
    \(\mathrm{H}=\mathrm{T} \quad: \mathrm{H}=: \mathrm{OF}\) BITS
    \(K=N+1\)
    L=E ! L = : DF EGNVERTEFS
    \(E=1 . \Omega E . \quad: E=F E L\). FES. EFROR:
    \(\because 1=5 . \quad \vdots \geqslant 1=\) FEFEFENCE \(\because O L T A B E\)
```



```
    \(I 1=1\)
```



```
    WF:ITEくこ日, EOM H.L.E
    FDFMAT (EI5,F10.E)
    \(F_{1}=1\).
        :IIEFIME UNIT RESISTGE:
    IO 50 IJ=1.50 !LDOF THFOIIGH: DF HPRFYS
    TYPE 100
    FOFMAT (1H)
    IO \(1011=1, \mathrm{~L}\)
    İ=11
    CFLL ERR (K,F,R1,N.L,E,IG) IARRFYS WITH ERFORS
10
\(E\)
```



```
    1' \(\mathrm{E}=\) - F10. \({ }^{\circ}\)
    QE=1 :SUITEH \% FOR INDIV. EDNYERTERS
        : SWITEH O F FDP LIHELE FRFAY
```



```
    1, ADFF) :FINI EEST FIT LINESEAICH GOHYERTER.
    IOE I \(11=1, L\)
    Qé=11 !SUITCH II FOR EONVERTER II
    : SWITCH O FOR WHOLE FREFAY
```



```
    1J1,F1,N1, SE, S3, S4, S5,SG,HOFF,F1M,FUM) IFINI EPR.
    \(E P(1, I 1)=S 1 ; E P(E, I 1)=E \Sigma ; E P(3, I 1)=S 3\)
    \(E F(4,11)=54 E P(5 \cdot I 1)=55 E F(6, I 1)=S 6\)
```




```
    SIG (I1) =10.* (S1/10.)
```



```
    FOPMAT (1H, COMVERTER \(\because, ~ I S, 4\) (FIE.B)
```



| $\begin{aligned} & \mathrm{En} 1 \\ & \mathrm{En} \end{aligned}$ | FOFMAT GF10．6．SF1 I． EONTINIE $\mathrm{B} 1 \mathrm{MA}=\mathrm{O} .: \mathrm{FIMA}=\mathrm{I}$ ． |
| :---: | :---: |
|  | I口 $\because \square I=1 . L$ |
|  | H1MA $=\mathrm{H} 1 \mathrm{MA}+\mathrm{H} 1$（I） |
| 30 |  |
|  |  T＇VPE 103, H1MA．FUMA |
| 118 |  |
|  |  |
|  | D＇ $40 \mathrm{I}=1$ ，L |
|  | $E F 1=E P 1+E P(1, I)$ |
|  | $E F E=E P 2+E P(2, I) *(H 1(J)-H 1 M A)$ |
|  | $E F 3=E P 3+E F(3,1) \oplus\left(\begin{array}{l}\text { P }\end{array}\right.$ |
|  |  |
|  |  |
| 411 |  |
|  | $E P 1=E P 1, L$ |
|  | EPこ＝EFこャこ． |
|  |  |
|  | EP4＝EP4＊ $2 . L$ |
|  | EPS＝EPS，L |
|  | EPG＝EFGとL |
|  | $E P X=E P 1+E P E+E P 3+E P 4+E P S+E P 6$ |
|  | TYFE 104，EF1：EPE．EP3 |
| 1514 |  |
|  |  |
|  | T＇FEE 105，EP4，EFE，EF6 |
| 10.5 |  |
|  | 1＊EPG＝＇F12．${ }^{\circ}$ |
|  |  |
|  | WRITEイEO，EीE）AIMA，FIMA |
|  |  |
|  |  |
| ごす | FOFMHT（1H， 4 ¢F1こ． 3 ，S\％\％ |
|  | TYFE 106．EFX，EPY |
| 106 | FORMAT © 1 H ：TOTAL EPROF FQWER＊F12．S．F12．B） |
|  | Gロ TO 50 ：REMOWE FOR FLILL FMALYSIS |
|  | Bこ＝ 0 |
|  |  |
|  | 1HDFF） |
|  | TYFE 107．G1M．HIM |
| 107 | FORMAT（1H＊BEST FIT GAIN ANI DFFSET＇，F1E．B．FIE．B） |
|  | QE＝0 |
|  |  |
|  |  |
|  |  |
|  | T＇TPE 1 DE，EFX，EPY：TYPE 101 |
| 510 | COMT INIIE |
|  | H\％$=9 \%$ ． |
|  |  |

```
    WFITEGEO,OG% HG, HG GWFITE FINISHEN EONE
    FOFMAT(EF1!.ご)
        ELDSE MMIT=EO
        STDF
        EMI
```



```
    SIEFOIITINE EFR(X,F,PRI:N,L:E,I I)
    IIMENSIOH & (10,1 IEC4)
    E1=E
    IO 1 I=1,IFIK(Z.**H+.,11)
    EHLL STAT (EI, % EGALL STAT FOF EHUSEIAN
    OES\bulletCFAHCD: - % E :THIE LINE FOF EOX IIST.
    IF(x.LT.-1.> : <=-1. :TRIINIGTE ERR. FOR F) !
    FCII,I)=F1-(1.+N)
    EOMTINIIE
    PETI_PNT
    ENHI
```



```
    OLEFDIITINE TOT K,T,R,N,L,II)
    IIMENEIDN T &10%:R&10.10E4)
    T(I1) = !
    I口 1 I=1,IFIX<2.**N+. !1)
    TCII)=T(I1)+R(I1.I)
1 EQNTINLIE
    F:ETIIEN
    ENII
I
    SIEFRIITINE ATOD (K,T,R,N,L:I1:Y:W1:D)
    IIMENEIDN T(1D,R(10,10E4)
    FT=B.
    IID 1 I=1,IFIN(E.**M+. I1)-1
    F:T=FT+F(II:I)
    \because=RT,T(I1)*E.*V1-V1
    IF(YS.LT.%ED TO 1
    I=I-1
    PETIIFN
1 EONTINIIE
    II=I-1
    FETIIEN
    END
C- 
    SHEFGLITIHE IITDR(F,N.W,VI,I)
```



```
    RETIIFN
    EMI
```



```
    SIERDITTINE FROE(N1,VI:'S.F.F.FG.ADFF)
    IIOELE FPECISIDN P
```



```
#
```

```
F=!.!1TIO
Eロ TO \Xi
1
C
\Xi
EOPTTINIIE
PETILRH
ENII
```



``` 1．HOFF）
DIMENEIDN F（ \(10,10 \Xi 4\) ），T（ 10 ）
```



```
IOIELE FRECISION F，T1，T2，TS，TA，TS，TE
```



```
TEMF \(1=, \quad 1-\operatorname{HF}\)
TEMFE＝TEMF1， 1 ！nin．
```




```
IID 1 I \(1=1\) ，L
IFCDE．EO．日 G G Tロ こ
\(\mathrm{H} 1=0\)
\(T 1=0 . ; T 2=0 .: T 3=0 ; T 4=0 . ; T 5=0\).
T \(6=0\)
\(\because \underset{\because G}{\prime}=46\)
```



``` IFはE．EO．M \(F=F / L\)
\(T 1=T 1+\ddot{\gamma} * P\)
\(T E=T こ+\%\) •
\(T B=T B+x \rightarrow Y \Delta F\)
\(T 4=T 4+X: \phi \quad \otimes\)
\(T .5=T S+Y \leftrightarrow Y \leftarrow F\)
TS＝TE + ：MAY MONITOF TS FOR EHEK OF INT（P）＝1
EDHT INIIE
IF GE．EG．日）GO TO 1
H1（I1）＝（TS－Tコ－T1）（T4－T1＊T1）
F！（I1）＝TこーF1（I \()\)－T1
EOHT INIIE
IF：DC．ME．D B ETD 3
H1M＝（TB－TE T1）（T4－T1－T1）
H！M＝TE－H1MかT1
E TYPE 1D，HIM，AIM
3
```

RETIIEN

## ENI



1S1，S，JE，I1，F1，N1，SE，S3，S4，SS，SG，HDFF，F1M，FIMM
IIMENSJON F（10．10E4），T（10）

IDUELE FFECISIDM P1，13，S1，1，S，14，EE，S5，54，55，56
FEAL ． $11, .12$

$\operatorname{TEMP1}=\psi_{1} \uparrow \mathrm{H}^{9}$
IG 1 VS＝－TEMP $1+$ HOFF：TEMP $1+$ HOFF．TEMF $1 / 10000$ $\mathrm{I} \mathrm{E}=1 ; \mathrm{I} \mathrm{B}=\mathrm{L}$
IF（OE．ME．O）IE＝ロE
IFGE．ME．O I I＝0こ
$\mathrm{IO}+\mathrm{II}=\mathrm{IE}$ •IS
$\psi=4.5$
19＝11
EALL ATON（K，T，R，N．L，IQ，Y，Y1，D）
EALL IITOA（K．N．W．VI，I）


$.13=\operatorname{HES}(.14)$
$\% 6=15$
CALL PROB（N1：Y1：YG．P1：RG：HOFF）
$S 1=51+13 \cdot 13+F_{1}$
SE＝Sモ＋． 14 ＊VS＊F 1
$\mathrm{S}=53+14 \oplus \mathrm{~F}_{1}$
$\mathrm{S} 4=54+\mathrm{V}_{5} \mathrm{~F} \mathrm{~F}_{1}$

$56=5+P 1$
© SET LIF FOF IDEAL SIMISOIIS
 S $6=1$ ．

$\mathrm{S}=\mathrm{S}+.1+.1+\mathrm{F}_{1}$
IF（．I．LT．JI）GOTロ 3
$.11=1$
IF（IB．LT．．Iそ）にロ TO 4
$.12=13$
equtinilie
gontinile
FETIIPN
ENI
SUEFDIITINE STAT EE1．$\infty$
$\mathrm{HEO}=2.515517$
FC： $1=.302853$
$\mathrm{HC}=.01 \mathrm{a}=\mathrm{E}$
AD $1=1.432783$

```
ADこ=. 139269
HIG=. 00130E
EM=FRN(0)
Q=.5*EN
T=SNPT (FLDG(1., (0&G)
K=ACO
X=*
EN=RFINCO
:=E1-(T-%) SIEN(1.,EN-.5)
PETURN
END
```

2

## APPENDIX B. 1

PLANAR W,E AND V DETERMINATION

From Guass' law

$$
\begin{equation*}
\underset{\sim}{\nabla} \cdot \underset{\sim}{E}=\frac{\rho}{\varepsilon_{S}} \tag{B.1.1}
\end{equation*}
$$

as

$$
\begin{equation*}
\rho(x)=q\left[N_{s} e^{-x^{2} / 4 D T}-N_{B}\right] \tag{B.1.2}
\end{equation*}
$$

then

$$
\begin{equation*}
E_{1}(x)=\frac{q N_{s}}{\varepsilon} \int_{x_{L}}^{x}\left[e^{-x^{2} / 4 D T}-\frac{N_{B}}{N_{s}}\right] d x \tag{B.1.3}
\end{equation*}
$$

and

$$
\begin{equation*}
E_{2}(x)=\frac{q N_{s}}{\varepsilon} \int_{x_{R}}^{x}\left[e^{-x^{2} / 40 T}-\frac{N_{B}}{N_{s}}\right] d x \tag{B.1.4}
\end{equation*}
$$

where $E_{1}$ and $E_{2}$ represent the electric fields present on the left (diffusion) side and right (substrate) side of the metallurgical junction while $x_{L}$ and $x_{R}$ represent the edges of the depletion region.

As

$$
\begin{equation*}
\operatorname{erf}(x)=\frac{2}{\sqrt{\pi}} \int_{0}^{x} e^{-t^{2}} d t \tag{B.1.5}
\end{equation*}
$$

and via substitution

$$
\begin{equation*}
\int_{0}^{x} e^{-t^{2} / 4 D T}=\sqrt{\pi D T} \operatorname{erf}(x / \sqrt{4 D T}) \tag{B.1.6}
\end{equation*}
$$

then the $E$ fields may be expressed as

$$
\begin{align*}
E_{1}(x)= & \frac{q N_{S}}{\varepsilon}\left[\sqrt{\pi D T}\left[\operatorname{erf}(x / \sqrt{4 D T})-\operatorname{erf}\left(x_{L} / \sqrt{4 D T}\right)\right]\right. \\
& \left.-N_{B} \quad\left(x-x_{L}\right)\right] \tag{B.1.7}
\end{align*}
$$

and

$$
\begin{align*}
E_{2}(x)= & \frac{q N_{S}}{\varepsilon}\left[\sqrt{\pi D T}\left[\operatorname{erf}(x / \sqrt{4 D T})-\operatorname{erf}\left(x_{R} / \sqrt{4 \mathrm{DT}}\right)\right]\right. \\
& \left.-\frac{N_{B}}{N_{S}}\left(x-x_{R}\right)\right] \tag{B.1.8}
\end{align*}
$$

For a given $x_{R}$ or $X_{L}$, the other component ( $x_{L}$ or $x_{R}$ ) may be determined by solving for the case $E_{1}\left(x_{j}\right)=E_{2}\left(x_{j}\right)$, which is the peak field. Taking

$$
\begin{gather*}
E_{1}\left(x_{j}\right)-E_{2}\left(x_{j}\right)=\frac{q N_{s}}{\varepsilon}\left[\sqrt { \pi D T } \left(\operatorname{erf}\left(x_{R} / \sqrt{4 D T}\right)\right.\right. \\
\left.\left.-\operatorname{erf}\left(x_{L} / \sqrt{4 D T}\right)\right)+\frac{N_{B}}{N_{S}}\left(x_{L}-x_{R}\right)\right] \tag{B.1.9}
\end{gather*}
$$

It is evident that zeros occur where the function

$$
\begin{align*}
F=\sqrt{\pi D T} & \left(\operatorname{erf}\left(x_{R} / \sqrt{4 D T}\right)-\operatorname{erf}\left(\frac{x_{L}}{\sqrt{4 D T}}\right)\right) \\
& +\frac{N_{B}}{N_{S}}\left(x_{L}-x_{R}\right) \tag{8.1.10}
\end{align*}
$$

is zero. Taking $x_{R}$ as given, and finding that a closed form of $F^{\prime}$ exists

$$
\begin{equation*}
\frac{\partial F}{\partial x_{L}}=e^{-x_{L}{ }^{2} / 4 D T}-\frac{N_{B}}{N_{S}} \tag{B.1.11}
\end{equation*}
$$

it is apparent that Newtons method may be app?ied to the above to indicate the values of $X_{L}$ which create zeros of $F$ (see Appendix B.3).

The electrostatic potential may be determined by

$$
\begin{equation*}
-\underset{\sim}{V}=\underset{\sim}{E} \tag{B.1.12}
\end{equation*}
$$

or

$$
\begin{equation*}
v_{1}=-\int_{x_{L}}^{x_{j}} E_{1}(x) d x \tag{B.1.13}
\end{equation*}
$$

and

$$
\begin{equation*}
v_{2}=+\int_{x_{R}}^{x_{j}} E_{2}(x) d x \tag{B.1.14}
\end{equation*}
$$

or

$$
\begin{align*}
v_{1}= & \frac{-q N_{s}}{\varepsilon_{s}}\left[\sqrt { \pi D T } \left[\int_{x_{L}}^{x_{j}} \operatorname{erf}\left(\frac{x}{\sqrt{4 D T}}\right) d x\right.\right. \\
& \left.\left.-\operatorname{erf}\left(\frac{x_{L}}{\sqrt{4 D T}}\right) \int_{x_{L}}^{x_{j}} d x\right]-\frac{N_{B}}{N_{s}} \int_{x_{L}}^{x_{j}} x d x+\frac{N_{B}}{N_{s}} x_{L} \int_{x_{L}}^{x_{j}} d x\right] \tag{B.1.15}
\end{align*}
$$

which because [B.1]

$$
\begin{equation*}
\int \operatorname{erf}(x) d x=x \operatorname{erf}(x)+\frac{1}{\sqrt{\pi}} e^{-x^{2}}+c \tag{B.1.16}
\end{equation*}
$$

becomes

$$
\begin{align*}
V_{1}= & \frac{-q N_{s}}{\varepsilon_{s}}\left[\sqrt{\pi D T} x_{j}\left(\operatorname{erf}\left(\frac{x_{j}}{\sqrt{4 D T}}\right)-\operatorname{erf}\left(\frac{x_{L}}{\sqrt{4 D T}}\right)\right)\right. \\
& \left.+2 D T\left(e^{-x_{j}{ }^{2} / 4 D T}-e^{-x_{L}{ }^{2} / 4 D T}\right)-\frac{N_{B}}{2 N_{s}}\left(x_{j}-x_{L}\right)^{2}\right] \tag{B.1.17}
\end{align*}
$$

and similarly

$$
\begin{align*}
V_{2}= & \frac{+q N_{s}}{\varepsilon}\left[\sqrt{\pi D T} x_{j}\left(\operatorname{erf}\left(x_{j} / \sqrt{4 D T}\right)-\operatorname{erf}\left(x_{L} / \sqrt{4 D T}\right)\right)\right. \\
& \left.+2 D T\left(e^{-x_{j} / 4 D T}-e^{-x_{L}{ }^{2} / 4 D T}\right)-\frac{N_{B}}{2 N_{s}}\left(x_{j}-x_{L}\right)^{2}\right] \tag{B.1.18}
\end{align*}
$$

where it is the desired condition, that

$$
\begin{equation*}
F=V_{B}-V_{1}-V_{2}-\phi=0 \tag{В.1.19}
\end{equation*}
$$

where

$$
\begin{equation*}
\phi=\frac{k T}{q} \ln \left[\frac{N\left(x_{L}\right) N\left(x_{R}\right)}{n_{i}{ }^{2}}\right] \tag{B.1.20}
\end{equation*}
$$

As $F$ is not differentiable by $x_{R}$ (due to $x_{L}$ being a function of $x_{R}$ ), Newtons method is not applicable without a numerical differentiation. It was found that the Secant method resulted in lower computational cost to convergence, allowing a fairly rapid determination of $x_{R}$ for a specified $V_{B}$ (see Appendix B.3).

## APPENDIX B. 2

## CYLINDRICAL W,E AND V

## From Gauss' law

$$
\begin{equation*}
\underline{\nabla} \cdot \underline{E}=\rho / \varepsilon \tag{B.2.1}
\end{equation*}
$$

it may be shown, that for a cylindrically symmetrical impurity distribution,

$$
\begin{equation*}
\frac{1}{r} \frac{\partial}{\partial r}[r E(r)]=\frac{\rho(r)}{\varepsilon} \tag{B.2.2}
\end{equation*}
$$

or,

$$
\begin{equation*}
E_{l}(r)=\frac{1}{\varepsilon r} \int_{r_{L}}^{r} r_{\rho}(r) d r+c \tag{B.2.3}
\end{equation*}
$$

and

$$
\begin{equation*}
E_{2}(r)=\frac{1}{\varepsilon r} \int_{r_{R}}^{r} r_{p}(r) d r+c \tag{B.2.4}
\end{equation*}
$$

where $E_{1}$ and $E_{2}$ correspond to the $E$ fields present on the resistor and substrate side of the metallurgical junction, and $r_{L}$ and $r_{R}$ correspond to the resistor and substrate depletion region edges. For the gaussian impurity distribution that has been assumed, the above reduces to:

$$
\begin{align*}
E_{1}(r)= & -\frac{2 D T q N_{S}}{r \varepsilon_{s}}\left[e^{-r^{2} / 4 D T}-e^{-r_{L}^{2} / 4 D T}\right] \\
& -\frac{q N_{B}}{2 r \varepsilon_{S}}\left(r^{2}-r_{L}^{2}\right) \tag{B.2.5}
\end{align*}
$$

$$
\begin{align*}
E_{2}= & -\frac{2 D T q N_{s}}{r \varepsilon_{s}}\left[e^{-r^{2} / 4 D T}-e^{-r_{r}^{2} / 4 D T}\right] \\
& -\frac{q N}{2 r \varepsilon_{s}}\left(r^{2}-r_{r}^{2}\right) \tag{B.2.6}
\end{align*}
$$

For a given $r_{r}$ or $r_{L}$, the other component ( $r_{L}$ or $r_{r}$ ) may be determined by solving for the case $E_{1}\left(r_{j}\right)=E_{2}\left(r_{j}\right)$, which is the peak field. Taking $E_{1}\left(r_{j}\right)-E_{2}\left(r_{j}\right)$, it is evident that zeros occur where the function

$$
\begin{align*}
F= & 2 D T N_{s}\left[e^{-r_{L}{ }^{2} / 4 D T}-e^{-r_{r}{ }^{2} / 4 D T}\right] \\
& +\frac{N_{B}}{2}\left(r_{L}{ }^{2}-r_{r}{ }^{2}\right) \tag{B.2.7}
\end{align*}
$$

is zero. Taking $r_{r}$ as given, and finding that a closed form of $F^{\prime}$ exists

$$
\begin{equation*}
\frac{\partial F}{\partial r_{L}}=-N_{S} e^{-r_{L}^{2} / 4 D T} r_{L}+N_{B} r_{L} \tag{B.2.8}
\end{equation*}
$$

it is apparent that Newton's method may be applied to the above to indicate the values of $r_{L}$ which create zeros of $F$ (see Appendix B.3).

The electrostatic potential may be determined by

$$
\begin{equation*}
-\underline{\nabla} V=\underline{E} \tag{B.2.9}
\end{equation*}
$$

we then have

$$
\begin{align*}
V_{1}(r)= & \int_{r_{L}}^{r} E_{1}(r) d r  \tag{B.2.10}\\
= & \frac{-20 T q N_{S}}{\varepsilon}\left[\int_{r_{L}}^{r} \frac{e^{-r^{2} / 40 T} d r}{r}-e^{-R_{L}{ }^{2} / 4 D T} \cdot \ln \left(\frac{r}{r_{L}}\right)\right] \\
& \frac{-q N}{4 \varepsilon}\left(r^{2}-r_{L}{ }^{2}\right)+\frac{q N r_{L}{ }^{2}}{2 \varepsilon} \ln \left(r / r_{L}\right) \tag{B.2.11}
\end{align*}
$$

Further:

$$
\begin{equation*}
\int \frac{e^{-a^{2} r^{2}} d r}{r}=\ln (r)+\sum_{i=1}^{N} \frac{(a r)^{2 i}(-1)^{i}}{i!(2 i)}+\varepsilon \tag{B.2.12}
\end{equation*}
$$

where

$$
\begin{equation*}
|\varepsilon| \leq \frac{(a r)^{2 N}}{N!(2 N)} \tag{B.2.13}
\end{equation*}
$$

Thus

$$
\begin{align*}
V_{1}(r)- & \frac{-q}{\varepsilon}\left[-\frac{N_{B}}{4}\left(r^{2}-r_{L}^{2}\right)+\frac{N_{B}}{2} r_{L}^{2} \ln \left(r / r_{L}\right)\right. \\
& \left.+2 D T N_{S} e^{-r_{L}^{2} / 40 T} \ln \left(r / r_{L}\right)-20 T N_{s}\left[f(r)-f\left(r_{L}\right)\right]\right] \tag{B.2.14}
\end{align*}
$$

and

$$
\begin{align*}
V_{2}(r)-\frac{q}{\varepsilon} & {\left[-\frac{N_{B}}{4}\left(r^{2}-r_{r}^{2}\right)+\frac{N_{B}}{2} r_{r}^{2} \ln \left(r / r_{r}\right)\right.} \\
& \left.+20 T N_{s} e^{-r_{r}^{2} / 4 D T} \ln \left(r / r_{r}\right)-2 D T N_{s}\left[f(r)-f\left(r_{r}\right)\right]\right] \tag{B.2.15}
\end{align*}
$$

where

$$
\begin{equation*}
f(r)=\ln (r)+\sum_{i=1}^{N} \frac{\left(r^{2} / 40 T\right)^{i}(-1)^{i}}{i!(2 i)} \tag{B.2.16}
\end{equation*}
$$

where it is the desired condition, that

$$
\begin{equation*}
F=V_{B}-V_{1}-V_{2}-\phi=0 \tag{B.2.17}
\end{equation*}
$$

where

$$
\begin{equation*}
\phi=\frac{k T}{q} \ln \left[\frac{N\left(r_{L}\right) N\left(r_{r}\right)}{n_{i}{ }^{2}}\right] \tag{B.2.18}
\end{equation*}
$$

As in the planar case, the Secant method is used to determine the appropriate value of $r_{r}$ given a specific $V_{B}$ (see Appendix B.3).

APPENDIX B. 3

PROGRAM FOR EVALUATION OF RESISTIVITY AND VOLTAGE
COEFFICIENT OF DIFFUSED RESISTORS, INCLUDING SIDEWALLS

Initial variable definitions

$$
\begin{aligned}
D 1 & =D T \\
Q 1 & \left.=Q_{T} \text { (total implant dose in ions } / \mathrm{cm}^{2}\right) \\
M & =\text { mobility coef. } \\
K & =\text { mobility coef. }\left(\mu=k N^{m}(x)\right) \\
N & \left.=N_{B} \text { (background doping in ions } / \mathrm{cm}^{3}\right) \\
L & =\text { resistor length } \\
W & =\text { resistor width (on mask) } \\
V & =V_{B} \text { (applied bias) }
\end{aligned}
$$

Output variable definitions

$$
\begin{aligned}
& A 2=\left.\frac{\partial x_{L}}{\partial V}\right|_{V=V_{B}} \quad \text { planar case } \\
& A 4=\left.\frac{\partial r_{L}}{\partial V}\right|_{V=V_{B}} \text { cylindrical case } \\
& R 1=\rho_{S} \quad \text { planar } \\
& F 1=\left.\frac{\partial \rho_{S}}{\partial V}\right|_{V=V_{B}} \quad \text { planar } \\
& G=G \text { side } \quad \text { cylindrical } \\
& F 2=\left.\frac{\partial G V}{\partial V}\right|_{V=V_{B}} \text { cylindrical }
\end{aligned}
$$

$R=R_{T} \quad$ combined planar and cylindrical resistance
$R_{2}=R /\left(\frac{L}{W} R_{1}\right)$ normalized resistance with respect to mask dimensions
$F 3=\left.\frac{\partial R_{T}}{\partial V}\right|_{V=V_{B}}$ total resistor bias voltage coefficient
$R 3=\left.\frac{1}{R_{T}} \frac{\partial R_{T}}{\partial V}\right|_{V=V_{B}}$ normalized bias voltage coefficient


[^0]:    ${ }^{l}$ Coring is a nonlinear process which removes both low amplitude noise and signal about a signals baseband. This function is typically used to "crispen" both horizontal and vertical details.

[^1]:    'Alternately, the characteristic line is sometimes defined with respect to the midpoints of the first and last conversion steps [24].

[^2]:    ${ }^{2}$ In a feedback system, nonmonotonic conversions may result in oscillatory behavior about a nonmonotonic region. In mechanical servo systems, however, a high frequency oscillatory signal is often applied as a matter of course, to maintain the system in a state of "dynamic" rather than "static" friction [25]. In situations such as these, system performance will not necessarily be degraded unless the nonmonotonic region is large.
    ${ }^{3}$ In all converters it is the case that if the INL < 1 LSB then the DNL < 1 LSB. The converse, however, is not necessarily true. In some converter types, most notably those employing linear strings of precision components, a grossly nonlinear "bow" like characteristic may still exhibit a DNL <1 LSB.

[^3]:    ${ }^{4}$ This test as indicated, will not detect delay distortion terms at the fundamental frequency. For these to be included, the notch filter of Fig. 3.4 must be made phase as well as frequency selective.

[^4]:    ${ }^{5}$ Both vertical and lateral pnp devices are available in a triplediffused (3-D) bipolar process, however, the vertical devices are all common collector, and the lateral devices usually exhibit a very low B due to the absence of a buried layer.

[^5]:    ${ }^{6}$ Successive Approximations by factors of less than two are possible, and indeed quite useful in converters employing nonlinear coding (such as companding) or adaptive calibration algorithms [43].

[^6]:    ${ }^{7}$ Note that certain embodiments of this algorithm are similar to the "stage by stage converter"described by Gordon [50] and invented by Waldhauer [51] and Jepperson [52].

[^7]:    ${ }^{2}$ Because of the presence of $p_{3}$ and other high frequency poles, the optimal settling condition is actually where $z_{1}$ is at a slightly lower frequency than $p_{2}$.

[^8]:    ${ }^{3}$ Even in flash converters which employ a reset cycle fully half of the time, the comparator is still not usually the speed limiting factor. Typically ladder R-C charging times will determine the maximum useful conversion rate.

[^9]:    1 In some converters, making the higher-order bit times longer than others will lead to increased nonlinearity. This is apparently due to the comparator displaying set-up time dependent behavior [107].

