# A 10.4-Gb/s 1-Tap Decision Feedback Equalizer With Different Pull-Up and Pull-Down Tap Weights for Asymmetric Memory Interfaces

Joo-Hyung Chae<sup>®</sup>, *Member, IEEE*, Minchang Kim, Sungphil Choi<sup>®</sup>, *Student Member, IEEE*, and Suhwan Kim<sup>®</sup>, *Senior Member, IEEE* 

Abstract—In asymmetric memory systems, the pull-up and pull-down data from the channel can have different amounts of inter-symbol interference (ISI), so they cannot be fully corrected by an equalizer which uses a single tap weight. We introduce a receiver-side single-ended 1-tap asymmetric decisionfeedback equalizer (DFE) with a weight selection multiplexer, which allows the application of a different weight to each direction of data value transition. Implemented in a 55-nm CMOS technology, our DFE compensated for asymmetric ISI in a 10.4-Gb/s signal from a channel with an insertion loss of -8.3 dB, leading to a 50% wider horizontal margin at a bit-error-rate of  $10^{-12}$  than symmetric correction. The energy efficiency is 0.16 pJ/bit.

*Index Terms*—Quarter-rate receiver, decision feedback equalizer (DFE), memory interface, asymmetric equalization.

#### I. INTRODUCTION

W ITH the emerging interest in virtual reality, artificial intelligence, Internet of Things, and autonomous vehicles, there is increasing demand for higher data-bandwidth in memories such as double data-rate (DDR), low-power DDR (LPDDR), and graphics DDR (GDDR) [1]; thus, a higher data-rate per pin should be supported in memory interfaces [2]. This increases inter-symbol interference (ISI), because the insertion loss of the channel is frequency-dependent.

One way to compensate for an increased channel loss is to use an equalization technique. If the pull-up and pull-down ISI is identical, equalization can use the same tap weight for the pull-up and pull-down data [3]. However, memory interfaces use the single-ended signaling with the asymmetric termination [4], and the characteristics of the pull-up and pull-down output drivers are also asymmetric [5]–[8]. Thus rising and falling edge responses are likely to differ [9], so that pull-up and pull-down data have different amounts of ISI.

Manuscript received February 1, 2019; revised February 28, 2019; accepted April 4, 2019. Date of publication April 12, 2019; date of current version January 31, 2020. This brief was recommended by Associate Editor J. Park. (*Corresponding author: Suhwan Kim.*)

J.-H. Chae and M. Kim were with the Department of Electrical and Computer Engineering, Seoul National University, Seoul 08826, South Korea. They are now with DRAM Design, SK hynix, Icheon-si 17336, South Korea.

S. Choi and S. Kim are with the Department of Electrical and Computer Engineering, Seoul National University, Seoul 08826, South Korea, and also with the Inter-University Semiconductor Research Center, Seoul National University, Seoul 08826, South Korea (e-mail: suhwan@snu.ac.kr).

Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org.

Digital Object Identifier 10.1109/TCSII.2019.2911017

To compensate for the asymmetric response of the output driver, asymmetric feed-forward equalization (FFE) [10], [11] has been used to provide different amounts of emphasis pulses to the pull-up and pull-down data. However, transmitterside equalization techniques require a large number of unit drivers in the output driver to obtain the asymmetric tap weights correctly. This makes impedance matching difficult and increases the pre-driver complexity and power consumption; this situation becomes worse if the number of taps is increased [12]. Since more current is consumed by the output driver when equalization is performed, the signaling and switching power increased, leading to a lot of simultaneous switching noise (SSN) that is one of the limiting factors in the performance of single-ended signaling.

To alleviate the above issues, we present a receiver-side single-ended 1-tap DFE, which can reduce ISI without significantly increasing noise or power consumption, with independently controlled pull-up and pull-down tap weights for asymmetric memory interfaces. This accounts for the asymmetric 1<sup>st</sup> post-tap ISI of the pull-up and pull-down data, by assigning a different DFE tap weight to each direction of data transition, thus producing more accurate equalization.

The rest of this brief is organized as follows: in Section II we describe the design considerations for asymmetric memory interfaces; in Section III we introduce our 1-tap asymmetric DFE; in Section IV we show how this DFE has been implemented; in Section V we present experimental results; and in Section VI we draw conclusions.

## II. DESIGN CONSIDERATIONS FOR ASYMMETRIC MEMORY INTERFACES

### A. LPDDR4/4X Memory Interface

Fig. 1(a) shows an LPDDR4 memory interface with a lowvoltage swing terminated logic (LVSTL) driver and a  $V_{SSQ}$ termination. The LVSTL driver is configured as an N-over-N driver that uses NMOS in both its pull-up and pull-down drivers. The pull-down driver operates in the linear region, while the non-linear pull-up driver operates in the saturation region [5]. If channel length modulation is not considered, the currents flowing the pull-up and pull-down driver can be expressed as follows:

$$I_{PU,LPDDR4} \approx \frac{1}{2} \cdot \beta \cdot (V_{DDQ} - V_{RX} - V_{TH})^2 \qquad (1)$$

and

$$I_{PD,LPDDR4} \approx \frac{1}{2} \cdot \beta \cdot [2 \cdot (V_{DDQ} - V_{TH}) \cdot V_{RX} + V_{RX}^2], \quad (2)$$

where  $\beta$  is  $\mu_n \cdot C_{ox} \cdot (W/L)$ ,  $\mu_n$  is electron mobility,  $C_{ox}$  is the gate oxide capacitance per unit area, and  $V_{TH}$  is a threshold

1549-7747 © 2019 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications\_standards/publications/rights/index.html for more information.



Fig. 1. (a) LPDDR4 and (b) LPDDR4X memory interfaces with an LVSTL driver and a  $V_{SSO}$  termination.



Fig. 2. Simulated single-bit responses of  $V_{RX}$  at 10Gb/s, for (a) the LPDDR4 and (b) the LPDDR4X memory interface of Fig. 1.

voltage. Since the current characteristics according to the voltage of the pull-up and the pull-down driver are different, the pull-up and pull-down drivers are not symmetric [5], [6].

Fig. 1(b) shows an LPDDR4X memory interface in which the output driver uses an isolated supply voltage  $V_{DDQL}$ , which is lower than the supply voltage  $V_{DD}$  of the pre-driver [7]. The introduction of  $V_{DDQL}$  allows both pull-up and pull-down drivers to operate in the linear region. The currents flowing the pull-up and pull-down drivers become [7]:

$$I_{PU,LPDDR4X} \approx \frac{1}{2} \cdot \beta \cdot [2 \cdot (V_{DD} - V_{TH}) \cdot (V_{DDQL} - V_{RX}) + (V_{DDQL} - V_{RX})^2]$$
(3)

and

$$I_{PD,LPDDR4X} \approx \frac{1}{2} \cdot \beta \cdot [2 \cdot (V_{DD} - V_{TH}) \cdot V_{RX} + V_{RX}^2].$$
<sup>(4)</sup>

Asymmetric pull-up and pull-down signals can be generated by varying the amount of current flowing to each driver, which is controlled by  $V_{RX}$ .

We simulated the single-bit response of pull-up and pulldown data transmission to the LPDDR4 and LPDDR4X memory, with  $V_{DDQ}$  set to 1.1V in the LPDDR4 interface, and  $V_{DD}$  and  $V_{DDQL}$  set to 1.1V and 0.6V respectively in the LPDDR4X interface. The simulated channel loss was -9.5dB at 5.0GHz. Fig. 2(a) and (b) show their simulated results at 10Gb/s. The asymmetry of these interfaces is apparent, as the pre-tap, 1<sup>st</sup> post-tap, and 2<sup>nd</sup> post-tap ISIs of the pull-up and pull-down data signal are different.

### B. GDDR6/DDR4 Memory Interface

Fig. 3 shows a GDDR6 and DDR4 memory interface, consisting of a pseudo open drain (POD) driver [13], in which both pull-up and pull-down drivers operate in the linear region, and a  $V_{DDQ}$  termination. The GDDR6 memory interface adopts the unmatched on-resistances of  $60\Omega$  and  $40\Omega$  in the pull-up and pull-down drivers, and the  $V_{DDQ}$  termination of  $60\Omega$  [8]. The corresponding values are matched in



Fig. 3. GDDR6 and DDR4 memory interface.



Fig. 4. (a) Simulated single-bit response of  $V_{RX}$  in the GDDR6 memory interface at 12Gb/s, and (b) in the DDR4 memory interface at 10Gb/s.

the DDR4 memory interface, but the strength of the pull-up driver differs from that of the pull-down driver, due to the different characteristics of PMOS and NMOS. Furthermore, asymmetric  $V_{DDQ}$  termination can increase the asymmetry of the transmitted signal in these interfaces [4].

We performed single-bit response simulations of these interfaces, with a  $V_{DDQ}$  of 1.35V and 1.1V in the GDDR6 and DDR4 memory interfaces. The channel loss is -9.5dB at a Nyquist frequency. Fig. 4(a) and (b) show simulated single-bit responses of the GDDR6 and DDR4 memory interfaces at a data-rate of 12Gb/s and 10Gb/s. The 1<sup>st</sup> post-tap ISIs of the pull-up and pull-down data signal are different, and the pre-tap and 2<sup>nd</sup> post-tap ISIs of the pull-up and pull-down signal are also asymmetric. These results demonstrate that the different characteristics of the pull-up and pull-down drivers cause the asymmetrical ISI, which requires asymmetric equalization.

#### C. Asymmetry According to Channel Loss

To verify the relationship between the channel loss and the asymmetry of the tap weight, we performed the above simulation again for several channel losses in each interface. Fig. 5 shows the simulated difference between the pull-up and pull-down tap weights, according to channel loss. The asymmetry increases with the channel loss.

## III. 1-TAP DFE WITH DIFFERENT PULL-UP AND PULL-DOWN TAP WEIGHTS

Fig. 6 shows a block diagram of a 1-tap asymmetric DFE, which consists of a weight selection multiplexer (WMUX), a summer, and a sampler. This figure also shows that the pullup and pull-down data results in different post-tap ISI in the waveforms passing through the channel. The asymmetric DFE can correct this imbalance by applying a weight  $W_{PU}$  to the pull-up data and a weight  $W_{PD}$  to the pull-down data.

Both symmetric and asymmetric DFE based on data-state improves the data eye by compensating for post-tap ISI when there is a data transition, but it reduces the height of the data eye when there is no transition [14]. We define the data eye as an AC data eye when there is a data transition



Fig. 5. Simulated difference between the pull-up and pull-down weights.



Fig. 6. Block diagram of the 1-tap DFE with the different pull-up and pull-down tap weights.

and the DC data eve when there is no data transition. To demonstrate the concept of our DFE, Fig. 7 shows how symmetric and asymmetric DFEs remove ISI from a single-bit response [14], [15]. Fig. 7(a) and (b) show waveforms that the same tap weight is applied to both pull-up and pull-down data when 1st post-tap ISI of the pull-up data is larger than that of the pull-down data. Applying a weight W<sub>PD</sub> appropriate for the 1st post-tap ISI of the pull-down data to the pull-up data as well, results in less-equalization. Conversely, if the DFE applies a weight WPU based on the 1st post-tap ISI of the pull-up data to the pull-down data as well, there is over-equalization. Data eyes of equal heights are only obtained when different pull-up and pull-down weights W<sub>PU</sub> and W<sub>PD</sub> are applied by the DFE to each direction of the data transition, as shown in Fig. 7(c). When an average weight  $W_{AV}$  of pull-up and pull-down weights WPU and WPD in Fig. 7(c) is applied, the AC and DC data eyes are still different, as shown in Fig. 7(d).

#### **IV. IMPLEMENTATION**

## A. Architecture

To assess the effectiveness of asymmetric equalization, we designed the receiver shown in Fig. 8, which has a 1-tap DFE with different pull-up and pull-down tap weights. This receiver has a quarter-rate architecture to ensure a relaxed timing margin on the direct feedback path of the DFE [2], [16].

Quadrature clock signals CK<sub>0</sub>, CK<sub>90</sub>, CK<sub>180</sub>, and CK<sub>270</sub> are generated from the incoming differential clock signals CK and CKB in the clock path. Three resistor ladders generate the reference voltage V<sub>REF</sub> and the pull-up and pull-down tap weights W<sub>PU</sub> and W<sub>PD</sub>, as specified by the selection signals SEL<sub>VREF</sub><5:0>, SEL<sub>WPU</sub><3:0>, and SEL<sub>WPD</sub><3:0>.

The quarter-rate DFE consists of four units DFE<sub>0</sub>, DFE<sub>90</sub>, DFE<sub>180</sub>, and DFE<sub>270</sub>, each of which contains a WMUX, a summer, a sampler, and a latch. This DFE compensates for the 1<sup>st</sup> post-tap ISI of the asymmetric input data signal  $D_{RX}$  by assigning different pull-up and pull-down tap weights, and this compensated data signal are demuxed into four quarter-rate data signals  $D_{OUT,0}$ ,  $D_{OUT,90}$ ,  $D_{OUT,180}$ , and  $D_{OUT,270}$  by the quadrature clock signals. In the test setup, we monitored  $D_{OUT,0}$  to measure the bit-error-rate (BER).



Fig. 7. (a) Less-equalization and (b) over-equalization, resulting from equal pull-up and pull-down tap weights, (c) improved equalization by an asymmetric DFE, and (d) equalization by an average of asymmetric weights.



Fig. 8. Block diagram of a quarter-rate receiver with a DFE with the different pull-up and pull-down tap weights.



Fig. 9. Block diagram of the clock path.

## B. Clock Path

The clock path contains an input clock buffer, a duty-cycle corrector (DCC), and a quadrature clock generator, as shown in Fig. 9. The differential clock signals CK and CKB are amplified in 2-stage input clock buffers based on a current-mode logic (CML) amplifier, with a negative capacitance circuit to



Fig. 10. Circuit diagram of one of the four DFE units (DFE $_0$ ).



Fig. 11. Die micrograph and block description.

increase its bandwidth. The duty-cycle corrector increases the signal bandwidth, corrects duty-cycle errors, and changes the level of the clock signals to CMOS voltage. These signals pass to the quadrature clock generator, consisting of two sense amplifiers (SAs) and two latches, which converts them to the quadrature clock signals  $CK_{0}$ ,  $CK_{90}$ ,  $CK_{180}$ , and  $CK_{270}$ , which then enter the DFE.

### C. Decision Feedback Equalizer

Fig. 10 is the circuit diagram of one of the DFE units  $(DFE_0)$ , which consists of a WMUX, a summer, a sampler, and a latch. To reduce the offset mismatch, MOS devices of sufficiently large size are used. When the clock signal  $CK_0$ goes low, all the circuits in the DFE $_0$  reset. The WMUX selects either W<sub>PU</sub> or W<sub>PD</sub>, depending on the value of the previous data bits D<sub>270</sub> and DB<sub>270</sub>, and this tap weight is subtracted from the voltage at the node A or B of the sampler through the summer when  $CK_0$  goes high; thus, the selected tap weight is reflected in the current data bit. When CK<sub>0</sub> goes high, current data D<sub>0</sub> and DB<sub>0</sub> are determined, depending on the difference between the voltages of nodes A and B. The latch holds the value of the current data bit, and outputs D<sub>OUT.0</sub>. In the postlayout simulation, the feedback loop delay, which limits the maximum data-rate, of our DFE is 78ps and 99.8ps in typical and slow process corners with a supply voltage of 1.3V.

## V. MEASUREMENT RESULTS

A prototype chip was implemented in a 55nm CMOS technology with a supply voltage of 1.3V. Fig. 11 shows a micrograph of the die, which has a total area of 0.06 mm<sup>2</sup>, including the decoupling capacitor. The resistor ladders which generate  $W_{PU}$ ,  $W_{PD}$ , and  $V_{REF}$  are the largest block, but they can be shared by several DQ paths.

Fig. 12(a) shows our measurement setup. The differential clock signals CK and CKB and the data signal  $D_{RX}$  are generated in the pattern generator of the signal quality analyzer (Anritsu MP1800A), and transmitted to the chip. The BER was



Fig. 12. (a) Measurement setup and (b) the measured channel loss of the  $D_{RX}$  path.



Fig. 13. Measured single-bit responses of (a) pull-up and (b) pull-down data before and after the channel, at 10.4Gb/s.



Fig. 14. Measured eye diagrams of the receiver data input  $D_{RX}$  before and after the channel, at 10.4Gb/s with the PRBS7 pattern.

found using the error detector in the signal quality analyzer, and the eye diagrams of the input data signal  $D_{RX}$  and the output data signal  $D_{OUT,0}$  were displayed on an oscilloscope (Tektronix MSO73304DX). The measured channel loss of the  $D_{RX}$  path, which consists of an SMA cable, an SMA connector, and an FR4 trace, is shown in Fig. 12(b): the insertion channel loss is -8.3dB at the Nyquist frequency of 5.2GHz.

The single-bit response was measured at 10.4Gb/s, before and after the signal goes through the channel of the  $D_{RX}$  path to verify that the pull-up and pull-down data exhibit different characteristics in the asymmetric interface. Fig. 13(a) shows the single-bit response of the pull-up data: it has a rise time of 35ps before passing through the channel, and a main-tap of 347mV and a 1st post-tap of 20mV after passing through the channel. Fig. 13(b) shows pull-down data, which has a fall time of 31ps before passing through the channel, and a maintap of 282 mV and a 1st post-tap of 42 mV after passing through the channel. The difference between the 1<sup>st</sup> post-tap voltages of the pull-up and pull-down data justify an asymmetric equalization. There are some ISI after the 1<sup>st</sup> post-tap due to several discontinuity points in the measurement setup.

TABLE I Performance Summary and Comparison With Other DFE Designs

|                     | [3]          | [16]                 | [17]                 | [18]                | This Work             |
|---------------------|--------------|----------------------|----------------------|---------------------|-----------------------|
| Technology          | 65nm         | 40nm                 | 130nm                | 65nm                | 55nm                  |
| Supply              | 1.2V         | 1.0V                 | 1.2V                 | 0.8V                | 1.3V                  |
| Data-Rate           | 6.4Gb/s      | 20Gb/s               | 3.4Gb/s              | 12.5Gb/s            | 10.4Gb/s              |
| Architecture        | Half-rate    | Quarter              | Quarter              | Quarter             | Quarter               |
|                     |              | -rate                | -rate                | -rate               | -rate                 |
| Signaling           | Differential | Differential         | Single               | Single              | Single                |
|                     |              |                      | -ended               | -ended              | -ended                |
| Equalization        | 1-tap DFE    | 1-tap DFE            | 2-tap                | 2-tap               | 1-tap DFE             |
|                     |              | +2 IIR filter        | DFE                  | DFE                 | (asymmetric)          |
| Channel Loss        | -11.96dB     | -18.3dB              | N/A                  | -14dB               | -8.3dB                |
| Timing Margin       | 0.56UI       | 0.20UI               | 0.20UI               | <sup>b</sup> 0.35UI | 0.18UI                |
| @BER                | @10-11       | @10-12               | @10-12               | @10 <sup>-12</sup>  | @10 <sup>-12</sup>    |
| <sup>a</sup> Power  | 1.13mW       | 6.2mW                | 3.03mW               | 2.75mW              | 1.71mW                |
| <sup>a</sup> Energy | 0.17nI/bit   | 0.31nI/bit           | 0.89nJ/bit           | 0.22nJ/bit          | 0.16nI/bit            |
| Efficiency          | 0.17 ps/01   | 0.5100/01            | 0.0500/010           | 0.22ps/on           | 0.1000000             |
| <sup>a</sup> FoM    | 0.014        | 0.017                | N/A                  | 0.016               | 0.019                 |
| (pJ/bit/dB)         |              |                      |                      |                     |                       |
| <sup>a</sup> Area   | N/A          | 0.005mm <sup>2</sup> | $0.002 \text{ mm}^2$ | N/A                 | $0.001 \mathrm{mm^2}$ |

<sup>a</sup> DFE core only <sup>b</sup> Uses PRBS31 pattern



Fig. 15. (a) Measured bathtub curves at 10.4Gb/s, with no DFE, with the symmetric DFE, and with the asymmetric DFE, and (b) measured eye diagram of the 1:4 demuxed output data  $D_{OUT,0}$ .

Fig. 14 shows the eye diagram of a PRBS7 pattern before and after passing through the channel to the receiver. Before passing through the channel, the eye has a width of 85ps and a height of 400mV. After passing through the channel, it has a width of 43ps and a height of 152mV. Due to pad capacitances of the test board and the chip, the bonding wire, and the input capacitance of the receiver, the eye opening of the receiver data input  $D_{RX}$  is more closed in the real operation.

The BER performance was measured by applying the same PRBS7 pattern through the channel of the  $D_{RX}$  path. Fig. 15(a) shows measured bathtub curves at a data-rate of 10.4Gb/s, without the DFE, with the symmetric DFE, and with the asymmetric DFE. Operation at 10.4Gb/s is feasible with the 1-tap symmetric DFE, which increases the width of the previously closed eye to a timing margin of 0.15UI and 0.12UI, at a BER of  $10^{-12}$  with the same pullup and pull-down weights of 10mV and 20mV. The width of the horizontal margin is further increased by the asymmetric 1-tap DFE, to a timing margin of 0.18UI at a BER of  $10^{-12}$ . When symmetric and asymmetric weights are applied, their power consumptions are approximately equal, which is about 1.71mW. Therefore, the asymmetric DFE achieves better equalization effect than the symmetric DFE, without power and area overhead. Fig. 15(b) shows the eye diagram of the 1:4 demuxed data output D<sub>OUT,0</sub> when the asymmetric DFE is in operation.

The performance of our DFE is summarized and compared with previous designs [3], [16]–[18] in Table I.

#### VI. CONCLUSION

We have presented a receiver-side single-ended 1-tap DFE for asymmetric memory interfaces. It uses asymmetric pull-up and pull-down tap weights, determined from data transition direction to eliminate different amounts of ISI in the pull-up and pull-down data. Fabricated in a 55nm CMOS technology, our DFE compensated for asymmetric ISI at a data-rate of 10.4Gb/s in a channel with an insertion loss of -8.3dB. The energy efficiency of the DFE is 0.16pJ/bit.

#### REFERENCES

- J.-H. Chae *et al.*, "266–2133MHz phase shifter using all-digital delay-locked loop and triangular-modulated phase interpolator for LPDDR4X interface," *Electron. Lett.*, vol. 53, no. 12, pp. 766–768, Jun. 2017.
- [2] J.-H. Chae, H. Ko, J. Park, and S. Kim, "A 12.8Gb/s quarter-rate transmitter using a 4:1 overlapped multiplexing driver combined with an adaptive clock phase aligner," *IEEE Trans. Circuits Syst. II, Exp. Briefs*, vol. 66, no. 3, pp. 372–376, Mar. 2019.
- [3] S. Kim, E.-Y. Jin, K.-W. Kwon, J. Kim, and J.-H. Chun, "A 6.4-Gb/s voltage-mode near-ground receiver with a one-tap data and edge DFE," *IEEE Trans. Circuits Syst. II, Exp. Briefs*, vol. 61, no. 6, pp. 438–442, Jun. 2014.
- [4] H. Seol *et al.*, "Energy efficient data encoding in DRAM channels exploiting data value similarity," in *Proc. IEEE/ACM Int. Symp. Comput. Archit.*, Jun. 2016, pp. 719–730.
- [5] H.-K. Jung et al., "A 4.35Gb/s/pin LPDDR4 I/O interface with multi-VOH level, equalization scheme, and duty-training circuit for mobile applications," in *Symp. VLSI Circuits Dig.*, Jun. 2015, pp. 184–185.
- [6] J. Eble, M. Li, and W. Beyene, "An implementer's guide to lowpower and high-performance memory solutions," in *Proc. DesignCon*, Jan. 2014, pp. 1–96.
- [7] C.-K. Lee et al., "A 5Gb/s/pin 8Gb LPDDR4X SDRAM with powerisolated LVSTL and split-die architecture with 2-die ZQ calibration scheme," in *IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech.* Papers, Feb. 2017, pp. 390–392.
- [8] (2017). JEDEC Solid State Technology Association: GDDR6 SGRAM Standard. [Online]. Available: http://www.jedec.org/standardsdocuments/docs/jesd250
- [9] J. Ren, D. Oh, and S. Chang, "Hybrid statistical and time-domain simulation methodology for high-speed links," in *Proc. DesignCon*, Feb. 2010, pp. 1002–1020.
- [10] M. Raj, M. Monge, and A. Emami, "A modeling and nonlinear equalization technique for a 20Gb/s 0.77pJ/b VCSEL transmitter in 32nm SOI CMOS," *IEEE J. Solid-State Circuits*, vol. 51, no. 8, pp. 1734–1743, Aug. 2016.
- [11] G. Belfiore, M. Khafaji, R. Henker, and F. Ellinger, "A 50Gb/s 190mW asymmetric 3-tap FFE VCSEL driver," *IEEE J. Solid-State Circuits*, vol. 52, no. 9, pp. 2422–2429, Sep. 2017.
- [12] W.-J. Su and S.-I. Liu, "A 5Gb/s voltage-mode transmitter using adaptive time-based de-emphasis," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 64, no. 4, pp. 959–968, Apr. 2017.
- [13] (2017). JEDEC Solid State Technology Association: DDR4 SDRAM Standard. [Online]. Available: http://www.jedec.org/standardsdocuments/docs/jesd79-4a.
- [14] Y. Li and F. Yuan, "Adaptive data-transition decision feedback equalizer for serial links," in *Proc. IEEE Int. Midwest Symp. Circuits Syst.* (MWCAS), Aug. 2017, pp. 1609–1612.
- [15] S. Son et al., "A 2.3-mW, 5-Gb/s low-power decision-feedback equalizer receiver front-end and its two-step, minimum bit-error-rate adaptation algorithm," *IEEE J. Solid-State Circuits*, vol. 48, no. 11, pp. 2693–2704, Nov. 2013.
- [16] K.-Y. Chen, W.-Y. Chen, and S.-I. Liu, "A 0.31-pJ/bit 20-Gb/s DFE with 1 discrete tap and 2 IIR filters feedback in 40-nm-LP CMOS," *IEEE Trans. Circuits Syst. II, Exp. Briefs*, vol. 64, no. 11, pp. 1282–1286, Nov. 2017.
- [17] W.-H. Shin, Y.-H. Jun, and B.-S. Kong, "A DFE receiver with equalized VREF for multidrop single-ended signaling," *IEEE Trans. Circuits Syst. II, Exp. Briefs*, vol. 60, no. 7, pp. 412–416, Jul. 2013.
- [18] I.-M. Yi et al., "A time-based receiver with 2-tap decision feedback equalizer for single-ended mobile DRAM interface," *IEEE J. Solid-State Circuits*, vol. 53, no. 1, pp. 144–154, Jan. 2018.