Solving the problem of Flash memory density

How improved die-stacking technology reduces pin count, board footprint and system complexity

Consumer demand is relentless in both direction and force: buyers of mobile and computing devices are constantly looking for products that offer more features and better performance in a smaller, lighter, sleeker form factor. Applications that once would have required a laptop have migrated to the smartphone. Now smart watches are running applications that were previously available only on smartphones.

By K. C. Shekar, Senior Director of Strategic Marketing, and Takehiro Kaminaga, Assistant Professional Manager, Winbond Electronics Corporation

When product designers are evaluating new ways to meet this constant demand for miniaturization, system storage is one of their top priorities: serial NOR Flash memory (typically used for storing code) and serial NAND Flash memory (typically used for data storage and as a back-up location for code) occupy a relatively large proportion of the total board real estate in many space-constrained designs.

Winbond, which produces around 30% of the world’s serial NOR and serial NAND Flash ICs, has previously helped to address this problem through its invention of the Quad Serial Peripheral Interface (QSPI), a high-bandwidth version of the standard serial peripheral interface that enables very high-speed Read/Write operations between a Flash memory and a host System-on-Chip (SoC), microprocessor or microcontroller. Supporting data rates comparable to those achievable with parallel Flash but using far fewer channels and smaller packages, Winbond’s QSPI innovation was an important step in the miniaturization of the memory system’s board footprint.

But more recently serial Flash manufacturers have adopted an additional way to increase memory capacity while reducing footprint: stacking Flash memory dies inside a single package.

This article outlines the impacts of replacing two or more discrete Flash memory ICs with a single stacked-die package, and describes a new approach to die stacking developed by Winbond that gives improved performance and a reduction in pin count and board footprint compared to alternative solutions.

The benefits of stacking memory dies

A typical Flash memory use case in a small device design might consist of a 16Mbit serial NOR Flash device for code storage, and a 1Gbit serial NAND Flash device for data and a Linux® operating system. NOR Flash offers superior random access performance and excellent endurance and retention, important features for storing code that is read very frequently. NAND Flash offers faster programming, and is markedly cheaper than NOR Flash at densities greater than 512Mbits. Latency in Read operations is longer, but this is acceptable for data that is infrequently accessed.

Typically, then, system designs today have to accommodate on the board two Flash memory packages, each with its own interface to an SoC. In this use case, the use of a stacked-die package offers an immediate reduction in board footprint (see Figure 1). In a heterogeneous package, a smaller NOR Flash die may be stacked on top of a larger NAND Flash die.

Fig. 1: a smaller NOR Flash die is stacked on top of a larger NAND Flash die. The dies and bond wires may be mounted to a leadframe or substrate. (Image credit: Winbond)

It is equally possible to benefit from the use of a homogeneous stacked solution, in which a NOR die is stacked on a NOR die or a NAND on a NAND. For instance, if an existing product design using a 512Mbit NAND Flash IC requires additional data storage capacity, the conventional approach would be to replace the 512Mbit IC with a 1Gbit device. But if the 1Gbit device has a different footprint and pin-out, this change will entail some time-consuming board re-design.

A homogeneous stacked package can eliminate the need for a board re-design: a 512Mbit die stacked on top of a 512Mbit die can be housed in the same package with the same pin-out as the single-die 512Mbit IC, allowing the designer to double memory capacity instantly with no requirement for a board re-spin. These benefits – reduced board footprint and reduced development effort – are inherent to any implementation of the stacked-die concept.
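The capacity arithmetic behind this example can be sketched in a few lines of Python. This is only an illustration: the flat byte address and the locate() helper are hypothetical host-side conveniences, not part of any Winbond driver, and a real NAND device is addressed by page and block rather than by a simple byte offset.

```python
from typing import Tuple

# Illustrative figures taken from the example above: two 512Mbit dies in one package.
DIE_BITS = 512 * 1024 * 1024      # 512Mbit per die
DIE_BYTES = DIE_BITS // 8         # 64 MiB per die
NUM_DIES = 2

def locate(flat_address: int) -> Tuple[int, int]:
    """Hypothetical helper: map a flat byte address to (die index, offset in die)."""
    if not 0 <= flat_address < NUM_DIES * DIE_BYTES:
        raise ValueError("address outside the stacked package")
    return divmod(flat_address, DIE_BYTES)

total_mbit = NUM_DIES * DIE_BITS // (1024 * 1024)
print(total_mbit)                  # 1024, i.e. 1Gbit in the original footprint
print(locate(DIE_BYTES + 16))      # (1, 16): the second die, 16 bytes in
```

A host driver built this way could present the doubled capacity as one address range while the board layout stays unchanged.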

But now Winbond has introduced a new approach to the stacking of Flash dies which provides an even greater reduction in board footprint while also offering improved Read/Write performance.

Low pin-count stacking

One of the main challenges in implementing a stacked-die product is the provision of interfaces between the SoC or microcontroller and the two (or more) dies inside the Flash memory package. To avoid contention on the SPI bus, the SoC uses a Chip Select (CS) function that tells the stacked package which die it wishes to address.

In other stacked memory solutions, this CS function is implemented in hardware over dedicated CS lines, with one CS line to each die. So a package containing two dies has two CS lines, a package containing three dies has three CS lines, and so on. This has the unfortunate effect of requiring more pins on both the memory device and the SoC, as well as more traces on the board.

Fig. 2: implementing the Chip Select function in software requires only one CS pin. (Image credit: Winbond)

In the latest W25M SpiStack® Flash memory products from Winbond, this problem is solved by use of a software Chip Select function which operates over a single CS channel between the memory device and the SoC (see Figure 2). Each die in a SpiStack package has a unique ID, and a simple Chip Select command using the relevant ID tells all the dies in the package which one is to occupy the SPI bus.

This software CS function operates via just a single CS pin on the memory device and on the SoC, and requires only one board trace between them. This reduction in pin count enables Winbond to accommodate two, three or even four dies in a standard 8-pin SOP or 8-pad SON package, whereas other stacked-die implementations using a hardware CS function typically use 16-pin SOP or 24-ball BGA packages which entail a much larger and more complex board design (see Figure 3).

Fig. 3: multiple dies in a SpiStack device may be accommodated in an 8-pin package. (Image credit: Winbond)
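The pin-count argument can be checked with a short calculation. The sketch below assumes the usual quad-SPI signal set (clock, four I/O lines, power and ground) shared by all dies; the pins_required() function is purely illustrative and ignores package-specific details such as unconnected pins.

```python
# Signals shared by every die in a quad-SPI package (assumed standard pin-out).
SHARED_SIGNALS = ["CLK", "IO0", "IO1", "IO2", "IO3", "VCC", "GND"]

def pins_required(num_dies: int, software_cs: bool) -> int:
    """Count package pins: shared signals plus the Chip Select lines."""
    # Hardware CS needs one line per die; software CS always needs exactly one.
    cs_lines = 1 if software_cs else num_dies
    return len(SHARED_SIGNALS) + cs_lines

for dies in (1, 2, 3, 4):
    hw = pins_required(dies, software_cs=False)
    sw = pins_required(dies, software_cs=True)
    print(f"{dies} dies: hardware CS = {hw} pins, software CS = {sw} pins")
```

Under these assumptions, any hardware-CS package with two or more dies exceeds eight pins, while the software-CS count stays at eight regardless of the number of dies.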

It is easy for the designer to integrate the SpiStack CS function into system software. The C2h command is used to select any die in the Flash memory package (see Figure 4). It can be issued at any time to change the active die, regardless of its operating status. Only one selected die is active on the SPI bus at any one time. 


Fig. 4: the C2h Chip Select command uses a unique ID for each die in a SpiStack multi-chip package. (Image credit: Winbond)
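As a rough illustration of how little software this requires, the Python sketch below builds a die-select frame and hands it to a stand-in SPI bus. The frame layout (the C2h opcode followed by a single die ID byte) is an assumption made for this example and should be checked against the device datasheet; FakeSpiBus and die_select_frame() are hypothetical names, not a Winbond API.

```python
SOFTWARE_DIE_SELECT = 0xC2   # the "C2h" opcode named in the article

def die_select_frame(die_id: int) -> bytes:
    """Build the bytes to clock out while CS is asserted.

    Assumption: one opcode byte followed by one 8-bit die ID.
    """
    if not 0 <= die_id <= 0xFF:
        raise ValueError("die ID must fit in one byte")
    return bytes([SOFTWARE_DIE_SELECT, die_id])

class FakeSpiBus:
    """Stand-in for a real SPI driver; it only records what would be sent."""
    def __init__(self) -> None:
        self.sent = []

    def transfer(self, payload: bytes) -> None:
        self.sent.append(bytes(payload))

bus = FakeSpiBus()
bus.transfer(die_select_frame(1))   # make die 1 the active die
print(bus.sent[0].hex())            # c201
```

In a real system the FakeSpiBus object would be replaced by the platform's own SPI driver; the rest of the Flash command set is then issued to whichever die was selected last.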

Faster Read/Write operations

It is an inherent attribute of Flash memory technology that programming operations are relatively slow by comparison with other technologies such as SRAM and DRAM. When using a single-die Flash IC, then, it often happens that the system needs to read from memory while a long Write operation is still in progress. The controller then has to issue a command to suspend writing, read out from the memory, and then issue another command to resume writing. This complicates use of the Flash device, and slows both the Read and Write operations.

Winbond’s introduction of a new concurrent operation feature in its SpiStack products removes this bottleneck. Put simply, concurrent operation enables the SoC to maintain Write or Erase operations on one die in a SpiStack product while reading from another die (see Figure 5).

Fig. 5: in a two-die SpiStack device, one die can occupy the SPI bus while another performs program or erase functions. (Image credit: Winbond)

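As we saw above, only one die can access the SPI bus at any one time. But while one die occupies the bus, the others can continue internal operations that have already been started. A SpiStack device can therefore read while programming, read while erasing, program while erasing, or run program or erase operations on multiple dies simultaneously.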
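A toy timing model makes the benefit concrete. The Python sketch below does not talk to hardware: the StackedFlashModel class is hypothetical, the 3.0 ms program time and 0.5 ms read time are invented for illustration rather than taken from a datasheet, and command transfer time is ignored. A read aimed at a busy die simply waits for that die to finish, which stands in for a single-die device without suspend and resume.

```python
class StackedFlashModel:
    """Toy timing model of a multi-die package sharing one SPI bus."""

    def __init__(self, num_dies: int) -> None:
        self.busy_until = [0.0] * num_dies   # per-die end time of program/erase (ms)
        self.active = 0                      # die currently selected on the bus
        self.now = 0.0                       # simulated time (ms)

    def select(self, die: int) -> None:
        self.active = die

    def start_program(self, duration_ms: float) -> None:
        # The bus is free again once the command is issued; the die works internally.
        self.busy_until[self.active] = self.now + duration_ms

    def read(self, duration_ms: float) -> float:
        # A read has to wait if the selected die is still programming.
        start = max(self.now, self.busy_until[self.active])
        self.now = start + duration_ms
        return self.now

def read_finish_time(read_from_die: int) -> float:
    flash = StackedFlashModel(num_dies=2)
    flash.select(0)
    flash.start_program(3.0)        # illustrative 3.0 ms program on die 0
    flash.select(read_from_die)
    return flash.read(0.5)          # illustrative 0.5 ms read

print(read_finish_time(1))   # 0.5: read from the idle die, concurrent with the program
print(read_finish_time(0))   # 3.5: read from the busy die has to wait
```

With these made-up figures, reading from the second die returns data at 0.5 ms instead of 3.5 ms, and the program operation on the first die is never interrupted.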

In many applications, this support for concurrent memory operations provides for substantial increases in operating speeds. It means that a SpiStack device has the potential to operate faster than any comparable device operating in sequential mode only.

Many homogeneous and heterogeneous choices

SpiStack multi-die packages are available in a wide variety of package types and densities, in both heterogeneous (NOR + NAND) and homogeneous (NOR + NOR, NAND + NAND) configurations. An advantage of Winbond’s position as the world’s biggest manufacturer of serial Flash is that it provides system designers with the widest choice of memory densities and package options.

Users therefore enjoy considerable design flexibility, benefiting from the freedom to swap devices with a common footprint and pin-out to meet changing memory density requirements.

SpiStack devices available today include:

  • a 16Mbit NOR + 1Gbit NAND
  • a 512Mbit NOR consisting of two 256Mbit NOR dies
  • a 2Gbit NAND consisting of two 1Gbit NAND dies

Winbond’s SpiStack roadmap shows many other options becoming available during 2017, or on customer request.

For more product information, please visit Winbond SpiStack Flash

