3D integration is receiving widespread interest as lithographic scaling becomes more challenging, and as the ability to make miniature vias greatly improves. Like Moore's law, 3D integration improves density. With improved packaging density, however, come the challenges of its inherently higher power density. And although 3D integration acts somewhat as a scaling accelerator, vertical integration also poses new challenges to design and manufacturing technologies.
The placement of circuits, vias, and macros in the planes of a 3D stack must be co-designed across layers (or must conform to new standards) so that, when assembled, they have correct spatial correspondence. Each layer, although perhaps being a mere functional slice through a system (and we can slice the system in many different ways), must be independently testable so that we can systematically test and diagnose subsystems before and after final assembly. When those layers are assembled, they must come together in a way that enables a sensible yield and facilitates testing the finished product. To make the most of 3D integration, we should articulate the leverages of 3D systems (other researchers offer a more complete treatment elsewhere). Then we can enumerate and elucidate many of the new challenges posed by the design, assembly, and test of 3D systems.
Unique leverages of 3D integration
Although 3D integration affords the same gross benefits as Moore's law in circuit density, it's worth mentioning two imminent concerns about such scaling. We take no sides on either issue; we merely point out that where one stands on these concerns will color one's perception of the advantages of 3D integration.
First, as devices become smaller (say, below 45 nm), three things happen: the device performance doesn’t scale, device leakage increases, and device variability worsens. Using 3D integration, however, allows better density without making the devices smaller.
Second, the lithography needed to make devices significantly smaller is costly, and even questionable beyond the 15-nm mark. Yet 3D integration does not require smaller devices; it works independently of, and in synergy with, this kind of scaling.
Besides raw density, 3D integration allows five new degrees of freedom; none of them is useful in all market sectors, but each is useful in some. Whether they represent a real opportunity to you depends on exactly what you're trying to do, and the reason you're trying to do it. Those considerations strongly influence how you should practice the art of 3D design.
First, by integrating multiple components into a single stack (that is, a single component), 3D integration enables a simpler package to suffice, and it simplifies the subsequent assembly processes for the end product. This could represent a significant cost advantage, assuming that the cost of the 3D component is reasonable, and that volumes are sufficient to amortize the nonrecurring engineering (NRE) costs of 3D.
Second, the modular integration of layers can enable a range of products to be made from a common set of subsystems (where we consider each layer a subsystem). This has the effect of volumizing those subsystems (which reduces cost), and it simplifies the overall design effort associated with that range of products (which also reduces cost). This is exactly the philosophy behind ASIC books, but here we practice it directly at the physical level.
A third consideration is that 3D integration lets us combine disparate technologies within a single stacked component, such as DRAM with high-speed logic, in a manner that doesn't compromise either technology. It could also let us combine 65-nm technology with 45- or 32-nm technology, which could save cost and schedule on new products by allowing the direct reuse of system parts that don't particularly need updating for the new product. It could also enable a simpler, lower-power integration of communications subsystems such as silicon-germanium (SiGe) technology, gallium arsenide (GaAs) semiconductors, and optoelectronics, as well as accelerators.
A fourth factor is that, with 3D integration, we can incorporate pieces of the electrical and service infrastructures directly, for much better electrical performance. For instance, integrating voltage regulators within a stack delivers cleaner power locally, more efficiently, and more controllably. This can also allow power distribution at a higher voltage and lower current, which gives us greener technology. We could integrate passives more elegantly (for example, by placing decoupling capacitors liberally and locally to stiffen the power rails). Also, we could incorporate clocking and test-related logic in a more modular way.
Finally, with a small via pitch, it's possible to build short, wide vertical buses within a 3D stack. This is useful only if there are elements within the stack that we can place co-spatially, and which would benefit from massive bandwidth. It's this last point that raises issues about how to make vias and how to place them. Of these five leverages, only the last one affords a direct system-level performance advantage, and it does so only for systems that would benefit from higher internal bandwidth to an integrated (within the stack) cache structure. Integrating cache layers within the stack (instead of connecting to them as off-chip entities) eliminates the slower, higher-power off-chip buses, and instead connects to them with higher-bandwidth, lower-latency vertical buses.
The figure above shows micro-C4 connections between layers in a stack. Not only does this increase the bandwidth to the cache layers dramatically, it also improves the access latency and lowers the transmission power.
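To give a feel for why such vertical buses are attractive, the sketch below estimates the bandwidth available from a block of signal TSVs. Every number in it (the 50-micron pitch, the 1 mm² bus footprint, the 1-GHz signaling rate) is an illustrative assumption, not a figure from this article.

```python
# Back-of-the-envelope vertical-bus bandwidth (all numbers are assumptions).
via_pitch_um = 50.0   # assumed TSV pitch
bus_area_mm2 = 1.0    # assumed silicon footprint reserved for the bus
clock_hz = 1e9        # assumed per-signal toggle rate

# Vias available per square millimetre at that pitch, then the bus width
# that fits in the footprint, then the resulting raw bandwidth.
vias_per_mm2 = (1000.0 / via_pitch_um) ** 2
bus_width = int(bus_area_mm2 * vias_per_mm2)
bandwidth_bps = bus_width * clock_hz

print(f"{bus_width} signal vias -> {bandwidth_bps / 8e9:.0f} GB/s at 1 GHz")
```

Even this crude model shows why a stack with co-spatial cache and logic layers can enjoy bandwidth that off-chip buses cannot approach.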
Compared to the tens of millimeters of wire needed to connect logic and memory chips in the 2D plane, within a 3D stack we can connect them using through-silicon vias (TSVs) that are mere tens of microns long: several orders of magnitude shorter. The wire-limited performance improvement through vertical integration is projected to scale as the square root of the number of layers in a 3D stack. Other studies show that increasing the number of active layers through vertical integration significantly improves interconnection performance and bandwidth. Researchers have proposed various alternatives for interconnecting the layers in a stack, including micro-C4 techniques and Cu-Cu thermocompression bonding.
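The two scaling claims above can be made concrete with a short script. The specific lengths chosen (a 20-mm planar wire, a 30-micron TSV) are representative assumptions, not measurements from the cited studies; only the square-root projection comes from the text.

```python
import math

# Representative interconnect lengths (assumed, not measured):
# a cross-chip 2D wire of tens of millimeters vs. a TSV of tens of microns.
wire_2d_m = 20e-3   # assumed 20-mm planar wire between logic and memory
tsv_m = 30e-6       # assumed 30-micron through-silicon via

length_ratio = wire_2d_m / tsv_m
print(f"2D wire is ~{length_ratio:.0f}x longer than a TSV")

# Projected wire-limited performance gain: sqrt of the number of layers.
for layers in (2, 4, 8):
    print(f"{layers} layers -> ~{math.sqrt(layers):.2f}x wire-limited gain")
```

The roughly three-orders-of-magnitude difference in wire length is what drives the latency and power advantages described above.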
Design and test challenges in 3D stacks
Because 3D integration opens up a whole new set of challenges, we should ask ourselves several questions about via pitch and placement, thermal issues, and scan chain reconfiguration, among other things, before designing and testing 3D stacks.
Vias: Pitch and placement
The first set of questions to answer before building anything in 3D revolves around TSVs, which are essential to 3D stacks because they're the means for interconnecting the layers. The answers to those questions depend heavily on why you're using 3D technology: How much current do you need? How many vias do you need? (And though the holes were rather small, they had to count them all. Now they know how many holes it takes to fill the Albert Hall.) How big do the vias need to be? Where should they be placed? For what are the vias (mostly) used? Of what material are the vias made? What are the vias' electrical, mechanical, and thermal characteristics? For lower-power applications such as commodity DRAM and the commercial mobile space, we need not bring lots of power conduits (vias) through the stack, and these applications don't require many signals. In markets like these, the number of vias is small, so their sizes, placements, and constructions don't bear heavily on other design considerations.
In high-power applications such as microprocessors, vias are paramount: they drive most of the other design considerations. In such applications, power must be delivered on a regular grid, and the distances between points on that grid must be short, because we can't distribute high power horizontally on the chip's back-end-of-line (BEOL) wiring. Therefore, we should place power vias regularly on a close grid, whose pitch will decrease as current demand increases. Further, power vias must extend through the BEOL wiring layers; they cannot simply use BEOL wiring to deliver large currents vertically. Consider a via that's etched through the bulk and that connects to the next chip layer through the BEOL wiring hierarchy. Although this kind of via is the easiest to make (because we need not extend the via through the BEOL) and suffices for getting signals through a layer, such a via structure is unsuitable for power delivery.
Therefore, although we might choose to use via structures like the one in the figure below for transmitting signals, we also need power vias that cut all the way through the BEOL structure. Although it might make sense to have multiple via types, allowing this can drive more design complexity than defining a single via type (the power via) and using that via for signals, too. Placing power vias in a grid tends to constrain the placement of signal vias. In particular, x wiring (and y wiring) can't run across the chip coincident with the x grid (or y grid) of power vias. Therefore, it makes the most sense to place groups of signal vias between the power vias, within that grid.
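A rough sizing exercise shows how current demand sets the power-via grid pitch. All of the numbers below (total supply current, per-via current limit, die dimensions) are hypothetical, chosen only to illustrate the relationship the text describes: higher current demand forces a tighter pitch.

```python
import math

# Hypothetical numbers for a high-power (microprocessor-class) layer;
# none of these values come from the article.
total_current_a = 100.0       # assumed total supply current for the layer
max_current_per_via_a = 0.05  # assumed current limit of one power via
die_width_mm = 15.0
die_height_mm = 15.0

# Vias needed (doubled for supply and return), then the square grid
# pitch at which that many vias tile the die area.
vias_needed = math.ceil(total_current_a / max_current_per_via_a) * 2
die_area_mm2 = die_width_mm * die_height_mm
pitch_mm = math.sqrt(die_area_mm2 / vias_needed)

print(f"{vias_needed} power vias on a ~{pitch_mm * 1000:.0f}-micron grid")
```

Doubling the current demand in this model shrinks the pitch by a factor of about the square root of two, which is why power-hungry designs end up with power vias dominating the floorplan.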
Finally, to connect the chips to the package and to establish an adequate power delivery grid, several studies have reported micro-bump heights of 21 microns at a 50-micron pitch for the controlled collapse chip connection new process (C4NP). Patel et al. ["Silicon Carrier with Deep Through-Vias, Fine Pitch Wiring and Through Cavity for Parallel Optical Transceiver," Proc. IEEE Electronic Components and Technology Conf., IEEE CS Press, 2005, pp. 1318-1324] also reported pitches ranging from 225 microns down to less than 50 microns, with TSV heights of 300 microns down to less than 50 microns. The TSV pitch and size not only determine the kind of 3D integration enabled at the system level; they also affect the stack's physical characteristics. Probing signals becomes more challenging as 3D integration grows more sophisticated (with finer TSV pitches and thinner silicon layers), which will cause more complications in the future.
Capturing and probing signals
In principle, signals that run between layers can connect arbitrary logic circuits within any latch-to-latch path spread across those layers. For example, the output of a logic gate on one layer might be the input to a logic gate on another layer. But it's necessary (at this time, in the nascency of 3D systems) to be able to access that signal for testing.
To access such a signal for logic testing, we need either to put a latch on both sides of the connecting via, or to connect the via contact to a landing pad that we can access with a test probe. If the signal is a logic-to-logic signal (that is, it is not on a latch boundary) and is not amenable to alternating current (AC) testing, then the first solution requires additional latches and an additional clocking structure just to capture this midcycle state for logic testing. The latter solution (a test probe) requires that we connect a large-capacitance electrostatic discharge (ESD) diode to the signal landing pad to protect the circuits from static charge on the test probe. This will degrade the signal by significantly slowing it.
Note that either way (putting a latch on both sides or connecting the via to a landing pad), having signals cross layers in the middle of a combinational logic flow is problematic when it comes to testing them, so designers should avoid such a situation. These approaches require significant innovation before they become practicable. A more practicable design has latches at both sides of any layer-to-layer interface (via). Ideally, both latches are part of the machine's logic flow (for example, in a latch-to-latch path), so that AC effects are not critical to test (assuming that a signal can comfortably traverse layers in a cycle). However, if the path is not strictly latch to latch, but you're sufficiently confident that timing will not be an issue because it will not need to be tested, then you should use boundary-scan latches to capture the layer-to-layer signals during the test. Of course, we can perform rudimentary AC tests with boundary-scan latches on a single layer, but launching a signal from a scan latch on a single layer (in a test) fails to account for the impedance of the via in the final product.
We state axiomatically that signals traversing a 3D structure through vias must be latch-to-latch signals to be testable, although the capturing latches might simply be boundary-scan latches that are not part of the nominal machine flow. Having said this, we can now differentiate between those signals that are accessible only through scan rings and those that a tester can probe. Most signals in a large system must be of the former type, simply because there are too many of them to probe. Although it is necessary to probe some of the signals (or at least to be able to connect tester probes to an on-chip test infrastructure), any signal being probed requires two things: a landing pad adjacent to the via that will accommodate the test probe, and a large-capacitance ESD structure that will protect the circuits from static discharge when we connect the probe. Also note that a test probe having hundreds of contacts will exert considerable force on the chip being tested. If the chip is several hundred microns thick, this is generally not a problem. But if it's only tens of microns thick, the test probe can do real damage, so testing layers prior to thinning them is prudent. However, testing chips in this way will obviously not detect problems created during the thinning process. In either case, test probes will damage the surface of the landing pads by leaving pockmarks and metal shards. We might need to repair (replanarize) the pads prior to stacking the tested chips, or these deformations can cause subsequent problems. A landing pad and an ESD device are big; having 3D stacks with many signals would be infeasible if all vias required them. Although we need the pads to give the tester accessibility to each layer, clearly most signals passing through vias will be accessible only to scan chains.
Assembling and reconfiguring scan chains
Although it's possible to improve the yield of a stack by using known-good dies (KGD) in a lab environment, the cost of doing this in production is prohibitive for a high-volume product. In an ideal manufacturing scenario, it's best to stack finished wafers and then dice them. Practically, we can't hope for all chip sites in all layers of all 3D stacks made this way to work, which is why it is essential to make all the layers independently testable prior to stacking them. If we do this, then at least we know what to expect when wafers are stacked, and when some of the 3D chiplets work incorrectly, we will know why and whether we can repair them. Therefore, to be cost-effective, it's essential to collect detailed failure data and analysis on each chip on a wafer. It's also essential (in a wafer-to-wafer assembly process) to incorporate enough redundancy into the layers and the system to be able to repair most problems.
As already discussed, each layer should be testable through scan rings. On-layer built-in self-test (BIST) engines will control many of these. All signals that enter and leave a layer (for example, all via points) should be accessible through a boundary scan. When designed in this way, each layer may have many rings, and it may take lots of time to test each layer.
Testing layers in this way might not be the most effective way to test the finished product (the aggregated stack), so we also need a way to test the layer-to-layer paths (the vias) in the finished product. Therefore, when the layers are finally assembled, we might want to reconfigure the scan rings within the aggregated product to enable a final testing procedure that’s both more complete and more efficient, which will require considerable forethought. Partitioning the scan chains effectively through the stack is important because the testable state (number and length of the scan chains) in the finished stack can be large, making the testing more challenging.
One approach is to serially connect the scan chains from each of the 2D layers and preserve the existing order for each layer; however, this will lead to long chains that probably aren’t optimal. A better approach is to enable scanning across the layer boundaries, and to add some additional infrastructure that lets us access the existing 2D chains hierarchically. Although this approach makes the parts of the finished product more directly accessible for testing, there must be reasonable certainty that the vertical interconnection infrastructure (vias) will work. Clearly, we can increase certainty by using redundant vias and/or redundant scan paths.
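The trade-off between these two approaches can be quantified with a toy model of scan shift time. The chain lengths, layer count, and pattern count below are invented for illustration; the point is only that serial concatenation multiplies shift time by the number of layers, while hierarchical (parallel) access is bounded by the longest per-layer chain.

```python
# Toy model of scan-test shift time (all numbers are assumptions):
# shifting an N-bit chain takes N cycles per test pattern.
bits_per_layer = 200_000   # assumed scan bits in one layer's chains
layers = 4                 # assumed layers in the stack
patterns = 1_000           # assumed number of test patterns

# Option 1: concatenate the per-layer chains into one serial chain.
serial_cycles = patterns * (bits_per_layer * layers)

# Option 2: hierarchical access lets the tester shift the per-layer
# chains in parallel, so shift time is set by the longest chain.
parallel_cycles = patterns * bits_per_layer

print(f"serial:   {serial_cycles:,} shift cycles")
print(f"parallel: {parallel_cycles:,} shift cycles "
      f"({serial_cycles // parallel_cycles}x faster)")
```

The speedup grows linearly with the number of layers, which is why the hierarchical approach becomes more attractive as stacks get taller, provided the vertical scan interconnect itself (with redundant vias or paths) can be trusted.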
The choice between face-to-face (F2F) and face-to-back (F2B) integration is yet another key consideration in manufacturability and testing. Although the testing and accessibility of F2F structures have numerous advantages, the F2F scheme cannot be continued beyond two layers without significant complications. With more than two layers, then, a face-to-back integration step usually follows the face-to-face integration. Note that, while (perhaps) mechanically simpler, F2F requires a mirrored design on one layer so that it mates with the other layer, a characteristic that might add complexity to the design tools.