积分规则 网站地图 帮助中心    
嵌入式软件 单 片 机 DSP 存储器 传感控制 光电显示
嵌入式硬件 CPLD/FPGA SOPC AD/DA 接口电路 模拟设计
I C设计 通信产品 汽车电子 电源产品 消费电子 数控系统
工业控制 军工/航天 安防产品 医疗电子 计算机外设 测试测量
供应 I C
求购 公司库

  IC 求购 销售 公司 论文 DATASHEET 参考设计 论坛
当前位置: 首页 >> 英文资料
  相关分类:
Designing DDR3 SDRAM controllers with today's FPGAs
 
作者:Adrian Cosoroaba, Xilinx   来源:EETimes    点击数:349   更新时间:2008-4-3
您可以添加到网摘 让更多人关注此文章:

    The recently introduced DDR3 SDRAM technology paves the way to higher data rates (from 800 Mbps to 1600 Mbps) and provides higher performance for many systems that depend on data, video, or packet processing.

Every architectural change for higher performance comes at a price, however, and one aspect is measured in additional man hours of system design time, simulation, and troubleshooting. DDR3 SDRAM is an evolutionary step from DDR2 and provides enhanced features to enable higher data rates. It also maintains enough backward compatibility with DDR2 to provide system designers with the benefit of not having to reinvent the wheel on all aspects of controller and interface design.

In the case of FPGA-based designs, some FPGA vendors have taken on the task of designing a complete controller and physical layer interface. This article outlines the major differences between DDR3 and DDR2 SDRAM architecture, the challenges that come with architectural changes for higher data rates, and also reviews them in the context of a Xilinx Virtex-5 FPGA reference design tested in hardware at 800 Mbps. The reference design is available free for downloading.

DDR3 evolution from DDR2 SDRAM
DDR3 SDRAMs are following the same path that previous generations of DDR SDRAMs have, and are thus slated to provide higher data rates than DDR2 SDRAMs. This new architecture comes with enhancements that help with the design of the physical layer, system, and controller interface. Architectural changes were required to enable external controllers to communicate at these higher rates and the DDR3 memory device to provide I/O throughput up to 1,600 Mbps.

The performance limiter for DRAMs has been – and continues to be – the internal memory array. Increasing external data rates to keep up with performance demands is difficult to scale at the internal memory array level. The DDR2 architecture uses a 4n prefetch scheme that enables four words to be accessed in the internal memory array for every command. This was not sufficient to keep up with higher external data rates, however, and an 8n prefetch was introduced with DDR3.

The trade-off is that for applications that require shorter bursts of four, this longer prefetch mode is a mismatch. A special command will chop the burst of eight in half for those cases when a burst of four is required. The burst of four will have gaps on the data bus between bursts, since the internal memory array access is an eight-word access.

Increasing the burst rate will increase the operating power, but the DDR3 architecture gives designers a break with a lower core and I/O voltage reduction from 1.8V to 1.5V. Additionally, the BGA package has improved, while additional power and ground balls better shield signals at higher data rates and reduce crosstalk.

The higher transition rates for both data and address lines can be a challenge. Improved connectivity options are necessary to give designers more options for matching the driver to PCB impedance and thus minimizing signal reflections. The ZQ pin option with an external resistor provides the control mechanism to calibrate the impedance of the DDR3 drivers. Reducing the reflections improves the data eye and consequently improves timing margins, especially in systems that have multi-load topologies and more predispositions to signal reflections.

Because calibration for improved timing is essential with higher data rates, the DDR3 architecture has introduced two new features. For read calibration during initialization, a multi-purpose register (MPR) provides the means for calibrating the read operation at the controller side, with a data pattern sent out from this register rather than the memory array. This feature can be useful when the memory array is not used to generate the data read pattern.

For applications requiring the use of DDR3 DIMMs, the JEDEC standard prescribes the fly-by topology. This fly-by topology is different than the star or T topology used for DDR2 DIMMs, where the signals are routed to arrive at the same time for all of the memory components on the DIMM. In the case of the fly-by topology, signals are routed to each component in serial-like fashion and flight times to memory devices are skewed. This improves signal integrity from the reduction of stubs but requires additional complexity for the controller. The function of the controller to compensate for the skew between signals to satisfy write timing is called "write leveling."

Table 1 overviews the major changes introduced by the DDR3 architecture from previous-generation DDR2 SDRAMs.



Table 1. Comparison between DDR2 and DDR3 SDRAMs.

For those designers who like to read industry standards, a 188-page description of the DDR3 JEDEC standard is located at www.jedec.org/DOWNLOAD/search/JESD79-3.pdf.

Memory interface and controller design challenges
Designing the memory interface and controller with an FPGA presents a number of challenges: a physical layer interface comprising read and write data capture, controller state machine, bank management, and board design issues related to signal integrity and layout. Let's look at these challenges and how the new DDR3 SDRAM architecture plays a role in making them more – or less – difficult.

Physical layer – read and write data capture
DDR3 SDRAMs were built to enable higher data rates than DDR2, but these rates come with a price. The data valid window (the period during which data can be reliably latched) for reads by the controller and for writes by the memory device is rapidly shrinking, with data rates moving to 800 Mbps and beyond.

The data valid window is shrinking faster than the data period due to timing uncertainties that do not scale well with increasing data rates. These uncertainties are caused by parameter changes such as clock jitter, device process, and system voltage changes. The capture mechanism for read data is particularly challenging because data (DQ) signals are transmitted by the memory device edge-aligned with the clocking strobe (DQS). It is up to the FPGA I/O and logic circuitry to properly center-align them for reliable capture.

The DDR3 data and strobe signaling is very similar to the DDR2 interface for read cycles. The previous DDR2 design methodology can be leveraged if it involves calibration or "read leveling" to correctly align various DQS signals to their respective DQs during system initialization.

Write operations have become more complex for DDR3 DIMM implementations where the fly-by topology is mandated by the JEDEC standard. The burden is on the controller to generate the data strobe (DQS) and data (DQs) signals with the proper delays to compensate for different flight times to each memory device on the DIMM.

If the design involves point-to-point component implementations, the fly-by topology is not needed and signaling for writes is similar to the DDR2 architecture. In either case, the FPGA needs to have programmable I/O delays for DQ and DQS signals to meet the read and write timing requirements.

Controller state machine design
Capturing data for reads and properly sending data out for writes is only one aspect of the controller and interface design. The state machine that manages the physical layer interface and the user or back-end interface that communicates with the rest of the FPGA design can be elaborate and time-consuming to implement from scratch.

The DDR3 SDRAM architecture is different, but maintains a great deal of backward compatibility with previous-generation DDR2, ensuring that some of the DDR2 basic functions implemented in RTL code can be leveraged in the new architecture. One notable aspect is the internal bank structure access algorithm, which can be used identically for both DDR2 and DDR3. Other aspects have changed with the 8n prefetch scheme. Some modifications are required for changes in burst length from four to eight if the previous DDR2 design was only using a burst length of four.

System design issue – signal integrity
Transmission of data through parallel lines from chip-to-chip at high rates has always presented a challenge, both because of cross-talk and ground bounce from multiple signals switching at the same time and from ringing caused by impedance mismatching. With higher data rates and lower voltages, these issues have become very serious. Fortunately, the package and driver design for both controller and memory device can help deal with these challenges.

The DDR3 package has additional power and ground pins for better immunity to crosstalk. Some FPGA vendors have also introduced improved packages with sufficient power and ground balls surrounding the signal I/Os, in addition to power and ground planes and decoupling capacitors inside the BGA package.

The impedance mismatching between chip drivers and the PCB has always caused signal reflections that disturb the valid eye with unwanted ringing. Even though the DDR3 architecture provides the means to control driver impedance (ODT and ZQ pin), IBIS simulations are a necessary design practice to verify FPGA and DDR3 driver calibration and the actual signal behavior with a particular board topology.

Xilinx has alleviated many of these challenges by implementing in hardware a 32-bit-wide DDR3 interface and controller running at 800 Mbps, using the Virtex-5 FPGA on the ML561 evaluation board for memory interfaces.


[1] [2]  下一页



相关文章
· 采用FPGA提高广播应用的集成度[12]
· 基于FPGA的高阶全数字锁相环的设计与实现[35]
· 基于ARM和FPGA的嵌入式超声探伤系统[8]
· 基于ARM和FPGA的嵌入式超声探伤系统[8]
· FPGA电路设计: 如何应对电源相关问题的挑战[6]
热门评论排行
·江苏嵌入式Linux教育培训
·锐极LINUX驱动培训班定于
·VHDL设计中电路简化问题的
·基于实时操作系统μC/OS-
·TMS320VC5402&n

文章评论
    没有任何评论
*只显示最新10条评论。评论内容只代表网友观点,与本站立场无关。更多评论
发表评论
  * 请先[登陆]再进行评论,谢谢。
评分: 1分 2分 3分 4分 5分
内容: *
发帖须知:
一.所发文章必须遵守《互联网电子公告服务管理规定》;
二.严禁发布供求代理信息,公司介绍,产品信息等广告宣传信息;
三.严禁恶意重复发帖;
四.严禁对个人,实体,民族,国家等进行漫骂,污蔑,诽谤。
 
热点新闻 [更多]
 
印度电子工程专业的学生提
抗震救灾 电子
汶川地震牵动人心&nbs
盖茨演示超大触摸墙计算机
北京奥运门票首次内嵌RF
地震殃及IT企业&nbs
Vishay 
中国已成全球最大机顶盒生
电子纸集成芯片问世&nb
NAND Fl
 
热门下载 [更多]
 
[ ] 手把手学单片机20个例
[ ] 单片机做的智能台灯
[ ] 单片机入门书
[ ] linux系统移植开发文档
[ ] IC卡的读写程序
[ ] 8051单片机C语言彻底应用
[ 常用软件] 555定时器电路设计软件V1.2
[ 常用软件] 51定时器计算软件
[ ] ARM处理器应用开发4步骤
[ ] 实用电源电路集锦
 
论坛新帖 [更多]
 
DSP与普通MCU的区别...
如何选择DSP的电源芯片?...
如何选择外部时钟?...
系统学习Linux的11点建议...
安防...
新型高智能门禁系统...
什么是嵌入式系统...
晶振不起振什么原因啊...
FPGA电源管理问答...
[求助]外部数据存储器,供个芯...

 
赞助商 [更多]
 


 

ICP许可证号:[粤 05056597]
联系电话:010-82517432 82517615 传真: 010-82517615

版权所有 Copyright © 2006 嵌入式技术网