Print This Page
Email This Page
 
IXA University Program Newsletter

IXA University Program Quarterly Newsletter

IXA Education Summit 2004, September 22-23

The Intel® IXA University team routinely gathers professors, students and Intel specialists once a year to review the work done by the program schools as well review the latest news from Intel. This year we held our conference in Hudson Massachusetts.  We brought in Doug Davis and Matt Adiletta to deliver keynote speeches. Doug is General Manager of the CIG Infrastructure Processor Division and has been a strong supporter of the IXA University Program. Matt is an Intel Fellow and Director of Communication Processor Architecture in the Intel Communications Group, based in Massachusetts. Below is a brief overview of the Conference. We have also attached links to the relevant presentations made.


Opening Remarks: Paul Posco - IXA University Program Manager

This year we had an exciting agenda. Paul Posco, the IXA University Program manager, started off the day providing an overview of the two day session and opening remarks.  During the introductions, participants described their areas of research. Participants hailed from Australia, Canada, India, Switzerland, Ireland, England, China, Japan, Malaysia, and the United States.


Keynote Speaker: Doug Davis - IPD GM and Intel VP

Doug gave a keynote presentation that covered innovation, the future of network processors, the importance of Intel/University relationship to innovation, and the future direction of the IXA University program. Participants were happy to know that Intel will continue supporting this program and value the work of the universities.

 

Doug's presentation will be available in the near future.


Keynote Speaker: Matt Adiletta- Director of Communication Processor Architecture

Matt gave a compelling and interesting talk highlighting the future direction of Intel’s network processors as well as reviewing the IXP2XX architecture and proposing interesting research ideas. The title of the presentation was "A Chat With Matt".  As the title suggests the talk was interactive as Matt fielded many questions from the audience. 

Due to the proprietary nature of this presentation, we are unable to offer it for public viewing now.


IXP2xxx Silicon Architecture Technical Overview - John Morgan, Intel Corporation

.

John provided an enlightening presentation of the IXP2xxx hardware architecture. His talk included overviews of the IXP2400, the IXP2800, differences between the two processors, and a detailed description of the IXP2XXX components.

 

John's presentation is available here.


IXA Software Portability Framework Technical Overview - Ihab Bishara

To complement IXP2XXX hardware discussion, Ihab presented a technical review of the IXA Software Framework.  Included in the discussion: Microblock programming model and components, architecture, Microblock types and structure, Microblock I/O, the Dispatch Loop Architecture, and an example Microblock application.

Ihab's presentation is available here.


Research Challenges in Network Measurement - Prof. Jim Kurose, University of Massachusetts Amherst

.

Professor Kurose analyzed the research challenges in network measurement.  The presentation covered the reasons for monitoring/measuring, what should be measured and concluded with a number of interesting research activities that are ongoing.

 

Jim's presentation is available here.


An IXA-based Network Measurement Node - Prof. Tilman Wolf, University of Massachusetts Amherst

As a complement to Professor Kurose's talk, Professor Wolf presented his research in passive network measurement employing the IXP2400. The node captures the packet, performs anonymization of IP addresses, and collects statistics. Tilman describes the anonymization process in great detail.  He concluded the presentation with the results they have achieved, most notably a working solution on the IXP2400 hardware.

Tilman's presentation is available here.


Research Experiences with the IXP Processor - Prof. Patrick Crowley, Washington University in St. Louis

.

Professor Crowley presented a combination of teaching experiences and research at Washington University. Specifically he covered two of the research projects that are under way. In the first project, Segmented hash tables, he discussed motivation, basic idea, design, analysis, and implementation of such on the IXP platform. The second project area, an innovative use of the IXP network processor, was the development of Bioinformatics algorithms on the network processor, taking advantages of the parallel processing of the IXP platform. 

 

Patrick's presentation is available here.


NePSim: A Network Processor Simulator with Power Evaluation Framework - Prof. Laxmi Bhuyan, University of California Riverside

Professor Bhuyan treated us to a description of their open-source IXP1200 simulator.  He addressed project goals, the architecture and internals, power efficiency, and the effort taken in validating the simulator. They are currently working on extending NePSim to the IXP2400/2800.

Laxmi's presentation is available here.


Supporting Enterprise Applications with Attached Network Processors - Ada Gavrilovska, Georgia Institute of Technology

.

Ada presented the IXA related research areas that are ongoing at Georgia Tech.  Most of the presentation covered application-level services, her area of expertise. She covered the motivation, service requirements, current research approaches, and then the specific approach they are pursuing. 

 

Ada's presentation is available here.


Data Transposition of the Kasumi Block Cipher on the IXP Processor - Berk Sunar, Worcester Polytechnic Institute

Professor Sunar described his research efforts with the Kasumi Block Cipher on the IXP network processor. He described the Kasumi Block Cipher, the implementation approach they have taken, and the advantages and disadvantage of their implementation.  He concluded the presentation with a brief overview of other ongoing research.

Berk's presentation is available here.


Distributed Lock Manager on a Network Processor - Jie Lu, University of Massachusetts Lowell

.

 

Jie Lu, a PhD student of Professor Wang, presented the research of implementing a distributed lock manager on a network processor.  Jie covers the motivation, background, and the design of the distributed lock manager. 

 

Jie's presentation is available here.


Stareast Platform Overview, Xingang Guo, Intel Corporation

.

Xingang gave an informative presentation on Stareast, an IXP425 development platform.  He described the value of the IXP425 in wireless networking research. He went on to discuss the design requirements of the platform, its features, architecture, and the software stack included with the Stareast platform.

 

Xingang's presentation is available here.


Design of a QoS Gateway, Prof. Kang Shin, University of Michigan

Professor Shin presented the research underway on a QoS gateway. He discussed the motivation for using the IXP platform, the basic Agent/Manager architecture of the QoS gateway, design considerations,  and described the QoS Gateway solution implemented on the IXP platform.

Kang's presentation is available here.

 


Wireless Media Transport and Cross-Layer QoS, David Romano, Intel Corporation

.

David presented an entertaining and informative presentation on Wireless Media Transport. He began the presentation with a wireless television playing Star Wars to demonstrate current wireless home entertainment products. He explained wireless LANs and the technical barriers associated with them in the context of the digital home. He concluded the talk with the current methodologies they are pursuing in their research.

 

David's presentation is available here.


Panel Discussion - IXA Curriculum Programs

We originally slated a panel discussion on IXA curriculum programs. The purpose was to discuss experiences, learnings, best practices and challenges in employing NPUs in the classroom. Due to time constraints we were unable to include this discussion. We are very interested in this topic though. Please share your IXP related teaching experiences with us.  What worked?  What didn't work?  What can the IXA University Program do to assist? Send feedback to admin@ixaedu.com

 

Included below are the presentations from Professor Steenkiste from Carnegie Mellon University and Professor Kalyanaraman from Rensselaer Polytechnic Institute.

 

Peter's presentation is available here.

Shivkumar's presentation is available here.


In addition to the presentations, we allotted a significant amount of time both days of the conference for Q&A with IXP experts from Intel.   Many questions were asked relating to the program and the development tools. Special thanks to the Intel engineers who were present to field questions from the audience.

 

We look forward to seeing you again next year!


Universities receive Intel grants during Q4, 04

Congratulations to the following Universities who had research proposals approved during the fourth quarter of 2004.

  • Carnegie Mellon University - research grant - Anastassia Ailamaki

  • University of Genoa - research grant - Raffaele Bolla

  • Indian Institute of Science - research grant - HS Jamadagni

  • Multimedia University - research grant - G.S.V. Radha Krishna Rao

  • Royal Institute of Technology (KTH) - research grant - Peter Sjödin

  • Swiss Federal Institute of Technology - research grant - Lukas Ruf

  • Tsinghua University - research grant - Feng-Yuan Ren

  • Tsinghua University - research grant - Jun Li

  • University of Cape Town - research grant - Neco Ventura

  • University of Massachusetts Amherst - research grant - Tilman Wolf

  • University of Michigan - research grant - Kang Shin

  • North Dakota State University - research grant - Raj Katti

  • OGI School of Science and Engineering - research grant - Wu-Chi Cheng

  • Washington University in St. Louis - research grant - Patrick Crowley

  • Georgia Institute of Technology - research grant - Karsten Schwan

  • New York University - equipment grant - Vijay Karamcheti

  • University of KwaZulu-Natal - equipment grant - A.L.L. Jarvis


 

IXA UNIVERSITY GLOBAL UPDATES

Paul Posco Spreads IXA Message in India and Malaysia

Paul Posco, Program Manager for the IXA University Program, recently visited India and Malaysia to facilitate the growth of the program in these geographies. Primary objectives of the trip included: discussing IXA University Program strategy with local Intel sponsors; meet with the university professors gathering information regarding their curriculum, research, needs and concerns; visit newly funded universities, India Institute of Science Bangalore and India Institute of Technology Chennai . In addition, Paul met with Professor Ryoichi Komiya from Multimedia University and Professor Madya Othman Sidek from Universiti Sains Malaysia.


 

TECH WATCH

NePSim - A Network Processor Simulator with Power Evaluation Framework by Yan Luo, Jun Yang, Laxmi N. Bhuyan and Li Zhao, University of California Riverside

    The NP research community needs an integrated infrastructure that provides the maximum flexibility in simulating different architecture extensions of a network processor and provides detailed power dissipation analysis accurately and efficiently. We present NePSim, an integrated system that includes a cycle-accurate architecture simulator, an automatic formal verification engine, and a parameterizable power estimator of a network processor that consists of clusters of multi-threaded execution cores, memory controllers, input/output ports, packet buffers and high speed buses. The simulator supports the IXP1200 microengine ISA and performs an execution-driven simulation that reproduces IXP's internal operation. A comprehensive set of statistics is collected and reported at the end of execution to facilitate performance analysis. The verification engine, called iveri, can validate the simulation from the execution log traces using user-defined constraints in a verification language. The power estimator combines a suite of models (XCacti, Wattch, and Orion) for dynamic power measurements and calculates results based on per-cycle resource usage counts.

    We ported four representative NP applications to conduct the experiments: IP forwarding (ipfwdr), URL routing (url), MD4 and network address translation (nat). To verify NePSim against IXP1200, we ran four benchmarks and measured the performances in terms of throughput (Mbps, megabits per second) and average packet processing time (cycles). We observed an average error of %1 in throughput and 6% in average processing time across the four benchmarks compared to the SDK simulator supplied by Intel.

    With NePSim, we conduct a set of experiments as follows:

  • We study the impact of having more number of MEs on the total packet throughput measured in Mbps (megabits per second) and the ME idle time due to memory contention. Intuitively, the throughput of a benchmark should increase with the number of MEs or threads. However, for memory intensive benchmarks (url and md4), increasing the number of threads means increasing the memory contention since the memories are shared among all threads. When a program is memory bound, it often happens that all the threads in an ME are swapped out of pipeline waiting for their memory references, resulting an idle ME. The abundant core idle time gives us power-saving opportunities.

  • We develop a model for power consumption of the IXP1200 and the MEs. In particular, we identify those components where bulk of the power is spent. It is interesting to see from the result that the ALU takes most power (45% on average), followed by the control store (28%) where the program is stored. This is because the control store is accessed almost every cycle.  The third power hungry component is the general purpose register (GPR) files (13%), where instruction operands and results are stored. Even when data are loaded from the memory the transfer files, they are moved again to GPRs for ALU operations.

  • We apply Dynamic Voltage Scaling (DVS) technique to reduce ME power consumption. The average ME idle time is shown to be abundant (one average 10-23%) since most of the benchmarks are memory bound. Applying DVS while MEs are not very active can reduce the total power consumption substantially. Our scheme observes the ME idle time periodically. Once the percentage of the idle time in the past period is over a threshold, we scale down the voltage and frequency (VF in short) by one step unless the minimum allowable VF is hit. If the percentage is below threshold, indicating that the ME in the past period is relatively busy, we scale up the VF by one step unless they are at maximum allowable values. Overall, we achieve maximum 17% of power savings of the NP over four network applications with less than 6% performance loss.

    NePSim has drawn a lot of attention from both the academic community and the industry. Since July 2004 to the date of this paper, NePSim has over 100 downloads worldwide. We welcome feedback from users, and we are currently extending NePSim to simulate IXP2400/2800 network processors.  The source code of the complete simulator can be downloaded from http://www.cs.ucr.edu/~yluo/nepsim/.

 

References

[1] Yan Luo, Jun Yang, Laxmi N. Bhuyan, Li Zhao, "NePSim: A Network Processor Simulator with Power Evaluation Framework," IEEE MICRO Special Issue on Network Processors for Future High-End Systems and Applications, Sept/Oct 2004.

[2] Xi Chen, Yan Luo, Harry Hsieh, Laxmi Bhuyan, F. Balarin, "Utilizing Formal Assertions for System Design of Network Processors, " Design Forum, Design Automation and Test in Europe (DATE), 2004.

 

 

PRODUCT SPOTLIGHT

Intel Expands Network Processor Line for Communications and Embedded Networking Applications

    With a goal of boosting already robust network processor market segment sales, Intel Corporation announced today two new families of network processor units (NPUs) for traditional communications applications and for the emerging embedded networking segment.

    Network processors in traditional communications systems help to power many access and edge networking applications such as DSLAMs, wireless access switches and enterprise router line cards. Now, Intel’s NPUs are also being used in embedded networking applications in industrial automation environments -- instead of custom components. The benefit to equipment manufacturers is a faster time-to-market with customizable components.

    “The performance and flexibility of our network processor technology has become very attractive to new networking segments such as automation and control,” said Doug Davis, vice president and general manager of the Infrastructure Processor Division, Intel Communications Group. “Network processors are no longer just about relentless pursuit of line speed; they now need to combine more features and flexibility that allows customers to customize designs for their individual specifications.”

   The Intel® IXP460 and Intel® IXP465 network processors are the latest additions to the Intel IXP4XX product line and offer a higher speed Intel XScale® core, expanded connectivity options as well as enhancements to improve end system reliability and security.

   These features are what helped attract such customers as Rockwell Automation to use Intel’s network processor technology. “Rockwell Automation has chosen to develop its next- generation products using Intel network processors instead of custom ASICs and other technologies. This allows our engineers to focus on our value-add,” said Scot Tutkovics, software engineering manager, Rockwell Automation. “The IXP465 meets our demanding design requirements including low power consumption, a high degree of reliability, built-in Ethernet, USB, real-time synchronization and other networking capabilities.” HP is also planning to use the IXP46X in a future line of high-end printers because of the scalable processing performance and on-chip integration of a wide variety of functions and interfaces.

    The Intel® IXP2325 and Intel® IXP2350 network processors targeted for network access and edge applications, combine data plane and control plane processing capabilities in a single chip and are Intel’s first network processors built on 90nm process technology.

    These NPUs deliver up to 2-Gbps line rates, while offering developers significant savings in part count, power consumption and board area. The IXP23XX network processors use the same hardware and software architecture as the rest of the IXP2XXX product line. "Using the scalable Intel network processor architecture for several of our product lines provides substantial savings of time and effort in both hardware and software design," said Youngky Kim, senior vice president, Samsung Telecommunication Systems Division. "The performance and features of the IXP2350 allow us to apply these advantages to our next-generation wireless system design."

    Intel and members of the Intel® Communications Alliance also announced development tools, hardware platforms, software building blocks, and application-specific software solutions to support the IXP23XX and IXP46X product lines.

 

For more information regarding the IXP23XX product line visit: http://www.intel.com/design/network/products/npfamily/ixp2350.htm

 

Noteworthy

Wind River's University Program

Embedded technology is becoming more popular as a tool for applied research and an area of specialized study at colleges and universities. Through Wind River's University Program, students and faculty at colleges and universities throughout the world are able to use Wind River's state-of-the-art, real-time, embedded development software for curricula and research projects. Colleges and universities, worldwide, may apply through the University Program to receive software donations of Wind River technology. Universities are not charged for donated software. For more information regarding the program visit: http://www.windriver.com/universities/


IXA Software Update Coming Soon

In early November, IXA University participants will be receiving a software update. The software update will include:

  • Intel IXA SDK Tools 4.0

  • Intel IXA SDK Software Framework 4.0

  • RadiSys ENP-2611 SDK 4.0 for Linux

  • RadiSys ENP-2611 SDK 4.0 for VxWorks

Your comments and suggestions are encouraged to ensure that this Newsletter contains appropriate, informative and accurate information. If you have published papers, news items, events, etc. that you would like to see added to future editions of the newsletter send us feedback.

Recent Published Papers

Intel Whitepapers
Advanced Software Development Tools for Intel® IXP2XXX Network Processors - Maximizing productivity and efficiency at every stage in the development process
Intel IXP2XXX network processors provide robust performance to support value-added network services at line rate, while offering the design flexibility to meet the application requirements of individual projects. Built on the fully programmable Intel IXA architecture, these network processors afford network equipment manufacturers an unparalleled opportunity to build high-quality, high-performance, and market-differentiated products. By providing a modular software framework—the Intel IXA Portability Framework—Intel enables developers to easily take full advantage of the performance and flexibility of Intel network processors (NPUs) to create unique applications and bring them to market quickly. The Intel IXA Portability Framework facilitates reuse and portability of application code, enables optimal application partitioning across processing elements, and simplifies integration of customer-developed or third-party software modules. To accelerate the application design cycle, Intel also offers a suite of advanced software tools that maximize developer productivity and efficiency at every stage in the development process. As a result, network equipment manufacturers can gain a valuable competitive advantage when using Intel NPUs: high-performance processing, flexibility to differentiate products, and an easy-to-use development environment to reduce time-to-market. In this white paper, we will examine the newest tools available from Intel that simplify three key areas of application design: mapping application requirements to processing resources, generating code with suitable performance, and testing/ debugging units. Valuable efficiency gains can be made in these stages of design by simplifying and automating tasks, thus making development faster and easier while maximizing the advantages of fully programmable Intel network processors.

Intel® IXP2400 Network Processor Low Power
This application note provides power-reduction techniques for the Intel® IXP2400 Network Processor in applications such as OC-3 and OC-12 line cards, and shows how effective these techniques can be by presenting an example system configuration; this example shows that the total power consumption, including the IXP2400 and its memory subsystem, can be nearly 10W.

Intel® IXP42X Product Line of Network Processors and IXC1100 Control Plane Processor Customizing RedBoot
The bootloader is a fundamental software component for most computer systems; it provides the initialization sequence for the processor and hardware components so that the system is operational. Primarily intended to boot Linux, RedHat* RedBoot* is the bootloader provided for the Intel® IXDP425 / IXCDP1100 Development Platform. RedBoot is provided in both binary and source form, and is developed and maintained by RedHat; for further information regarding RedBoot, see http://sources.redhat.com/redboot/. Note that potential issues arise when product development shifts from the IXDP425 / IXCDP1100 platform to a prototype/custom baseboard. This shift will likely involve modification of the RedBoot source code to support the custom board design. So a fundamental question is: “What does RedBoot need to do (or be modified to do) in order to support my custom board and the product that I am intending to ship?” In many cases, the default bootloader source configuration is acceptable; there may be no need to modify RedBoot. But, the default configuration may not be acceptable for your final product. This application note is intended to provide guidance on issues that may arise when customizing (aka “productizing”) Redboot. Main topics covered in this document include: • What is customizing? • Getting started • Installing and building RedBoot • A tour of the RedBoot source tree: — What needs to be modified, reconfigured, added, and removed when moving to a custom board — Where are modifications required? • Typical customizing options — How-to’s

Intel® IXP42X Product Line of Network Processors and IXC1100 Control Plane Processor Performance Tuning
This document summarizes a number of performance-enhancement techniques. Some of the techniques are generally applicable and some are specific to networking applications, the Intel XScale® Core, or the Intel® IXP42X Product Line of Network Processors and IXC1100 Control Plane Processor. The techniques suggested in this document are suggested solutions to the problems proposed and are provided for informational purposes only. There can be no guarantee that these proposed solutions will be applicable to your application or that they will resolve the problems in all instances. The performance tests and ratings mentioned in this document are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may affect actual performance. Buyers should consult other sources of information to evaluate the performance of systems or components they are considering purchasing. Most of these techniques fall into one of four categories: • Determining the cause of a performance problem • Identifying the solution for a problem • Information, tasks and ways of organizing performance work • Organizing information, tasks, and performance work Intel has published a significant amount of information on performance optimization for the processors based on the Intel XScale core. This information is available in many of the documents listed in “Related Documents” on page 8. This document summarizes and collects in one location some of the information in these different references.

 

IXA University papers


A Scalable, Cache-Based Queue Management Subsystem for Network Processors - Sailesh Kumar, Patrick Crowley, Washington University in St. Louis.
Queues are a fundamental data structure in packet processing systems. In this short paper, we propose and discuss a scalable queue management (QM) building block for network
processors (NPs). We make two main contributions: 1) we argue
qualitatively and quantitatively that caching can be used to
improve both best- and worst-case queuing performance, and 2)
we describe and discuss our proposal and show that it avoids the failings of existing approaches. We show that our cache-based approach improves worst-case queue operation throughput by a factor of 4 as compared to a cache-less system. We also argue that our approach is more scalable and more efficient than the cache-based mechanism used in Intel’s second-generation network processors.


Performance Analysis of Multi-dimensional Packet Classification on Programmable Network Processors - Deepa Srinivasan, IBM Corporation, Wu-cheng Feng, Portland State University
Multi-field packet classification is frequently performed by network devices such as edge routers and
firewalls – such devices can utilize programmable network processors to perform this compute-intensive task at nearly line speeds. The architectures of programmable network processors are typically highly parallel and a single algorithm can be mapped in different ways onto the hardware. In this paper, we study
the performance of two different design mappings of the Bit Vector packet classification algorithm on the Intel® IXP1200 network processor. We show that: (i)
Overall, the parallel mapping has better packet processing rate (25% more) than the pipelined mapping; (ii) In the parallel mapping, a processing element’s utilization can be considerably affected by code complexity,
in terms of branching, because of significant time wasted (as much as 40% more) due to aborting
instruction execution pipelines; (iii) In the pipelined mapping, multiple memory reads per packet can lower the overall performance.

The Execution of Event-Action Rules on Programmable Network Processors - Ada Gavrilovska, Sanjay Kumar, Karsten Schwan, Georgia Institute of Technology
This paper evaluates the ability of programmable network
processors (NPs) to perform application-specific processing
that is structured as sets of interdependent event action
rules sharing joint state. Our intent is twofold: (1) to assess the ability of NPs to deal with the complex application-specific code and state maintained by event action rules like those used in business processes also
termed (termed business rules), and (2) to create system solutions that permit developers to dynamically map such application-level service code to appropriate sites in target
distributed platforms comprised of hosts, network processors, and embedded server systems. A specific outcome of our work presented in this paper is the creation of a simple, efficient dynamically reconfigurable
rule engine for a network processor, able to execute rules at the Gigabit speeds required by the network links attached to it. Business rules like those found in the Operational Information Systems used by companies like Delta Air Lines are used to demonstrate rule engine  capabilities and overheads. A second outcome of our work is its demonstration of the flexibility and ease of reconfiguration associated with a network processor-resident rule engine, where rules can be added and removed whenever appropriate (i.e., hot-swapping) without compromising the processor’s ability to Large-Scale Network Simulation: How Big? How Fast?
maintain high performance for its ongoing processing
tasks.

Offloading Multimedia Proxies using Network Processors - Øyvind Hvamstad, University of Oslo
A Multimedia Proxy aims to reduce the client startup latency, network load and server load. Such a proxy may be subject to many concurrent clients and experience high processing loads due to for example transcoding or
protocol translation. At the same time a high load can be experienced when fetching data from the server. In this thesis, we will explore how to offload a multimedia streaming proxy by using network processing technology.
We design, implement and evaluate a proxy prototype on the IXP1200 network processor. As a proof of-concept we show that the prototype successfully offloads the proxy host in the data-plane, i.e., no data packets are processed by the host CPU, leaving it free to perform other CPU intensive tasks. The prototype is able to do application layer forwarding using approximately a tenth of the cycles compared to a traditional architecture, where all packets are processed by the host CPU.

 

 

 

 

 Send Feedback