( ESNUG 532 Item 3 ) -------------------------------------------- [09/06/13]

From: [ Frank Schirrmeister of Cadence ]
Subject: CDNS says Hogan missed close to 10 emulation customer use models

Hi, John,

In ESNUG 522 #2, Jim Hogan talks about simulation acceleration.  He points
out the key challenge correctly in acceleration performance is determined by

   1) the percentage of the simulation that is left running in software,
   2) the number of I/O signals communicating between the PC/workstation
      and the hardware engine,
   3) communication channel latency and bandwidth and
   4) the amount of visibility enabled for the hardware being accelerated.  

However, later in the table Jim then only mentions GPU acceleration by way
of Rocketick.  One main use model of Palladium emulation is "Verification
Acceleration", in which the hardware DUT executes in the HW accelerator and
a portion of the design, typically its test bench, is kept in SW simulation.

Good examples for this use model are PMC Sierra at CDNLive 2012 achieving
40x speedup [Ref 6], Broadcom at DVCON 2011 achieving 230x speedup [Ref 7]
and Samsung at CDNLive 2013 achieving 410x speedup [Ref 8] -- all compared
to pure RTL simulation.  In comparison, Rocketick's GPU acceleration was
featured in ESNUG 523 #1 as getting up to 23x speedup.

To better distinguish acceleration using GPUs from acceleration using
Emulation connected to RTL SW simulation, I suggest separating simulation
acceleration and verification acceleration in Jim's table as follows, the
bold red elements are my updates.
 Computational Element Granularity (# of comp elements) Speed per comp element Cycles per Sec (100 M gates) Vendors
SW Simulation X86 cores under 16 3 GHz under 1 Cadence Incisive/NC-Sim, Synopsys VCS, Mentor Questa
Simulation accelerationGPU processing elements 100's 1 GHz 10 to 1,000
NVDIA: 17x
Rocketick
Verification AccelerationMix of Host and Emulator millions under 1 GHz 40 to 10,000 Cadence Palladium XP + RTL Sim

Mentor Veloce + RTL Sim

Synopsys Zebu Server + RTL Sim
Processor
-based emulation
custom processors It is actually 100s of 1000s to millions under 1 GHz 100 K to 2M, processor based scales better with design size than FPGA Cadence Palladium
FPGA
-based emulation
FPGA gates millions 1 MHz - 100 MHz 500 K - 2 M, does not scale well with design size, so at 100MG to reach the M range is very unlikely. Debug causes further slow down. Mentor Veloce, Synopsys EVE-Zebu
FPGA prototyping

 

FPGA gates millions 1 MHz - 100 MHz 2M - 50 M, sometimes up to 100M Synopsys HAPS, Cadence RPP, DINI, Aldec, S2C, HOENS, Hitech Global, ProDesign
    Table 2: Various Types of HW Assisted Verification

There are three main customer use models in In-Circuit Emulation (ICE) and
Verification Acceleration: 

  1) functional verification
  2) regression runs and 
  3) software development/validation for drivers, operating
     system (OS) bring-up and middleware.

Besides these, there are many other customer use models like: performance
validation, post-silicon debug, test pattern preparation, failure analysis
and reproduction, and virtual silicon support making the "chip-to be"
available to customers for early access.

You can check out these use models at a recent case study: "Using Cadence
Palladium for SoC Performance Validation and Analysis" that Freescale gave
at CDNLive 2013 and later in similar form at DAC [Ref 9].

Two use models very unique to Palladium XP:

   - Gate-Level Acceleration -- something FPGA-based emulators cannot
     support due to the explosion in complexity in re-mapping the
     target technology to FPGA gates, and

   - Dynamic Low Power analysis. 

For the latter your readers can check out last year's presentation "NS115
System Emulation Based on Palladium XP" given by Nufront [Ref 4] that won
the pest paper award at CDNLive Shangai last year.  They booted Android,
achieved 1000x performance improvement over RTL simulation and correlated
power consumption using realistic runtime environments and applications
before silicon was available. 

For your reader's convience, in Item 5, I've assembled a list of use models
together with sources and results users achieved with Palladium.

    - Frank Schirrmeister
      Cadence Design Systems, Inc.               San Jose, CA

         ----    ----    ----    ----    ----    ----   ----

Related Articles

  CDNS says Hogan missed granularity, user access, speed, capacity
  CDNS says Hogan missed FPGA compile time, Rent's Rule, probing
  CDNS says Hogan missed 5 metrics/gotchas for picking an emulator
  CDNS says Hogan missed 47 Palladium user papers on Cadence.com

Join    Index    Next->Item






   
 Sign up for the DeepChip newsletter.
Email
 Read what EDA tool users really think.






Feedback About Wiretaps ESNUGs SIGN UP! Downloads Trip Reports Advertise

"Relax. This is a discussion. Anything said here is just one engineer's opinion. Email in your dissenting letter and it'll be published, too."
This Web Site Is Modified Every 2-3 Days
Copyright 1991-2025 John Cooley.  All Rights Reserved.
| Contact John Cooley | Webmaster | Legal | Feedback Form |

   !!!     "It's not a BUG,
  /o o\  /  it's a FEATURE!"
 (  >  )
  \ - / 
  _] [_     (jcooley 1991)