comp.arch.fpga | Estimating ROM gate count in ASIC| page 2

Reply by Kevin Neilson ●January 1, 20192019-01-01

On Sunday, December 30, 2018 at 2:37:46 PM UTC-7, gnuarm.del...@gmail.com wrote:
> On Sunday, December 30, 2018 at 2:52:43 PM UTC-5, Kevin Neilson wrote:
> > On Saturday, December 29, 2018 at 9:53:20 AM UTC-7, gnuarm.del...@gmail.com wrote:
> > > On Friday, December 28, 2018 at 2:49:37 PM UTC-5, Kevin Neilson wrote:
> > > > On Saturday, December 22, 2018 at 12:14:43 PM UTC-7, gnuarm.del...@gmail.com wrote:
> > > > > On Thursday, December 6, 2018 at 6:02:25 PM UTC-5, Kevin Neilson wrote:
> > > > > > I've searched for this but to no avail.  I'd like a function f(D,W), where D=depth and W=width, which provides an estimate of the gate count of a lookup ROM implemented in ASIC gates.
> > > > > > 
> > > > > > Yes, I know it's dependent on the contents.  However, if half the bits are ones and the contents are randomly distributed, a formula should be pretty accurate.
> > > > > > 
> > > > > > It's easy for me to figure out an upper limit.  A basic ROM is an AND-OR array.  The D address decoders (comprising ANDs/NOTs) can be shared amongst the W columns.  Each of the W columns would require D/2-1 OR gates if half the ROM bits in each column are 1.
> > > > > > 
> > > > > > What I don't know is how many gates can be eliminated by sharing terms.  As W increases, term sharing should go up.  Again, I'm looking for a *formula*.
> > > > > 
> > > > > That would be pretty easy.  Consider the costs of a D wide multiplexer with 1 or 0 on each input.  That would be an upper bound in any case.  
> > > > > 
> > > > > I believe my text book of many years ago used one of the input variables in either true or inverted form combined with 1s and 0s as choices for inputs which simplified the mux by one address input.  
> > > > > 
> > > > >   Rick C. 
> > > > > 
> > > > >   Tesla referral code - https://ts.la/richard11209
> > > > >   Get 6 months of free supercharging
> > > > 
> > > > Thanks, but I am looking for an accurate estimate.
> > > 
> > > I'm confused.  Do you want an accurate measurement or an estimate?  
> > > 
> > >   Rick C.
> > > 
> > >   - Get 6 months of free supercharging 
> > >   - Tesla referral code - https://ts.la/richard11209
> > 
> > Both.  An estimate will be very close for large D,W.  If I roll a die 1e6 times, estimating there will be 5e5 heads is pretty accurate.  Estimating there will be less than or equal to the upper bound of 1e6 heads is correct but not helpful.
> 
> Good thing you aren't rolling a die. 
> 
> How much do you think this upper bound will vary from your estimate?  Have you tried any tests?  What is the result of your "estimate" under-estimating? 
> 
> I think if I were doing this I'd try to get a handle on the expected results and how much they might vary before I start looking for an equation to "estimate" the number.  The one thing you didn't do with the die example is to figure out how much you need to adjust your estimate to get a bound that will include 99.9xxx% of your cases or whatever value you need.  Do you know that equation? 
> 
> 
>   Rick C.
> 
>   + Get 6 months of free supercharging
>   + Tesla referral code - https://ts.la/richard11209

Sorry, my response could've been more nicely expressed.  I originally though that ASIC guys must have some formula for this but perhaps not--maybe they just run it through the synthesizer and check.  I'm writing ASIC code but don't have direct access to the synthesizer.  If I did I could maybe plot a few points and fit a curve to it.  I only have access to FPGA synthesizers and of course they just implement ROMs in LUTs and the LUT count is proportional to the number of bits in the ROM and there is no logic sharing as the ROM size increases.

I did think about the die/coin example and how to find how good the estimate is.  For example, say you flip a coin 1000 times.  My expected number of heads is is 500.  Say you want to know how many runs will have between 450 and 550 heads.  You can use the Poisson CDF.  In Matlab/Octave:

  octave:32> n_flips=1000; exp_heads = n_flips*0.5; poisscdf(550,exp_heads)-poisscdf(450,exp_heads)
  ans =  0.97469

So I'll be in that range 97.5% of the time.

Reply by ●January 1, 20192019-01-01

On Tuesday, January 1, 2019 at 6:55:57 PM UTC-5, Kevin Neilson wrote:
> On Sunday, December 30, 2018 at 2:37:46 PM UTC-7, gnuarm.del...@gmail.com wrote:
> > On Sunday, December 30, 2018 at 2:52:43 PM UTC-5, Kevin Neilson wrote:
> > > On Saturday, December 29, 2018 at 9:53:20 AM UTC-7, gnuarm.del...@gmail.com wrote:
> > > > On Friday, December 28, 2018 at 2:49:37 PM UTC-5, Kevin Neilson wrote:
> > > > > On Saturday, December 22, 2018 at 12:14:43 PM UTC-7, gnuarm.del...@gmail.com wrote:
> > > > > > On Thursday, December 6, 2018 at 6:02:25 PM UTC-5, Kevin Neilson wrote:
> > > > > > > I've searched for this but to no avail.  I'd like a function f(D,W), where D=depth and W=width, which provides an estimate of the gate count of a lookup ROM implemented in ASIC gates.
> > > > > > > 
> > > > > > > Yes, I know it's dependent on the contents.  However, if half the bits are ones and the contents are randomly distributed, a formula should be pretty accurate.
> > > > > > > 
> > > > > > > It's easy for me to figure out an upper limit.  A basic ROM is an AND-OR array.  The D address decoders (comprising ANDs/NOTs) can be shared amongst the W columns.  Each of the W columns would require D/2-1 OR gates if half the ROM bits in each column are 1.
> > > > > > > 
> > > > > > > What I don't know is how many gates can be eliminated by sharing terms.  As W increases, term sharing should go up.  Again, I'm looking for a *formula*.
> > > > > > 
> > > > > > That would be pretty easy.  Consider the costs of a D wide multiplexer with 1 or 0 on each input.  That would be an upper bound in any case.  
> > > > > > 
> > > > > > I believe my text book of many years ago used one of the input variables in either true or inverted form combined with 1s and 0s as choices for inputs which simplified the mux by one address input.  
> > > > > > 
> > > > > >   Rick C. 
> > > > > > 
> > > > > >   Tesla referral code - https://ts.la/richard11209
> > > > > >   Get 6 months of free supercharging
> > > > > 
> > > > > Thanks, but I am looking for an accurate estimate.
> > > > 
> > > > I'm confused.  Do you want an accurate measurement or an estimate?  
> > > > 
> > > >   Rick C.
> > > > 
> > > >   - Get 6 months of free supercharging 
> > > >   - Tesla referral code - https://ts.la/richard11209
> > > 
> > > Both.  An estimate will be very close for large D,W.  If I roll a die 1e6 times, estimating there will be 5e5 heads is pretty accurate.  Estimating there will be less than or equal to the upper bound of 1e6 heads is correct but not helpful.
> > 
> > Good thing you aren't rolling a die. 
> > 
> > How much do you think this upper bound will vary from your estimate?  Have you tried any tests?  What is the result of your "estimate" under-estimating? 
> > 
> > I think if I were doing this I'd try to get a handle on the expected results and how much they might vary before I start looking for an equation to "estimate" the number.  The one thing you didn't do with the die example is to figure out how much you need to adjust your estimate to get a bound that will include 99.9xxx% of your cases or whatever value you need.  Do you know that equation? 
> > 
> > 
> >   Rick C.
> > 
> >   + Get 6 months of free supercharging
> >   + Tesla referral code - https://ts.la/richard11209
> 
> Sorry, my response could've been more nicely expressed.  I originally though that ASIC guys must have some formula for this but perhaps not--maybe they just run it through the synthesizer and check.  I'm writing ASIC code but don't have direct access to the synthesizer.  If I did I could maybe plot a few points and fit a curve to it.  I only have access to FPGA synthesizers and of course they just implement ROMs in LUTs and the LUT count is proportional to the number of bits in the ROM and there is no logic sharing as the ROM size increases.
> 
> I did think about the die/coin example and how to find how good the estimate is.  For example, say you flip a coin 1000 times.  My expected number of heads is is 500.  Say you want to know how many runs will have between 450 and 550 heads.  You can use the Poisson CDF.  In Matlab/Octave:
> 
>   octave:32> n_flips=1000; exp_heads = n_flips*0.5; poisscdf(550,exp_heads)-poisscdf(450,exp_heads)
>   ans =  0.97469
> 
> So I'll be in that range 97.5% of the time.

My point is that unless you can come up with a number like this for your estimate, how much good will it do you?  Even if you are right 99% of the time, when designing a chip is that of much value?  

I would think knowing the upper bound is a very useful thing indeed when planning an ASIC.  But I don't have any more info on calculating the estimate you say you need, so I can't help you. 

  Rick C.

  -- Get 6 months of free supercharging
  -- Tesla referral code - https://ts.la/richard11209

Reply by Thomas Stanka ●January 7, 20192019-01-07

Am Mittwoch, 2. Januar 2019 00:55:57 UTC+1 schrieb Kevin Neilson:

> So I'll be in that range 97.5% of the time.

I think you did not understand my posting. There is no way of estimating the synthesis result of a ROM right by simple formula.
Unless you build a model of the synthesis tool itself with this formula.

A change of 1 dataword in a ROM with 1024 words could significant change the synthesis result by more than 1%.

There is a simple upper bound and a simple lower bound but real ROMs are always in between.
Simple upper bound is calculated by building DNF for ROM in given technology and lower bound is 1 gate per bit (tie0 or tie1) for stand alone synthesis.

Take for example DES encryption algorithm S-Box. These are lookup tables (ROM) designed to be not easy reduced by synthesis tools. You could synthesis one of this S-Box 10 times with different seeds and would not get 2 identical results for the same S-Box. 

If you have no access to ASIC synthesis tool than download free FPGA synthesis tool and test the results for FPGA synthesis. This gives at least a feeling how synthesis tools deal with your ROM.

bye Thomas

Previous 12Next

Estimating ROM gate count in ASIC

Sign in

Search forums

Free PDF Downloads

Blogs - Hall of Fame

Quick Links

About FPGARelated.com

Social Networks

The Related Media Group