comp.arch.fpga | how to speed up my accumulator ??| page 3

Reply by rickman ●December 6, 20042004-12-06

Moti wrote:
> 
> Hi Jhon,
> thanks for your reply, altough I have to admit that I didnt entirely
> understood how to actually caluculate the frequency.
> Best regards, Moti

I think he is saying that you should expect the maximum deviations to be
n/m * Fref and n+1/m * Fref where n/m is the closest integer ratio that
gives you an exact frequency that the NCO can make without jitter.  

Example: Fref is 1 MHz, Fout is 211 kHz, 32 bit accumulator.  This gives
a step size of Fout/Fref * 2^32 = 906,238,099.456 ~= 906,238,099.  This
will roll over the MSB on 4 or 5 clock pulses.   So your Fout will be
composed of pulses of 4 clocks and pulses of 5 clocks with high and low
times of 2 and 3 clocks.  The corresponding jitter will be...  geeze,
this is hard isn't it?  

I think you will get Fref/6 and Fref/2 as the range of jitter.  I think
the formula should be...  

Fmax = floor(Fref/(2*Fout)) * 0.5
Fmin = ceiling(Fref/(2*Fout)) * 0.5
Fjitter = Fmax-Fmin

I expect you will see main peaks in your FFT at Fmax and Fmin, no?  

-- 

Rick "rickman" Collins

rick.collins@XYarius.com
Ignore the reply address. To email me use the above address with the XY
removed.

Arius - A Signal Processing Solutions Company
Specializing in DSP and FPGA design      URL http://www.arius.com
4 King Ave                               301-682-7772 Voice
Frederick, MD 21701-3110                 301-682-7666 FAX

Reply by Falk Brunner ●December 6, 20042004-12-06

"rickman" <spamgoeshere4@yahoo.com> schrieb im Newsbeitrag
news:41B4AF23.5CCD7DA7@yahoo.com...

> Example: Fref is 1 MHz, Fout is 211 kHz, 32 bit accumulator.  This gives
> a step size of Fout/Fref * 2^32 = 906,238,099.456 ~= 906,238,099.  This
> will roll over the MSB on 4 or 5 clock pulses.   So your Fout will be
> composed of pulses of 4 clocks and pulses of 5 clocks with high and low
> times of 2 and 3 clocks.  The corresponding jitter will be...  geeze,
> this is hard isn't it?

If someone really want to break this down to the last bit, I got a PHD paper
here that is all about DDS, spectrum etc.
But its german, 3 Mb pdf.
If someone is intereted, drop me a mail.

Regards
Falk

Reply by rickman ●December 6, 20042004-12-06

Antti Lukats wrote:
> 
> "rickman" <spamgoeshere4@yahoo.com> wrote in message
> news:41B49989.E7D9EFBC@yahoo.com...
> > Antti Lukats wrote:
> > >
> > > [lots of snipped]
> > > > > Rick, hmmm... care to comment?
> > > > > see synthesis and timing reports above :)
> > > >
> [snip]
> > > I used all signal 32 bit wide, inc_value as input port
> >
> > That should work.  Can you post the code you worked with?
> > --
> > Rick "rickman" Collins
> 
> Rick you can try your own code with XST it complains about the sll at least!
> Maybe there is better(read proper fix) to main
> 
> the timings I posted I always posted synthesis estimae and post P&R timings
> news posting did change the text aligne so was hard to read
> 
> below is what I used (fast-do-not-think-at-all .. fixed) from your code
> ----------------------------------------------------------------------------
> ---
> library IEEE;
> use IEEE.STD_LOGIC_1164.ALL;
> use IEEE.STD_LOGIC_ARITH.ALL;
> use IEEE.STD_LOGIC_UNSIGNED.ALL;
> 
> entity dds is
>     Port ( clk       : in std_logic;
>            rst       : in std_logic;
>            inc_value : in std_logic_vector(31 downto 0);
>            fout      : out std_logic);
> end dds;
> 
> architecture Behavioral of dds is
> 
> signal accsingle : std_logic_vector(31 downto 0);
> signal accdouble : std_logic_vector(31 downto 0);
> signal accfast   : std_logic_vector(31 downto 0);
> signal phase     : std_logic;
> 
> begin
> 
> process (clk, rst)
> begin
>   if rst = '1' then
>     phase       <= '0';
>     accsingle   <= (others =>'0');
>     accdouble   <= (others =>'0');
>     accfast     <= (others =>'0');
>   elsif clk'event and clk ='1' then
>     phase       <= not phase;
>     if (phase = '0') then
>       accfast   <= accsingle;
>     else
>       accfast   <= accdouble;
>       accsingle <= accdouble + inc_value;
>       accdouble <= accdouble + (inc_value(30 downto 0) & '0');
>     end if;
>   end if;
> end process;
> 
> fout <= accfast (accfast'high);
> 
> end Behavioral;

This looks good to me.  I found the sll in a book that warned about not
always being supported in synthesis since it was VHDL-1993 and still new
at that time.  Jeeze, you would think XST would have gotten up the curve
by now.  

But this code looks good to me.  It should produce the following
hardware..

            accsingle ====X=======X=======X=
            accdouble ====X=======X=======X=
              accfast X===X===X===X===X===X=
                       _   _   _   _   _   _
+--0<|---+      Clock | |_| |_| |_| |_| |_|
|        |              ___     ___     ___
| +---+  |      Phase  |   |___|   |___|   |__
+-|   |--+--------+------------------------+ 
  |   |           |                        |
  |>  |           |                        |
  +---+           | V Left shift one bit   |
        +---------)-------------+          |
        |         |             |          |
        | |\      |             |  |\      |
        | | \     |             |  | \     |
 -------+-|  \    |             +--|  \    |
          |   \   |    accsingle   |   \   |   accdouble
inc_value  \   |  |  +----+         \   |  |  +----+
            >  |--)--|    |--+       >  |--)--|    |--+
           /   |  |  |    |  |      /   |  |  |    |  |
          |   /   +--|En  |  |     |   /   +--|En  |  |
       +--|  /       |    |  |  +--|  /    |  |    |  |
       |  | /        |>   |  |  |  | /     |  |>   |  |
       |  |/         +----+  |  |  |/      |  +----+  |
       |                     |  |          |          |        
       +---------------------)--+----------)----------+
       |                     |             |
       |                     |   |\--------+
       |                     +---| \          accfast
       |                         |  |        +----+
       |                         |  |--------|    |---->
       |                         |  |        |    |  
       +-------------------------| /         |    |  
                                 |/          |    |  
                                             |>   |  
                                             +----+  

The paths leading to accsingle and accdouble have two clock cycles to
settle and the paths to accfast has to settle in one clock cycle.  

Looking at this, I realize that the accfast does not need the full 32
bits since only the MSb is used.  So the LUT count should be 32 * 2 + 2
= 66.  

The timing constraints must reflect the proper delays (2 clocks default
with one clock on all paths ending at accfast and all paths starting at
phase.  Then the critical path will be one of the one clock paths which
should be able to be optimized to much faster than 200 MHz.  Without the
timing constraints, the tools will analyze the adder paths as the
critical paths and report wrong timings.  

The Spartan 3 CLB has input setup and clock to output delays of about
0.7 ns.  With a routing delay of 1 ns between FFs in the same CLB, (I am
estimating a worst case routing delay, *really* worst case) this gives
2.4 ns or over 400 MHz.  I guess the routing of the phase signal to the
32 bit registers may be a bit slower, but good floorplanning should
speed this up.  Heck, I can get close to this speed in a relatively slow
and *ancient* ACEX part from Altera.  I am sure the Spartan 3 can do
better.   

-- 

Rick "rickman" Collins

rick.collins@XYarius.com
Ignore the reply address. To email me use the above address with the XY
removed.

Arius - A Signal Processing Solutions Company
Specializing in DSP and FPGA design      URL http://www.arius.com
4 King Ave                               301-682-7772 Voice
Frederick, MD 21701-3110                 301-682-7666 FAX

Reply by mike...@comcast.net ●December 6, 20042004-12-06

Hi Rick
SLL will work for numeric_std.unsigned or bit vector:
http://groups-beta.google.com/groups?q=vhdl+sll+sla+treseler

Thanks for the code example.
Nice demo of area vs. speed at the hdl level.

      -- Mike Treseler

Reply by Moti ●December 6, 20042004-12-06

Hi Rickman,
Lots of thanks for the explanation and the numeric example.
I will defently use it and afterwards I will probably measure the
actual jitter frequency and compare it to the calculated results - when
I'll get the results I will post them (for those intersted)..
Thanks again, Moti.

Reply by Moti ●December 6, 20042004-12-06

Hi Rickman,
Lots of thanks for the explanation and the numeric example.
I will defently use it and afterwards I will probably measure the
actual jitter frequency and compare it to the calculated results - when
I'll get the results I will post them (for those intersted)..
Thanks again, Moti.

Reply by Moti ●December 6, 20042004-12-06

Hi Falk,
My german is pretty "rusty" :)  so if the document is in .pdf format it
will very hard...
but if it's in a html format it can translated by google and then it
will be possible to read it!
Regards, Moti.

Reply by John_H ●December 6, 20042004-12-06

It's involved...
With your example of 211kHz from a 1MHz reference, the ratio of
906,238,099/2^32 has closest fractions in order of worst to best of

1/5
4/19
23/109
211/1000
1987386/9418891
19873649/94187910
57633561/273144839

The offsets are the ideal frequency compared to the ratio frequency:
211 kHz - 200 kHz
211 kHz - 210.52632 kHz
211 kHz - 211.00917 kHz
211 kHz - 211 kHz... Here Excel starts to lose digits:

  The difference between 906,238,099/2^32 and 906,238,099.456/2^32 is about
5.03e-10 at which point small amounts of jitter are lost.  If the jitter at
that tiny offset is large, you will experience phase jumps when that beat
frequency is felt.  There's no way to filter those with analog filters.

Your largest observed peaks in the spectrum will be at offsets of 11 kHz,
526 Hz, and 9.17 Hz.  You should be able to see the 526 Hz modulating the
11kHz for spikes much smaller than the 11 kHz peak.


"rickman" <spamgoeshere4@yahoo.com> wrote in message
news:41B4AF23.5CCD7DA7@yahoo.com...
> Moti wrote:
> >
> > Hi Jhon,
> > thanks for your reply, altough I have to admit that I didnt entirely
> > understood how to actually caluculate the frequency.
> > Best regards, Moti
>
> I think he is saying that you should expect the maximum deviations to be
> n/m * Fref and n+1/m * Fref where n/m is the closest integer ratio that
> gives you an exact frequency that the NCO can make without jitter.
>
> Example: Fref is 1 MHz, Fout is 211 kHz, 32 bit accumulator.  This gives
> a step size of Fout/Fref * 2^32 = 906,238,099.456 ~= 906,238,099.  This
> will roll over the MSB on 4 or 5 clock pulses.   So your Fout will be
> composed of pulses of 4 clocks and pulses of 5 clocks with high and low
> times of 2 and 3 clocks.  The corresponding jitter will be...  geeze,
> this is hard isn't it?
>
> I think you will get Fref/6 and Fref/2 as the range of jitter.  I think
> the formula should be...
>
> Fmax = floor(Fref/(2*Fout)) * 0.5
> Fmin = ceiling(Fref/(2*Fout)) * 0.5
> Fjitter = Fmax-Fmin
>
> I expect you will see main peaks in your FFT at Fmax and Fmin, no?
>
> -- 
>
> Rick "rickman" Collins
>
> rick.collins@XYarius.com
> Ignore the reply address. To email me use the above address with the XY
> removed.
>
> Arius - A Signal Processing Solutions Company
> Specializing in DSP and FPGA design      URL http://www.arius.com
> 4 King Ave                               301-682-7772 Voice
> Frederick, MD 21701-3110                 301-682-7666 FAX

Reply by Jim Granville ●December 6, 20042004-12-06

John_H wrote:
> It's involved...
> With your example of 211kHz from a 1MHz reference, the ratio of
> 906,238,099/2^32 has closest fractions in order of worst to best of
> 
> 1/5
> 4/19
> 23/109
> 211/1000
> 1987386/9418891
> 19873649/94187910
> 57633561/273144839
> 
> The offsets are the ideal frequency compared to the ratio frequency:
> 211 kHz - 200 kHz
> 211 kHz - 210.52632 kHz
> 211 kHz - 211.00917 kHz
> 211 kHz - 211 kHz... Here Excel starts to lose digits:
> 
>   The difference between 906,238,099/2^32 and 906,238,099.456/2^32 is about
> 5.03e-10 at which point small amounts of jitter are lost.  If the jitter at
> that tiny offset is large, you will experience phase jumps when that beat
> frequency is felt.  There's no way to filter those with analog filters.
> 
> Your largest observed peaks in the spectrum will be at offsets of 11 kHz,
> 526 Hz, and 9.17 Hz.  You should be able to see the 526 Hz modulating the
> 11kHz for spikes much smaller than the 11 kHz peak.

Good example maths, but is the principle right ?

For the example of 211KHz from 1Mhz, you have 1us quantize, and so will 
be able to generate 4us, or 5us periods, giving 250KHz and 200KHz.

Over many cycles, the 'wobbling' between these two will average to 
211KHz. The more cycles, the better the match to 211KHz.

Over a 6 cycle snapshot, you might see 5@200, 1@250, and Favge 208.33Khz
That's appx one part in 77 too slow.
This 6 cycle frame has a freq of 34.6KHz

Next frame group would be (eg)
every 79 cycles, to see => 14 @ 250KHz, 65@ 200KHz => 210.76923Khz, 
Error is now one part in 1000, and this finer frame is 2.65KHz
( etc ) as over wider frame snap-shots, the average frequency gets
closer to the 211KHz ideal.

So I'd expect to see, on a spectrum analyser, 200KHz, (Dominant) 250KHz
and 34.6KHz and 2.65KHz (etc) energies.

-jg

Reply by Allan Herriman ●December 6, 20042004-12-06

On Tue, 07 Dec 2004 15:34:36 +1300, Jim Granville
<no.spam@designtools.co.nz> wrote:

>John_H wrote:
>> It's involved...
>> With your example of 211kHz from a 1MHz reference, the ratio of
>> 906,238,099/2^32 has closest fractions in order of worst to best of
>> 
>> 1/5
>> 4/19
>> 23/109
>> 211/1000
>> 1987386/9418891
>> 19873649/94187910
>> 57633561/273144839
>> 
>> The offsets are the ideal frequency compared to the ratio frequency:
>> 211 kHz - 200 kHz
>> 211 kHz - 210.52632 kHz
>> 211 kHz - 211.00917 kHz
>> 211 kHz - 211 kHz... Here Excel starts to lose digits:
>> 
>>   The difference between 906,238,099/2^32 and 906,238,099.456/2^32 is about
>> 5.03e-10 at which point small amounts of jitter are lost.  If the jitter at
>> that tiny offset is large, you will experience phase jumps when that beat
>> frequency is felt.  There's no way to filter those with analog filters.
>> 
>> Your largest observed peaks in the spectrum will be at offsets of 11 kHz,
>> 526 Hz, and 9.17 Hz.  You should be able to see the 526 Hz modulating the
>> 11kHz for spikes much smaller than the 11 kHz peak.
>
>Good example maths, but is the principle right ?
>
>
>For the example of 211KHz from 1Mhz, you have 1us quantize, and so will 
>be able to generate 4us, or 5us periods, giving 250KHz and 200KHz.
>
>Over many cycles, the 'wobbling' between these two will average to 
>211KHz. The more cycles, the better the match to 211KHz.
>
>Over a 6 cycle snapshot, you might see 5@200, 1@250, and Favge 208.33Khz
>That's appx one part in 77 too slow.
>This 6 cycle frame has a freq of 34.6KHz
>
>Next frame group would be (eg)
>every 79 cycles, to see => 14 @ 250KHz, 65@ 200KHz => 210.76923Khz, 
>Error is now one part in 1000, and this finer frame is 2.65KHz
>( etc ) as over wider frame snap-shots, the average frequency gets
>closer to the 211KHz ideal.
>
>So I'd expect to see, on a spectrum analyser, 200KHz, (Dominant) 250KHz
>and 34.6KHz and 2.65KHz (etc) energies.

Did you actually plug it in to a spectrum analyser and see those
tones?

Ten highest spurious tones:

 55.000kHz -11.4dBc
367.000kHz -16.7dBc
101.000kHz -17.0dBc
165.000kHz -22.4dBc
  9.000kHz -22.9dBc
257.000kHz -24.1dBc
321.000kHz -25.1dBc
147.000kHz -25.8dBc
119.000kHz -27.4dBc
 37.000kHz -27.7dBc

Regards,
Allan

Previous 1 234 5 Next

how to speed up my accumulator ??

Sign in

Search forums

Free PDF Downloads

Blogs - Hall of Fame

Quick Links

About FPGARelated.com

Social Networks

The Related Media Group