comp.arch.fpga

Hi,


OK, left the lora chips asside for a while, so .. now back to FPGAs.

I have two olimex ice40 boards where I would like to use the onboard 
SRAM. The RAM chip is a samsung K5R4016V1B-10 (256K words * 16 bits).

The datasheets are here:
https://www.olimex.com/Products/_resources/ds_k6r4016v1d_rev40.pdf
The most important pages are page 7 (for "read"), pages 8 and 9 (for 
"write") and page 10 (for the functional description of the pins).


I am trying to interprete the datasheets to see how to use the chip. I 
think I understand how to read or write one word, but I still puzzled on 
how to do bulk-write transfers


* For read, it seams to be simple:
set /WE high and  /OE low (*)

1/ put the address on the address-bus
2/ 10 ns later, read the data from the data-out
(*) ignoring the /CS, /LB and /UB pins to keep things simple.

In bulk transfer, it is like this:
- set address 1 on the Address bus
- 10 ns later:
-> read the data of address 1 from data-out
-> (at the same time) set address 2 on the address bus
- 10 bs later:
-> read the data of address 2 from data-out
-> (at the same time) set address 3 on the address bus
(etc)


* For write, to write one single word, I think it goes like this

1/ set /WE low and /OE high to go to "write" mode
-> at the same time set te address on the address bus
-> do not yet put the data on the databus (as it still in "output" mode)
2/ 10 ns later:
-> put the data on the data-bus (by then, the data-bus has switched to 
"data-in"
3/ another 10 ns later:
-> set /WE high and /OE low to leave "write" mode

But I am still puzzled on how to do a "bulk write" of data. The 
datasheets do not mention anything on what happens if leave the chip in 
"write" mode and just change the address on the address-bus (as is done 
for bulk-read)

It there is no seperate bulk-write protocol, it looks like a write to 
the chip takes 3 times as much steps then a bulk-read (3 steps compaired 
to one single step).


Is this a correct interpretation of the datasheet?

Can somebody who has already interfaced an FPGA with SRAM confirm or 
deny this. Or is there another trick on how to do a bulk-write on a SRAM 
chip?




Cheerio! Kr. Bonne.

Reply by Cecil Bayona ●July 22, 20172017-07-22

Static RAM chips do not have bulk mode, it's not needed, you write to it 
one word at a time. Its EEPROM, FLASH, and similar memory with it's 
complicated setup that are  in need of bulk mode as they are slow and 
bulk mode is faster, some only have bulk mode.


On 7/22/2017 12:52 PM, kristoff wrote:
> Hi,
> 
> 
> OK, left the lora chips asside for a while, so .. now back to FPGAs.
> 
> I have two olimex ice40 boards where I would like to use the onboard 
> SRAM. The RAM chip is a samsung K5R4016V1B-10 (256K words * 16 bits).
> 
> The datasheets are here:
> https://www.olimex.com/Products/_resources/ds_k6r4016v1d_rev40.pdf
> The most important pages are page 7 (for "read"), pages 8 and 9 (for 
> "write") and page 10 (for the functional description of the pins).
> 
> 
> I am trying to interprete the datasheets to see how to use the chip. I 
> think I understand how to read or write one word, but I still puzzled on 
> how to do bulk-write transfers
> 
> 
> * For read, it seams to be simple:
> set /WE high and  /OE low (*)
> 
> 1/ put the address on the address-bus
> 2/ 10 ns later, read the data from the data-out
> (*) ignoring the /CS, /LB and /UB pins to keep things simple.
> 
> In bulk transfer, it is like this:
> - set address 1 on the Address bus
> - 10 ns later:
> -> read the data of address 1 from data-out
> -> (at the same time) set address 2 on the address bus
> - 10 bs later:
> -> read the data of address 2 from data-out
> -> (at the same time) set address 3 on the address bus
> (etc)
> 
> 
> * For write, to write one single word, I think it goes like this
> 
> 1/ set /WE low and /OE high to go to "write" mode
> -> at the same time set te address on the address bus
> -> do not yet put the data on the databus (as it still in "output" mode)
> 2/ 10 ns later:
> -> put the data on the data-bus (by then, the data-bus has switched to 
> "data-in"
> 3/ another 10 ns later:
> -> set /WE high and /OE low to leave "write" mode
> 
> But I am still puzzled on how to do a "bulk write" of data. The 
> datasheets do not mention anything on what happens if leave the chip in 
> "write" mode and just change the address on the address-bus (as is done 
> for bulk-read)
> 
> It there is no seperate bulk-write protocol, it looks like a write to 
> the chip takes 3 times as much steps then a bulk-read (3 steps compaired 
> to one single step).
> 
> 
> Is this a correct interpretation of the datasheet?
> 
> Can somebody who has already interfaced an FPGA with SRAM confirm or 
> deny this. Or is there another trick on how to do a bulk-write on a SRAM 
> chip?
> 
> 
> 
> 
> Cheerio! Kr. Bonne.


-- 
Cecil - k5nwa

Reply by kristoff ●July 22, 20172017-07-22

Hi Cecil,


Thanks for your reply.


I agree it's not a bulk-mode as such.

What I meant was that when doing multiple reads one after the other you 
can stich them together:


Correct me if I am wrong, but how I interprete the datasheets, the "read 
data from the address-bus" can be done at the same time as the "set next 
address on address-bus". This -I think- means you can "overlap" two 
concequative reads, resulting in one read per clock cycle.

At least, that is -I guess- what the "t OH" (Output Hold from Address 
Change) means in the "ready cycle(1)" timing waveform on page 7 of the 
datasheet).



But I do not see how (or if) something simular can be done for "write" 
operations, but perhaps I am missing something.




Kristoff




On 22-07-17 20:19, Cecil Bayona wrote:
> Static RAM chips do not have bulk mode, it's not needed, you write to it 
> one word at a time. Its EEPROM, FLASH, and similar memory with it's 
> complicated setup that are  in need of bulk mode as they are slow and 
> bulk mode is faster, some only have bulk mode.
> 
> 
> On 7/22/2017 12:52 PM, kristoff wrote:
>> Hi,
>>
>>
>> OK, left the lora chips asside for a while, so .. now back to FPGAs.
>>
>> I have two olimex ice40 boards where I would like to use the onboard 
>> SRAM. The RAM chip is a samsung K5R4016V1B-10 (256K words * 16 bits).
>>
>> The datasheets are here:
>> https://www.olimex.com/Products/_resources/ds_k6r4016v1d_rev40.pdf
>> The most important pages are page 7 (for "read"), pages 8 and 9 (for 
>> "write") and page 10 (for the functional description of the pins).
>>
>>
>> I am trying to interprete the datasheets to see how to use the chip. I 
>> think I understand how to read or write one word, but I still puzzled 
>> on how to do bulk-write transfers
>>
>>
>> * For read, it seams to be simple:
>> set /WE high and  /OE low (*)
>>
>> 1/ put the address on the address-bus
>> 2/ 10 ns later, read the data from the data-out
>> (*) ignoring the /CS, /LB and /UB pins to keep things simple.
>>
>> In bulk transfer, it is like this:
>> - set address 1 on the Address bus
>> - 10 ns later:
>> -> read the data of address 1 from data-out
>> -> (at the same time) set address 2 on the address bus
>> - 10 bs later:
>> -> read the data of address 2 from data-out
>> -> (at the same time) set address 3 on the address bus
>> (etc)
>>
>>
>> * For write, to write one single word, I think it goes like this
>>
>> 1/ set /WE low and /OE high to go to "write" mode
>> -> at the same time set te address on the address bus
>> -> do not yet put the data on the databus (as it still in "output" mode)
>> 2/ 10 ns later:
>> -> put the data on the data-bus (by then, the data-bus has switched to 
>> "data-in"
>> 3/ another 10 ns later:
>> -> set /WE high and /OE low to leave "write" mode
>>
>> But I am still puzzled on how to do a "bulk write" of data. The 
>> datasheets do not mention anything on what happens if leave the chip 
>> in "write" mode and just change the address on the address-bus (as is 
>> done for bulk-read)
>>
>> It there is no seperate bulk-write protocol, it looks like a write to 
>> the chip takes 3 times as much steps then a bulk-read (3 steps 
>> compaired to one single step).
>>
>>
>> Is this a correct interpretation of the datasheet?
>>
>> Can somebody who has already interfaced an FPGA with SRAM confirm or 
>> deny this. Or is there another trick on how to do a bulk-write on a 
>> SRAM chip?
>>
>>
>>
>>
>> Cheerio! Kr. Bonne.
> 
>

Reply by Richard Damon ●July 22, 20172017-07-22

This looks to be a fairly standard asynchronous static ram.

The basic requirement for a write cycle is that there is a Tas (Address 
stable) which the address bus must be stable before you can pull the WE 
line low, a Twp as the minimum length of time you can need to pull the 
WE signal low, and a Taw address hold you need to hold the address bus 
stable after WE goes high.

Sine Tas >= 0, and Taw >= 0, it is easy to think that you can just clock 
the WE signal on the same clock edge as the address, but that requires 
that the FPGA and the board layout has ZERO skew, which is basically 
impossible.

As you note, it is easy to read at full speed, cycle after cycle, you 
just need clock new addresses and one cycle later you can read the 
results. Note, this is not really a 'burst' operation, but just running 
full cycles one after the other (the burst terminology tend to imply 
there is some setup you do and after that you can read a given number of 
locations without needing to do the setup again).

For write with this sort of part there are several options:

1) Simplest, do every thing on rising edges and need 3 clock cycles to 
write, cycle 1, change address, cycle 2: drop we, cycle 3: Raise we and 
address hold.

2) Slightly more complicated, again do things on rising edges, but have 
something to delay the WE signal slightly. 2 Cycles, 1) Set Address, and 
with slight delay drop WE. 2) Hold address, and after a slight delay 
raise WE.

3) Instead of a slight delay in WE, drive WE on the falling edge of the 
clock, again 2 Cycles as above with the slight delay being the 1/2 cycle 
delay of the falling edge.

4) Discrete Pulse generation logic, have logic on the board with delay 
lines to generate the write pulse, so that WE will pulse low shortly 
after the address is stable, and comes back high shortly before the 
address might change again. This lets you do a write every cycle.

5) Like the Discrete Pulse Generation, but in the FPGA using a higher 
speed clock. If you can be sure that the WE pulse is faster or slower 
than the address bus (including FPGA skew), you could use a 400-500 MHz 
clock and create a 7.5/8 ns pulse on WE. If you can enforce that, you 
can use a 700 MHz clock and generate a 5 clock cycle pulse (7.14ns) in 
the middle of the 10 ns cycle.

This is one of the limitations of asynchronous rams, write cycles take 
more 'edges' to perform. Thus either needing more cycles or something to 
generate higher speed edges.

On 7/22/17 1:52 PM, kristoff wrote:
> Hi,
> 
> 
> OK, left the lora chips asside for a while, so .. now back to FPGAs.
> 
> I have two olimex ice40 boards where I would like to use the onboard 
> SRAM. The RAM chip is a samsung K5R4016V1B-10 (256K words * 16 bits).
> 
> The datasheets are here:
> https://www.olimex.com/Products/_resources/ds_k6r4016v1d_rev40.pdf
> The most important pages are page 7 (for "read"), pages 8 and 9 (for 
> "write") and page 10 (for the functional description of the pins).
> 
> 
> I am trying to interprete the datasheets to see how to use the chip. I 
> think I understand how to read or write one word, but I still puzzled on 
> how to do bulk-write transfers
> 
> 
> * For read, it seams to be simple:
> set /WE high and  /OE low (*)
> 
> 1/ put the address on the address-bus
> 2/ 10 ns later, read the data from the data-out
> (*) ignoring the /CS, /LB and /UB pins to keep things simple.
> 
> In bulk transfer, it is like this:
> - set address 1 on the Address bus
> - 10 ns later:
> -> read the data of address 1 from data-out
> -> (at the same time) set address 2 on the address bus
> - 10 bs later:
> -> read the data of address 2 from data-out
> -> (at the same time) set address 3 on the address bus
> (etc)
> 
> 
> * For write, to write one single word, I think it goes like this
> 
> 1/ set /WE low and /OE high to go to "write" mode
> -> at the same time set te address on the address bus
> -> do not yet put the data on the databus (as it still in "output" mode)
> 2/ 10 ns later:
> -> put the data on the data-bus (by then, the data-bus has switched to 
> "data-in"
> 3/ another 10 ns later:
> -> set /WE high and /OE low to leave "write" mode
> 
> But I am still puzzled on how to do a "bulk write" of data. The 
> datasheets do not mention anything on what happens if leave the chip in 
> "write" mode and just change the address on the address-bus (as is done 
> for bulk-read)
> 
> It there is no seperate bulk-write protocol, it looks like a write to 
> the chip takes 3 times as much steps then a bulk-read (3 steps compaired 
> to one single step).
> 
> 
> Is this a correct interpretation of the datasheet?
> 
> Can somebody who has already interfaced an FPGA with SRAM confirm or 
> deny this. Or is there another trick on how to do a bulk-write on a SRAM 
> chip?
> 
> 
> 
> 
> Cheerio! Kr. Bonne.

Reply by ●July 22, 20172017-07-22

Den l&oslash;rdag den 22. juli 2017 kl. 20.32.52 UTC+2 skrev kristoff:
> Hi Cecil,
> 
> 
> Thanks for your reply.
> 
> 
> I agree it's not a bulk-mode as such.
> 
> What I meant was that when doing multiple reads one after the other you 
> can stich them together:
> 
> 
> Correct me if I am wrong, but how I interprete the datasheets, the "read 
> data from the address-bus" can be done at the same time as the "set next 
> address on address-bus". This -I think- means you can "overlap" two 
> concequative reads, resulting in one read per clock cycle.

SRAM doesn't have a clock, you just have to comply with the required timing

> 
> At least, that is -I guess- what the "t OH" (Output Hold from Address 
> Change) means in the "ready cycle(1)" timing waveform on page 7 of the 
> datasheet).
> 
> 
> 
> But I do not see how (or if) something simular can be done for "write" 
> operations, but perhaps I am missing something.
> 

write happens on the rising edge on /WR

-Lasse

Reply by Richard Damon ●July 22, 20172017-07-22

On 7/22/17 3:56 PM, lasselangwadtchristensen@gmail.com wrote:
> Den l&oslash;rdag den 22. juli 2017 kl. 20.32.52 UTC+2 skrev kristoff:
>> Hi Cecil,
>>
>>
>> Thanks for your reply.
>>
>>
>> I agree it's not a bulk-mode as such.
>>
>> What I meant was that when doing multiple reads one after the other you
>> can stich them together:
>>
>>
>> Correct me if I am wrong, but how I interprete the datasheets, the "read
>> data from the address-bus" can be done at the same time as the "set next
>> address on address-bus". This -I think- means you can "overlap" two
>> concequative reads, resulting in one read per clock cycle.
> 
> SRAM doesn't have a clock, you just have to comply with the required timing
> 
>>
>> At least, that is -I guess- what the "t OH" (Output Hold from Address
>> Change) means in the "ready cycle(1)" timing waveform on page 7 of the
>> datasheet).
>>
>>
>>
>> But I do not see how (or if) something simular can be done for "write"
>> operations, but perhaps I am missing something.
>>
> 
> write happens on the rising edge on /WR
> 
> -Lasse
> 

Actually, with asynchronous parts, things don't happen 'on edges' but on 
levels (you measure timing requirements edge to edge). Asynchronous 
Srams tend to be a sea of RS Flip flops, and when write is low, the 
addresses flip flops will have their set or reset line asserted, so if 
you wanted to talk of a time when the write happened, it was on the 
falling edge, with a propagation delay/hold requirement.

Toh is the minimum guaranteed propagation delay from address to data, 
just like Taa is the maximum delay from address to data. (Trc actually 
isn't a critical parameter for the ram itself, but is a nominal system 
parameter. With Asyncronuous SRam, changing the address inputs faster 
than Trc won't cause any problems, except for the fact that you won't 
get valid data out until you stop doing it.

Reply by kristoff ●July 22, 20172017-07-22

Hi Richard,


Thank you for your reply.

Your message really helped to better understand the timing waveforms.


I'll start with the simpest setup and after that experiment with using 
the falling edge of the clock to clear the /WE signal (option 3).



Kristoff

Reply by Richard Damon ●July 22, 20172017-07-22

On 7/22/17 7:46 PM, kristoff wrote:
> Hi Richard,
> 
> 
> Thank you for your reply.
> 
> Your message really helped to better understand the timing waveforms.
> 
> 
> I'll start with the simpest setup and after that experiment with using 
> the falling edge of the clock to clear the /WE signal (option 3).
> 
> 
> 
> Kristoff
> 
> 

One thing to remind about, having a 10ns memory part does NOT mean you 
can talk to it with a 100MHz (10ns) clock. You will need to add in time 
from Clock->output on your address bus, and the needed Setup time on the 
data bus in. If you want the best performance, if possible you want both 
of these to be using FF in the I/O block of the FPGA, as those will have 
much lower propagation delays.

Asynchronous devices can be harder to use, but can give you 
significantly improved read performance if you are worried about 
latency, as synchronous interfaces can cost clock cycle. (on the other 
hand, synchronous interfaces can often write faster as you can often 
just stream the data, and the latency isn't important).

Reply by rickman ●July 24, 20172017-07-24

Richard Damon wrote on 7/22/2017 4:23 PM:
> On 7/22/17 3:56 PM, lasselangwadtchristensen@gmail.com wrote:
>> Den l&oslash;rdag den 22. juli 2017 kl. 20.32.52 UTC+2 skrev kristoff:
>>> Hi Cecil,
>>>
>>>
>>> Thanks for your reply.
>>>
>>>
>>> I agree it's not a bulk-mode as such.
>>>
>>> What I meant was that when doing multiple reads one after the other you
>>> can stich them together:
>>>
>>>
>>> Correct me if I am wrong, but how I interprete the datasheets, the "read
>>> data from the address-bus" can be done at the same time as the "set next
>>> address on address-bus". This -I think- means you can "overlap" two
>>> concequative reads, resulting in one read per clock cycle.
>>
>> SRAM doesn't have a clock, you just have to comply with the required timing
>>
>>>
>>> At least, that is -I guess- what the "t OH" (Output Hold from Address
>>> Change) means in the "ready cycle(1)" timing waveform on page 7 of the
>>> datasheet).
>>>
>>>
>>>
>>> But I do not see how (or if) something simular can be done for "write"
>>> operations, but perhaps I am missing something.
>>>
>>
>> write happens on the rising edge on /WR
>>
>> -Lasse
>>
>
> Actually, with asynchronous parts, things don't happen 'on edges' but on
> levels (you measure timing requirements edge to edge). Asynchronous Srams
> tend to be a sea of RS Flip flops, and when write is low, the addresses flip
> flops will have their set or reset line asserted, so if you wanted to talk
> of a time when the write happened, it was on the falling edge, with a
> propagation delay/hold requirement.
>
> Toh is the minimum guaranteed propagation delay from address to data, just
> like Taa is the maximum delay from address to data. (Trc actually isn't a
> critical parameter for the ram itself, but is a nominal system parameter.
> With Asyncronuous SRam, changing the address inputs faster than Trc won't
> cause any problems, except for the fact that you won't get valid data out
> until you stop doing it.

I think what Richard wrote is the clearest explanation of why there is no 
bulk write with async RAM.  The level of the AND of WR- and CS-.  So while 
these two signals are low it is expected the address does *not* change.  If 
the address changed, the RAM cell selected will change and there can be 
extraneous cells selected as the address lines settle.  By writing to 
location 3 and then 4 without removing WR or CS you can be writing to any 
combination of 0 to 7 in the switch.  Since none of this meets timing the 
writing will be random garbage and not even the data you are trying to write 
to locations 3 and 4.

When both WR and CS are asserted, keep the address stable and keep the data 
stable for the last N ns before either control line is deasserted.

-- 

Rick C

Reply by Mike Perkins ●July 29, 20172017-07-29

On 22/07/2017 20:56, lasselangwadtchristensen@gmail.com wrote:
> Den l&oslash;rdag den 22. juli 2017 kl. 20.32.52 UTC+2 skrev kristoff:
>> Hi Cecil,
>>
>>
>> Thanks for your reply.
>>
>>
>> I agree it's not a bulk-mode as such.
>>
>> What I meant was that when doing multiple reads one after the other you
>> can stich them together:
>>
>>
>> Correct me if I am wrong, but how I interprete the datasheets, the "read
>> data from the address-bus" can be done at the same time as the "set next
>> address on address-bus". This -I think- means you can "overlap" two
>> concequative reads, resulting in one read per clock cycle.
>
> SRAM doesn't have a clock, you just have to comply with the required timing

There are some forms of clocked SRAM. ZBT was one type introduced by IDT.

I assume it still exists?


-- 
Mike Perkins
Video Solutions Ltd
www.videosolutions.ltd.uk

Previous12 3 4 5 Next

sram

Sign in

You might also like...

Search forums

Free PDF Downloads

Blogs - Hall of Fame

Quick Links

About FPGARelated.com

Social Networks

The Related Media Group