Messages from 105275

Article: 105275
Subject: Virtex-4 PowerPC and Trace32 ICD - start up help wanted
From: jackhab@gmail.com
Date: 19 Jul 2006 06:13:21 -0700
Links: << >> << T >> << A >>

Hello, guys!

Anyone has an experience debugging Virtex-4 PPC via Lauterbach TRACE32?
I just can't connect them together!
I have previous experience in ARM-7 and it was simply plug-and-play.
Please, help.

Article: 105276
Subject: PCIe: use 8*x1 PHY devices to form x8
From: "=?iso-8859-1?B?R2FMYUt0SWtVc5k=?=" <taileb.mehdi@gmail.com>
Date: 19 Jul 2006 06:16:53 -0700
Links: << >> << T >> << A >>

Hi all,
We are about to implement a PCIe x8 core but don't want to make it on
Virtex-4 FX devices, so the PHY layer should be realized by external
chips. So the question is: is it possible to use 8 external x1 PHY
devices (for example Philips PX1011A or TI XIO1100) to form a x8 PHY?

Article: 105277
Subject: Re: Which PCI core for Cyclone II board?
From: "Brian McFarland" <brian.mcf1985@gmail.com>
Date: 19 Jul 2006 06:19:30 -0700
Links: << >> << T >> << A >>

Mark McDougall wrote:
> No implementation/compatibility issues at all really. Nice if you have
> wishbone peripherals you want to hook up on the back end - very easy to
> get going. Had opencores DMA mastering over PCI too.
>
> Performance is another issue. If you're looking to push PCI burst
> transfers to the limit, then it's not so straightforward. Not saying it
> can't be done, but it will take some effort.

That's another thing i meant to ask about in my original post.  Is
there noticible performance difference between the different available
cores?

This is my experience with anything PCI related, so I'm still not very
clear on whether I can get away with a target, or if I'll need to
mastering/DMA capabilities.  I know that theoretically, the max tranfer
rate of the bus is 133 MB/s w/ 33MHz systems.  The intended customer of
this thing is going for low cost, so I'm not going to assume that it
will be used with a computer that supports 66Mhz or 64-bit transfers.
Ideally, I would like to be able to guarantee 54 MB/s with pretty much
equal I/O rates (27MB/s into and out of the device). Most of the time,
rates should be lower than that, but just about the max I could ever
need it to be.  This card will probably be the only thing on the PCI
bus of the PC it's plugged into, and the primary purpose of the PC will
be to process the data to & from this device, so having moderately high
CPU usage requirements is not much of a concern.

Article: 105278
Subject: Re: PCIe: use 8*x1 PHY devices to form x8
From: "Antti" <Antti.Lukats@xilant.com>
Date: 19 Jul 2006 06:28:58 -0700
Links: << >> << T >> << A >>

GaLaKtIkUs=99 schrieb:

> Hi all,
> We are about to implement a PCIe x8 core but don't want to make it on
> Virtex-4 FX devices, so the PHY layer should be realized by external
> chips. So the question is: is it possible to use 8 external x1 PHY
> devices (for example Philips PX1011A or TI XIO1100) to form a x8 PHY?

at least PLDA fully supports PCIe 8 lane by using 2 times quad PCIe
PHYs

Antti
http://antti-brain.com

Article: 105279
Subject: Re: corrupted data when accessing dual port bram in Cyclone II
From: "Subroto Datta" <sdatta@altera.com>
Date: Wed, 19 Jul 2006 13:32:08 GMT
Links: << >> << T >> << A >>

I recommend that you first upgrade to 6.0 SP1. If that does not fix your 
problem, contact the help desk at mysupport.altera.com. We will need your 
design and vectors to debug this case. The other things that I would look at 
is to make sure this design has the right set of timing constraints so that 
the fitter can implment the desired timing relationships.

Hope this helps,
Subroto Datta
Altera Corp.

"homoalteraiensis" <fpgaengineerfrankfurt@arcor.de> wrote in message 
news:1153303906.254358.157210@m79g2000cwm.googlegroups.com...
>I am having several dual port block rams in a cyclone II device. The
> access takes place as simple dual port option with only one clock for
> the bram but sometimes a write, when a read is done simultaneously. I
> expexted the old data to appear at the output as demanded, when I
> defined the brams from within the altera megafunction wizzard.
>
> I know about the possible bugs, when using two different clocks, as
> described in an altera technical announcement, therefore I use only one
> system clock for the rams.
>
> I observe corrupted data though! :-( It obviously is caused when the
> reading and writing processes access the same time. I am sure that I
> contraint "old data" and varified this in the vhdl-definitions created
> be the wizzard.
>
> Ideas?
>

Article: 105280
Subject: Re: OpenFire - public domain MicroBlaze clone in verilog
From: "Stephen Craven" <nevarcs@gmail.com>
Date: 19 Jul 2006 07:32:53 -0700
Links: << >> << T >> << A >>

> read-modify-write - thats not sufficent - the external memory interface must
> be able todo real byte writes as well, in the case that some stupid ip core
> is connected with 8 bit registers and side-effects on register reads. there
> are other reasons as well.

Indeed.  However, for my purposes the simple memory interface and the
reduced BRAM requirement justified the internal byte steering.  For the
MicroBlaze, RAM must be added in groups of 4 BRAMs, so that an
individual byte can be stored by selecting a specific memory.  The
read-modify-write of the OpenFire permits a single BRAM to be used.

Regardless, the OpenFire code is (hopefully) fairly easy for others to
customize to their needs.

Article: 105281
Subject: VHDL Data Buffer on Spartan-3E
From: "Alex" <alexmchale@gmail.com>
Date: 19 Jul 2006 07:48:30 -0700
Links: << >> << T >> << A >>

My application needs 4KB (32768 bits) of buffer space for incoming
data.  We are wanting to use the Spartan-3E, but we have not entirely
settled on a version of it.  Obviously, we would like to use the
cheapest package possible.

My question is if this is too much buffer to synthesize in VHDL.  If
so, what would be my best option here?  The data is written to the
buffer 8 bits at a time, and it is used 64 bits at a time.

Currently, I am creating this buffer like this:

	type ram_type is array (0 to 511) of STD_LOGIC_VECTOR(63 downto 0);
	signal data_buffer: ram_type;

However, once I connect it to the logic that uses it, it reports that
it's too large for the chip.

Is there a better way to accomplish what I am trying to here?

Thanks!

Alex McHale

Article: 105282
Subject: Re: How much time does it need to sort 1 million random 64-bit/32-bit integers?
From: "Weng Tianxiang" <wtxwtx@gmail.com>
Date: 19 Jul 2006 08:07:48 -0700
Links: << >> << T >> << A >>

Hi Jonathan,
"1. I would like to know why you get 543 compare/swap blocks?

It has (lg N)*((lg N)+1)/2 stages based on formula of Knuth Vol 3,
p.113. For N= 64, it has 21 stages.
Yes, that's where I get my figure for 21 levels of logic.
With n/2 comparators at each stage, it means it has 32 comparators at
each stage.
32*21 = 672 /= 543?
Not all levels have the maximum population of n/2 blocks. "

I read Knuth's Volumn Batcher's algorhtm and didn't find the formula
for the
number of comparators. Do you have a ready formula for it?

Thank you.

Weng

Article: 105283
Subject: Re: VHDL Data Buffer on Spartan-3E
From: Frank Buss <fb@frank-buss.de>
Date: Wed, 19 Jul 2006 17:23:48 +0200
Links: << >> << T >> << A >>

Alex wrote:

> My application needs 4KB (32768 bits) of buffer space for incoming
> data.  We are wanting to use the Spartan-3E, but we have not entirely
> settled on a version of it.  Obviously, we would like to use the
> cheapest package possible.
> 
> My question is if this is too much buffer to synthesize in VHDL.  If
> so, what would be my best option here?  The data is written to the
> buffer 8 bits at a time, and it is used 64 bits at a time.

You can use block RAM, e.g. one instance of RAMB16_S9 for 16384 bits and
some parity bits (the smallest Spartan 3E, XC3S100E, has 4 RAM blocks of
this size). See the Xilinx libraries guide for details.

-- 
Frank Buss, fb@frank-buss.de
http://www.frank-buss.de, http://www.it4-systems.de

Article: 105284
Subject: Sorting algorithm for FPGA availlable?
From: heinerlitz@gmx.de
Date: 19 Jul 2006 08:26:45 -0700
Links: << >> << T >> << A >>

Hello,

we have a Virtex4 FPGA and are looking for availlable hardware sorting
algorithms. I couldnt find anything @opencores.org however I guess that
it has been worked on this subject.

The Virtex4 supports 2 PowerPCs and lots of Block RAMs so everything
seems to be there. Are there any projects and what speeds could be
estimated for sorting 64 bit integers?

thx Heiner

Article: 105285
Subject: Re: Sorting algorithm for FPGA availlable?
From: "homoalteraiensis" <fpgaengineerfrankfurt@arcor.de>
Date: 19 Jul 2006 08:48:03 -0700
Links: << >> << T >> << A >>

I'd suggest a binary tree based sorting algorithm with destination
vectors in order to be able to sort them parallely with more than one
architecture (e.g. one per block ram) or even one per cell in my case
:-) once having found the final order.

If you have no special demands concerning the speed, you should be able
to use a software based algo on the power pc core using one of the
algos around in the net.

Article: 105286
Subject: Re: Virtex 4 ACE Compact Flash configuration problem
From: Ed McGettigan <ed.mcgettigan@xilinx.com>
Date: Wed, 19 Jul 2006 08:49:39 -0700
Links: << >> << T >> << A >>

Dan wrote:
> I have tried already all the demos and they all work fine. Problem is with a new configuration ACE file that I create with ISE 8.1 and iMPACT - even the simplest design can not configure the board.

Are you sure that you set up the JTAG chain properly in iMPACT when
you generated the ACE file?    The chain should be defined as

  XCF32P -> XC4VSX35 -> XC95144XL

Ed McGettigan
--
Xilinx Inc.

Article: 105287
Subject: Specify Clock Correction Sequence for Virtex-II ProX MGT (Rocket
From: Patrik Eriksson <no-replay@dummy.com>
Date: Wed, 19 Jul 2006 18:11:51 +0200
Links: << >> << T >> << A >>

I have a problem while specifying the clock correction sequence for the 
Virtex-II ProX MGT. The problem is that I don't find a way to make the 
clock correction function of the MGT to distinguish between 64B66B 
control blocks and data blocks. Does anyone have any experience in this?

Article: 105288
Subject: Re: corrupted data when accessing dual port bram in Cyclone II
From: "homoalteraiensis" <fpgaengineerfrankfurt@arcor.de>
Date: 19 Jul 2006 09:12:39 -0700
Links: << >> << T >> << A >>


1) "Learn about semaphores"

I know about semaphores, but according to my understanding it is a
feature of a dp ram to be accessed the same time from both sides. If
I'd implement a semaphore based access, there would be no need to use a
dual ported ram, would it? The data that appears is totally wrong,
having nothing to do with old data, as expected.

Anyway, I''ll camoe back to this issue later.


2) "Upgrade to 6.0 SP1"

Thanks, yes, I am with 6 SP1 now but the same experience. I also think,
that it is a matter of physical timing. I will do a back annotation sim
on order to find out.

Article: 105289
Subject: Re: Sorting algorithm for FPGA availlable?
From: "Eric Crabill" <eric.crabill@xilinx.com>
Date: Wed, 19 Jul 2006 10:05:15 -0700
Links: << >> << T >> << A >>

Hello,

There was a similar thread in this newsgroup.  If you are looking to solve a
specific problem and have performance targets, you should share those -- it
will help people give you more focused answers.

You do not need PowerPCs or BlockRAMs to sort integers.  Sure, you can write
a sequential sort algorithm and run it on a processor, but if you are
interested in performance, I think that approach may be one of the last
things you would want to do.  If you are interested in two examlpes of a
hardware sort, take a look at:

http://www.engr.sjsu.edu/crabill/module05.pdf

There is a bubble sort and an odd-even transposition sort.  The examples
sort a set of five, 16-bit values.  They could certainly be extended to
handle fewer/more and/or wider/narrower inputs.  One example is pipelined,
the pipeline depth can certainly be changed.  In fact, you can turn it into
a combinational sorter if you want.

If anyone expands the examlpes to sort 1 million 64-bit integers, please let
me know what your synthesis tool thinks about it.

Certainly, I am not advertising the examples as "the best" and I am no
"expert" in sorting.  Or anything.  But they might be a place for you to
start.  I only knew the bubble sort, but used Google to search for "parallel
sorting algorithms" because I figured there would be one which I could
implement in hardware.  The odd-even transposition sort is indeed quite
easy.

Eric

> we have a Virtex4 FPGA and are looking for availlable hardware sorting
> algorithms. I couldnt find anything @opencores.org however I guess that
> it has been worked on this subject.
>
> The Virtex4 supports 2 PowerPCs and lots of Block RAMs so everything
> seems to be there. Are there any projects and what speeds could be
> estimated for sorting 64 bit integers?

Article: 105290
Subject: Re: Reuse a Speed Grade -8 Stratix image in Speed Grade -6 ...?
From: "Subroto Datta" <sdatta@altera.com>
Date: 19 Jul 2006 10:07:45 -0700
Links: << >> << T >> << A >>

Faster speed grade parts will work in almost all scenarios. While min
timing analysis should be conducted to ensure there are no timing
problems, likelihood is extremely high that faster speed grade parts
will work. To do min timing analysis from the UI use
Processing->Start->Start Timing Analyzer (Fast Timing Model).

Hope this helps
- Subroto Datta
Altera Corp.

Jesper.Kristensen@tellabs.com wrote:
> Hello Group.
>
> Quite simple question... but I cannot seem to be able to find the
> answer:
>
> Given a Code Image initially targeted for a Stratix EP1S40F780C8 - i.e.
> Speed Grade "-8" - would it then be without troubles to load and run
> that Image on the corresponding "-6" device...?
>
> The need for the question arises in the crossfire of device
> availability and the wish of having only one Coding Image to update.
> 
> Thanks in advance & best regards,
> 
>   Jesper.

Article: 105291
Subject: Re: Virtex 4 ACE Compact Flash configuration problem
From: Dan <dan_mr@hotmail.com>
Date: Wed, 19 Jul 2006 10:20:53 -0700
Links: << >> << T >> << A >>

Ed McGettigan wrote:

      Are you sure that you set up the >JTAG chain properly in iMPACT when you
      generated the ACE file? The >chain should be defined as





      XCF32P -> XC4VSX35 -> XC95144XL




How do I set up the JTAG chain in iMPACT? What I see on the iMPACT is:

xc4vsx35 myfile.bit -> XCCACE-TQ144I

Article: 105292
Subject: Re: Synthesis Problems with Quartus II Version 6.x
From: "Derek Simmons" <dereks314@gmail.com>
Date: 19 Jul 2006 10:26:46 -0700
Links: << >> << T >> << A >>

Have you installed service pack 1 for version 6? If you haven't, tech
support will prbably have you do this first.

I haven't used 6 yet but, when I upgraded from 4 to 5, I had the
optimizer reduce a state machine to nothing because the newer version
identified an ambiguous state and poof. I'm not saying this is your
problem but examine your project settings, see if it is doing anything
differently. Put an emphasis on optimizing. And look at the warnings

Derek

Article: 105293
Subject: Re: Which PCI core for Cyclone II board?
From: "Eric Crabill" <eric.crabill@xilinx.com>
Date: Wed, 19 Jul 2006 10:32:07 -0700
Links: << >> << T >> << A >>

Hi Brian,

An important consideration regarding the PCI bus (say, in a desktop PC) is
that you generally cannot make any guarantees about bandwidth.  There are
too many things outside of your control -- the O/S, other devices using the
bus, the performance capabilities of the host bridge device...

That "133 MBytes/sec for PCI32/33" is the physical bus bandwidth -- how much
data could possibly fly over the bus.  It is not an indicator of what your
device may realize in a typical system.

If you need performance, and your host is going to be x86 based with a
commodity chipset (certainly not "perfect"), you absolutely must include
some kind of bus mastering capability.  For all practical purposes, it will
be very difficult to get any kind of data burst initiated by the CPU to your
device target.  For that reason, a target will not suffice unless you are
building something that only needs a dribble of data -- a standard serial
port, or an LED flasher.  :)

Eric

> This is my experience with anything PCI related, so I'm still not very
> clear on whether I can get away with a target, or if I'll need to
> mastering/DMA capabilities.  I know that theoretically, the max tranfer
> rate of the bus is 133 MB/s w/ 33MHz systems.  The intended customer of
> this thing is going for low cost, so I'm not going to assume that it
> will be used with a computer that supports 66Mhz or 64-bit transfers.
> Ideally, I would like to be able to guarantee 54 MB/s with pretty much
> equal I/O rates (27MB/s into and out of the device). Most of the time,
> rates should be lower than that, but just about the max I could ever
> need it to be.  This card will probably be the only thing on the PCI
> bus of the PC it's plugged into, and the primary purpose of the PC will
> be to process the data to & from this device, so having moderately high
> CPU usage requirements is not much of a concern.
>

Article: 105294
Subject: Re: Which PCI core for Cyclone II board?
From: "bart" <bart.borosky@latticesemi.com>
Date: 19 Jul 2006 10:47:12 -0700
Links: << >> << T >> << A >>

I am no lawyer, but it my understanding that the Lattice reference
design is intended for use on Lattice devices. The license agreement
says something like: "for the sole purpose of programming Lattice
programmable logic devices."

If you use the Lattice PCI reference design, it sounds like you should
use the LatticeECP2 device or another Lattice FPGA, a list of which you
can find here:
http://www.latticesemi.com/products/fpga/index.cfm

Hope this helps.
Bart Borosky, Lattice

Antti wrote:
> > but do you mean that at lattice they have a downloadable pci ip
> > retargettable to any fpga? (i.e  rtl code?) if yes could put here the link?
> >
> > Antti Lukats ha scritto:
>
> sure get it here
http://www.latticesemi.com/products/intellectualproperty/referencedesigns/pcitarget32bit33mhz.cfm
> 
> 100% verilog source included!
> 
> Antti
> http://antti-brain.com

Article: 105295
Subject: Re: Which PCI core for Cyclone II board?
From: "Antti Lukats" <antti@openchip.org>
Date: Wed, 19 Jul 2006 20:04:40 +0200
Links: << >> << T >> << A >>


"bart" <bart.borosky@latticesemi.com> schrieb im Newsbeitrag 
news:1153331232.004936.238120@p79g2000cwp.googlegroups.com...
>I am no lawyer, but it my understanding that the Lattice reference
> design is intended for use on Lattice devices. The license agreement
> says something like: "for the sole purpose of programming Lattice
> programmable logic devices."
>
> If you use the Lattice PCI reference design, it sounds like you should
> use the LatticeECP2 device or another Lattice FPGA, a list of which you
> can find here:
> http://www.latticesemi.com/products/fpga/index.cfm
>
>
> Hope this helps.
> Bart Borosky, Lattice
>
Hi Bart,

well yes -- everyone should read the license of course.

I would gladly use that lattice pci core on lattice boards, and hopefully 
one day I will - but as I dont have any lattice-pci fpga boards so I have 
evaluated the lattice-pci core on different PCI boards.

[snip - self censoring, the deleted text goes to lattice in private]

all the use of it I have ever done is fpga board initial testing - nothing 
more.

I have never considered using it in any products based on non-Lattice 
silicon (because of the license)

but - the fact that this core is available - it brings some attention to 
Lattice if someone mentions it. I should have mentioned the license clause 
of course - but the link I provided did land on the license agreement and 
not direct download.

Antti

Article: 105296
Subject: Re: Virtex 4 ACE Compact Flash configuration problem
From: Dan <dan_mr@hotmail.com>
Date: Wed, 19 Jul 2006 11:11:42 -0700
Links: << >> << T >> << A >>

I also tried to edit svf file before converting it to ace with svf2ace117.exe and the red led 'err' was turned on again. But when I converted the edited svf file with -s option (skip TDO test) then finally the red led 'err' was not on, neither was the green led 'done'. It didn't work again.

Article: 105297
Subject: Re: corrupted data when accessing dual port bram in Cyclone II
From: "KJ" <Kevin.Jennings@Unisys.com>
Date: 19 Jul 2006 11:48:31 -0700
Links: << >> << T >> << A >>

homoalteraiensis wrote:
> That is the point. Modelsim shows a well behavior. Nothing wrong.

And the Quartus timing analysis shows no problems either I'm assuming.

> The error only shows up in real and signal tap analysis.

Then open a service request to Altera.

> I already changed
> the timing (slower). The only way to shun this problem is, to disallow
> simultanous access, which in fact lowers performance.

Or go to Altera to find out what the issue is, and if there are any
work arounds.

I haven't used Cyclone but I've used Stratix and Stratix II dual port
memories extensively with a single clock without any issues.

KJ

Article: 105298
Subject: xess board problem (error downloading into ram)
From: "Subhasri krishnan" <subhasri.krishnan@gmail.com>
Date: 19 Jul 2006 12:00:51 -0700
Links: << >> << T >> << A >>

Hi all,
I downloaded code for a vga generator from xess design examples. I just
want to verify that my board is functioning properly. The vga generator
should display a frame from the sdram which is preloaded using GXSLOAD.

I loaded the image.xes file and the test_vga.bit onto the board. As
soon as the ramintfc.bit file (sdram|fpga interface file) is loaded, i
get the following error.

Major Error : Error Downloading into the RAM!!
An error occured while downloading to the RAM?

I click 'Continue' and now the test_vga.bit is loaded into the fpga.
The image displayed is not what is expected.
Has anyone seen this error before?
Any help is appreciated.
Thanks,
Subhasri.K

Article: 105299
Subject: Re: Virtex 4 ACE Compact Flash configuration problem
From: Dan <dan_mr@hotmail.com>
Date: Wed, 19 Jul 2006 12:04:37 -0700
Links: << >> << T >> << A >>

I forgot to say that I edited the svf file according to:

 <http://www.xilinx.com/xlnx/xil_ans_display.jsp?iLanguageID=1&iCountryID=1&getPagePath=23146>

The issue is caused by the lack of a proper time delay after the JPROGRAM instruction is issued to the device.

//NEW CODE SEGMENT: Loading device with a 'bypass' instruction SIR 14 TDI (3fff) SMASK (3fff) ; RUNTEST 434000 TCK ;

here I changed TCK to 2434000 - made it longer because I am programming Virtex 4 SX35.

Site Home Archive Home FAQ Home How to search the Archive How to Navigate the Archive
Compare FPGA features and resources

Authors:A B C D E F G H I J K L M N O P Q R S T U V W X Y Z

Custom Search