800 TFLOP real time ray tracing GPU

Modelworks

Lifer
Feb 22, 2007
16,240
7
76
I would love to have that setup. Maybe in 10 years we can get something like that in the home :)

There is a picture at the link with more details on how it all works.
http://techon.nikkeibp.co.jp/a...090629/172373/fig1.jpg

http://arstechnica.com/hardwar...led-not-for-gamers.ars
A Japanese company has announced a massive, 800 teraflop real-time ray tracing (RTRT) system that gangs together nine, 73-core chips into a single system that fits inside a desktop computer form factor. The new chip, which is being jointly developed with Toyota and Unisys, is aimed at the auto industry, where designers will use it to prototype body designs and paint combinations.

As for how this system works, there are currently only two sources of information: a Japanese description on the website of the chip's maker, TOPS Systems Corporation, and a Nikkei article in English that's presumably a summary of the Japanese original. Given the paucity of information and the relative shallowness of my technical knowledge of ray tracing, I'll give my best shot at explaining this system and putting it in context, and I'll invite others to weigh in with more info in the comments thread.

As I noted above, the overall system consists of nine identical, 45nm ASICs ganged together via some unspecified interconnect scheme. Each individual ASIC consists of nine compute clusters connected to one another through a shared bus. (See this excellent diagram from Nikkei.) This bus also hosts a 64-bit RISC master controller that presumably takes in work in batches and assigns it to the other eight cores, which then do the grunt work of computing the rays; there are also I/O and memory interfaces attached to this shared bus, which link the chip to the rest of the system.

There are a few things that are interesting about these clusters, one of which is pictured below. First, you'll notice that each cluster is made of eight heterogeneous cores, each of which is supposed to handle on part of the ray tracing algorithm.


The heterogeneous cores are connected by a high-bandwidth, three-bus link (system bus, data bus, and instruction bus), which lets a job move in stages from one core to the next. Clearly, this is a pipeline setup, with one core per stage, and in this respect the ASIC is very much a ray-tracing GPU?analogous to the fixed-function GPUs of yesteryear, which had custom hardware blocks dedicated to each stage of the rasterization pipeline.

The fact that this is a ray-tracing GPU has very important implications for the part's future in gaming. To wit, it has no such future. But more on that in a moment.

The second interesting thing about this system is that it addresses RTRT as a compute problem, instead of as a data management problem like the much less ambitious Caustic Graphics solution. You'll recall that Caustic's solution relies on the traditional GPU to do the computational heavy lifting, with the Caustic board accelerating the data lookup part of the problem. The TOPS design, in contrast, is more traditional brute force, multicore plus caches solution whose main novel twist is this one-core-per-pipeline-stage idea.

There's a reason that the other plans for a hardware-based RTRT solution have been homogeneous multicore designs that throw bandwidth and highly parallel math hardware at the problem, and that's the fact that you can actually repurpose a homogeneous design for other applications in different verticals, thereby gaining the sales volume needed to make producing the IC profitable. This brings me to the reason why you shouldn't plan on using a successor to this chip in a computer game.
 

OCNewbie

Diamond Member
Jul 18, 2000
7,596
25
81
I'm just wondering how long it'll take for us to reflect back on that article and laugh at how we once thought that was impressive.
 

Idontcare

Elite Member
Oct 10, 1999
21,110
64
91
system that gangs together nine, 73-core chips into a single system that fits inside a desktop computer form factor.

Seems kind of a stretch to call this a "GPU" in the traditional use of the term.

A graphics processing system or farm might be the more applicable description.
 

MarcVenice

Moderator Emeritus <br>
Apr 2, 2007
5,664
0
0
I think the right way to describe it, is that it behaves as a gpu, rendering pictures in real time. The fact it takes up a whole midtower and consumes 1000W is just a bonus :p
 

firewolfsm

Golden Member
Oct 16, 2005
1,848
29
91
800TFlops is still very far ahead of what any single GPU can accomplish. It should take at least 10 years before that happens.
 

arkcom

Golden Member
Mar 25, 2003
1,816
0
76
Originally posted by: MarcVenice
I think the right way to describe it, is that it behaves as a gpu, rendering pictures in real time. The fact it takes up a whole midtower and consumes 1000W is just a bonus :p

1000w is pretty impressive considering how powerful it is.
 

MarcVenice

Moderator Emeritus <br>
Apr 2, 2007
5,664
0
0
Don't forget it's a HIGHLY specialised asic. It can only do ray-tracing and that's it. Theoretically each of the 9 chips would have a little less then 100tflops of computing power. That's still 100 times more then a HD 4870. You can't compare them though, because the HD 4870 can't do ray-tracing (or rly slow) but the 800tflops monster can't do anything the hd 4870 can.
 

Modelworks

Lifer
Feb 22, 2007
16,240
7
76
The best I can do with a single quad core cpu is about 8 minutes for a single scene at 1024x768 resolution. With a quadroplex that drops to 3.5 minutes. We will see 800Tflop gpu one day in home pc maybe. I just can't imagine playing games that use rendering engines with the quality of something like vray in real time.
 
Apr 20, 2008
10,067
990
126
Originally posted by: Modelworks
The best I can do with a single quad core cpu is about 8 minutes for a single scene at 1024x768 resolution. With a quadroplex that drops to 3.5 minutes. We will see 800Tflop gpu one day in home pc maybe. I just can't imagine playing games that use rendering engines with the quality of something like vray in real time.

When i was a young boy seeing my family's computer with a massive 4GB HDD i looked it up to see what was next after Giga and dreamed of a Terabyte HDD. Well, I'm only 20.

:-/
 

thilanliyan

Lifer
Jun 21, 2005
12,060
2,273
126
Originally posted by: Scholzpdx
When i was a young boy seeing my family's computer with a massive 4GB HDD i looked it up to see what was next after Giga and dreamed of a Terabyte HDD. Well, I'm only 20.

:-/

4GB!!! That's HUGE!! I started off with a 130mb (yes that's megabyte) HDD and 4mb of RAM back in '92 and my family paid CAD$2500 for that computer. How times change. :)