|
|
 |
|
04-24-2012, 04:59 PM
|
#26
|
|
Senior Member
Join Date: Aug 2008
Posts: 958
|
Quote:
Originally Posted by ShintaiDK
Thats very misleading. Since the GPUs FP units can barely handle 5% of the code that the CPUs FP units can. Hence also why its so relatively easy to make fast. The bar is simply low enough.
This is also why GPGPU technologies like CUDA and OpenCL is a niche and will always be so.
|
I don't understand this. What is "code" here? What kind of FP computations can you do on a CPU, but cannot do via CUDA/OpenCL on a GPU, performance considerations aside?
I don't see how it's misleading: QuickSync doesn't try nor purport to do anything general, it's only for video transcoding. Unlike stream processors, even if they are often a very poor choice.
|
|
|
04-24-2012, 05:22 PM
|
#27
|
|
Diamond Member
Join Date: Dec 2009
Location: Denmark
Posts: 3,929
|
Quote:
Originally Posted by ShintaiDK
Wierd isnt it. When Llano already supports it?
Must be a slow newsday....
|
It is...IVB launch is over.
BTW:
Jeg ved hvem du er 
J..... ....
HLJ
FR
.oO(Ellers er sammentræffet ret griner...)
__________________
Don't get my personality confused with my attitude.
My personality is who I am...my attitude depends on who YOU are ^_^
990X supporting Titan...socket 1366 gone mean ^^
|
|
|
04-24-2012, 05:30 PM
|
#28
|
|
Diamond Member
Join Date: Apr 2012
Location: Copenhagen
Posts: 5,917
|
Quote:
Originally Posted by Lonbjerg
It is...IVB launch is over.
BTW:
Jeg ved hvem du er 
J..... ....
HLJ
FR
.oO(Ellers er sammentræffet ret griner...)
|
Haha Hej
__________________
MiniITX - Intel i5 4670
Board - Intel DH87FB
SSD - Crucial M500 480GB mSATA
Memory - Crucial Ballistix Sport 2x8GB 1600Mhz 1.35V
Case - Sugo SG08B with 600W PSU
GPU - Zotac GTX 680 2GB
|
|
|
04-24-2012, 05:32 PM
|
#29
|
|
Diamond Member
Join Date: Dec 2009
Location: Denmark
Posts: 3,929
|
Quote:
Originally Posted by ShintaiDK
Haha Hej 
|
Welcome to ATF
__________________
Don't get my personality confused with my attitude.
My personality is who I am...my attitude depends on who YOU are ^_^
990X supporting Titan...socket 1366 gone mean ^^
|
|
|
04-24-2012, 05:37 PM
|
#30
|
|
Diamond Member
Join Date: Apr 2012
Location: Copenhagen
Posts: 5,917
|
Quote:
Originally Posted by iCyborg
I don't understand this. What is "code" here? What kind of FP computations can you do on a CPU, but cannot do via CUDA/OpenCL on a GPU, performance considerations aside?
I don't see how it's misleading: QuickSync doesn't try nor purport to do anything general, it's only for video transcoding. Unlike stream processors, even if they are often a very poor choice.
|
The CPU can do everything. This is basicly also what makes it "slow". Where the GPU is very limited in the abilities and what instructions it can execute. There is also another issue. 1 flop aint equal to 1 flop either. The GPU might need 4 flops to accomplish what the CPU handles in 1 flop.
Take a look on the OpenCL programming guide for nVidia:
http://www.nvidia.com/content/cudazo...mmingGuide.pdf
Then compare it to a regular one for x86/x64.
__________________
MiniITX - Intel i5 4670
Board - Intel DH87FB
SSD - Crucial M500 480GB mSATA
Memory - Crucial Ballistix Sport 2x8GB 1600Mhz 1.35V
Case - Sugo SG08B with 600W PSU
GPU - Zotac GTX 680 2GB
|
|
|
04-24-2012, 05:38 PM
|
#31
|
|
Lifer
Join Date: Aug 2000
Posts: 12,285
|
Quote:
Originally Posted by Phynaz
So any OpenCL capable GPU. Hardly an AMD coup.
|
Very good for AMD, but no, not nearly a coup. Adobe even said awhile back that they went with CUDA because there was nothing else, and would use OpenCL for future versions.
__________________
"The computer can't tell you the emotional story. It can give you the exact mathematical design, but what's missing is the eyebrows." - Frank Zappa
|
|
|
04-24-2012, 05:58 PM
|
#32
|
|
Senior Member
Join Date: Aug 2008
Posts: 958
|
Quote:
Originally Posted by ShintaiDK
The CPU can do everything. This is basicly also what makes it "slow". Where the GPU is very limited in the abilities and what instructions it can execute. There is also another issue. 1 flop aint equal to 1 flop either. The GPU might need 4 flops to accomplish what the CPU handles in 1 flop.
Take a look on the OpenCL programming guide for nVidia:
http://www.nvidia.com/content/cudazo...mmingGuide.pdf
Then compare it to a regular one for x86/x64. 
|
Instruction set has nothing to do with this. Or does this mean Pentium 1 is not "general purpose" any more, because it lacks many instructions that IB/Bulldozer have? Or ARM? Also, GPUs have some instructions that not all CPUs have as of now, like FMA.
The point is that stream processors in GPUs are designed so they can also be used for many different purposes. In a Turing-complete sense, GPUs are equivalent to CPUs in terms of computability, i.e. they too can do everything. QuickSync was designed for one purpose and is not a step forward in GPGPU. It's a step forward in video transcoding.
|
|
|
04-24-2012, 06:03 PM
|
#33
|
|
Diamond Member
Join Date: Jan 2010
Location: Mejico
Posts: 8,671
|
Quote:
Originally Posted by Olikan
it's just some cuda-related stuff... nothing really important here (exept for another bad news for nvidia)
|
Some people use "cuda-stuff", really expensive "cuda-stuff". Can't wait to see what my 6870s can in Premiere Pro once my company upgrades to CS6.
__________________
AMD X6 1090T @ 3.8GHz Cooled by CM Hyper212+ P/P | Sapphire AMD HD6870 Vapor-X Crossfire @ 930/1150 | Gigabyte 990XA-UD3 | 8GBs Patriot G2 DDR3-1600Mhz | Corsair FORCE 3 90GB SSD | 1TB WD RAID0 | Antec HCG-750 | CM HAF 932
Lonely HeatWare
|
|
|
04-24-2012, 07:33 PM
|
#34
|
|
Golden Member
Join Date: May 2004
Location: Kent, Ohio
Posts: 1,390
|
Quote:
Originally Posted by Lonbjerg
Well...then it's time to benchmark CUDA vs OpenCL in Adobe...and get the full picture.
Because that is what matters right?
Performance.
|
....and glide offered better performance than opengl and directx....As long as performance is "good enough" getting this out to the full market rather than just one vendor will be a big step forward.
|
|
|
04-24-2012, 08:12 PM
|
#35
|
|
Diamond Member
Join Date: Apr 2012
Location: Copenhagen
Posts: 5,917
|
Quote:
Originally Posted by iCyborg
Instruction set has nothing to do with this. Or does this mean Pentium 1 is not "general purpose" any more, because it lacks many instructions that IB/Bulldozer have? Or ARM? Also, GPUs have some instructions that not all CPUs have as of now, like FMA.
The point is that stream processors in GPUs are designed so they can also be used for many different purposes. In a Turing-complete sense, GPUs are equivalent to CPUs in terms of computability, i.e. they too can do everything. QuickSync was designed for one purpose and is not a step forward in GPGPU. It's a step forward in video transcoding.
|
I think you should read the programming guide.
__________________
MiniITX - Intel i5 4670
Board - Intel DH87FB
SSD - Crucial M500 480GB mSATA
Memory - Crucial Ballistix Sport 2x8GB 1600Mhz 1.35V
Case - Sugo SG08B with 600W PSU
GPU - Zotac GTX 680 2GB
|
|
|
04-24-2012, 09:31 PM
|
#36
|
|
Senior Member
Join Date: Aug 2008
Posts: 958
|
Quote:
Originally Posted by ShintaiDK
I think you should read the programming guide.
|
That's your argument? Go read a 60-page document?
Here's a counterargument: I think you should read the x86/x64 guide:
http://www.intel.com/content/www/us/...r-manuals.html
|
|
|
04-25-2012, 06:02 AM
|
#37
|
|
Member
Join Date: Dec 2011
Location: PRC
Posts: 105
|
Quote:
Originally Posted by ShintaiDK
The CPU can do everything. This is basicly also what makes it "slow". Where the GPU is very limited in the abilities and what instructions it can execute. There is also another issue. 1 flop aint equal to 1 flop either. The GPU might need 4 flops to accomplish what the CPU handles in 1 flop.
Take a look on the OpenCL programming guide for nVidia:
http://www.nvidia.com/content/cudazo...mmingGuide.pdf
Then compare it to a regular one for x86/x64. 
|
This is merely an old programming guide...The newest should be version 4.0
BTW version 2.3 was..I guess for GT200.
__________________
There is no fate but what we make for ourselves.
|
|
|
04-25-2012, 06:06 AM
|
#38
|
|
Member
Join Date: Dec 2011
Location: PRC
Posts: 105
|
And...I do wonder only on the arithmetic level whether there is something GPU can't "do".
At least we get Taylor Series don't we guys?
Well, efficiency & speed must be a problem.
If I remember it correctly the INT shift instruction in GK104 runs about 1/6 of the FP SP speed
__________________
There is no fate but what we make for ourselves.
|
|
|
04-25-2012, 06:57 PM
|
#39
|
|
Diamond Member
Join Date: Dec 2009
Location: Denmark
Posts: 3,929
|
Quote:
Originally Posted by nismotigerwvu
....and glide offered better performance than opengl and directx....As long as performance is "good enough" getting this out to the full market rather than just one vendor will be a big step forward.
|
Glide failed to to 3Dfx's hardware circus, that took out not only glide...but also VooDoo graphics chips...nice fail
__________________
Don't get my personality confused with my attitude.
My personality is who I am...my attitude depends on who YOU are ^_^
990X supporting Titan...socket 1366 gone mean ^^
|
|
|
04-25-2012, 09:57 PM
|
#40
|
|
Golden Member
Join Date: May 2004
Location: Kent, Ohio
Posts: 1,390
|
Actually glide started losing it's grip on the tail end of the Voodoo 2 era, which was unquestionably the fastest card still at the time. Nice attempted troll though.
|
|
|
04-25-2012, 09:59 PM
|
#41
|
|
Golden Member
Join Date: May 2004
Location: Kent, Ohio
Posts: 1,390
|
Go ahead and flip through http://www.anandtech.com/show/197 to refresh your memory
|
|
|
04-26-2012, 02:50 AM
|
#42
|
|
Diamond Member
Join Date: Feb 2009
Location: Athens Greece
Posts: 3,693
|
|
|
|
05-04-2012, 02:35 PM
|
#43
|
|
Junior Member
Join Date: Dec 2001
Posts: 1
|
Open CL and the Mercury Playback Engine
I found this thread, http://forums.adobe.com/thread/994892 on the Adobe site. Here is the reason OpenCL isn't implemented yet.
"David McGavran here, engineering manager for Premiere Pro. You can stop speculating. Porting MPE to OpenCL was hard, very hard! It took an entire cycle to get it to work. It isn't just about the engineering effort it is about the testing effort and making sure it will work. We worked very closely with partners to make this happen. We focused on the mac first because there aren't any NVidia cards in the apple store configurations for our customers here. We focused on the macbooks because many of our customers use these products. We developed the OpenCL code so that it can in the future possibly work on windows. We are very interested in making sure our product uses all the compute power that your computer has. No conspiracy theories just time and effort and balancing all the work we need to get done so that you can have the product you want to use to make great video.
Hope that helps
Dave"
I'll wait a while before I select a gpu. I would guess at cs6.5 there's an openCL for windows.
|
|
|
05-04-2012, 02:37 PM
|
#44
|
|
Diamond Member
Join Date: Oct 1999
Posts: 4,642
|
10.3 years, 1 post.
nice.
|
|
|
05-04-2012, 03:01 PM
|
#45
|
|
Senior Member
Join Date: May 2005
Posts: 938
|
He's wins the Ultimate Lurker Award. lol
|
|
|
05-05-2012, 11:27 AM
|
#46
|
|
Diamond Member
Join Date: Jan 2005
Posts: 5,378
|
Quote:
Originally Posted by grimpr
AMDs APP SDK is seriously ahead of Intels software stack. It even supports AVX in compatible CPUs.

|
I thin the interesting thing here is that the Intel cpus are so competitive with basically everything except AMD's most recent cards, in a task that's extremely parallelizable.
|
|
|
Posting Rules
|
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts
HTML code is Off
|
|
|
All times are GMT -5. The time now is 05:37 AM.
|