algorithm question..

holycow · Apr 11, 2002

hi,

i have some question about algorithm..

for example if i wrote a function in c++ that takes in a string and replace every characters from lower case to upper case.. what i would do is replacing every single character to upper case one by one using a for loop.. the run time of my function is O(n) where n is the number of characters in ths input string..

how do i make this algorithm to run twice faster?

CodeJockey · Apr 11, 2002

Run it on a system that is twice as fast as your current one.

Just kidding. I think O(n) is about as fast as such an algorithm can get (unless you can work out a way of having it reliably operate upon two or four characters at a time (in which case your for-loop will do fewer iterations, and you get close to O(n/2) or O(n/4).

This would be tricky with null-terminated strings, since you don't want to uppercase the null-terminator, nor any characters after it (you can't be certain that you actually own those bytes of memory). The code needed within the loop to check for these conditions might very well erase any performance gains you get from processing the string in larger chunks.

It is usually not worthwhile trying to squeeze extra performance from an algorithm unless it is O(2n), O(n^2), or worse.

Armitage · Apr 11, 2002

<< Run it on a system that is twice as fast as your current one.

Just kidding. I think O(n) is about as fast as such an algorithm can get (unless you can work out a way of having it reliably operate upon two or four characters at a time (in which case your for-loop will do fewer iterations, and you get close to O(n/2) or O(n/4). >>

You could probably use SIMD, in particular MMX to accomplish this.
Just load the data into the registers, and add or subtract the difference in ASCII representation.
At least that's my impression of how SIMD works ... I haven't actually gotten around to playing with it myself

<< This would be tricky with null-terminated strings, since you don't want to uppercase the null-terminator, nor any characters after it (you can't be certain that you actually own those bytes of memory). The code needed within the loop to check for these conditions might very well erase any performance gains you get from processing the string in larger chunks. >>

This wouldn't be to bad. You know how long your string is (N), and you know how many characters you can process simultaneously (n). So you could do something like this:

N = strlen(str)
for(i=0; i<N; i+=n)
do_simultaneous_alg(str + i)

i -= n
for(j=i; j<N; j++)
do_single_alg(str + j)

You're setting up an extra loop, so it's a bit more overhead.
If your N >> n and the simultaneous algorithm is significantly faster then the single algorithm, then this should give you an overall boost.

<< It is usually not worthwhile trying to squeeze extra performance from an algorithm unless it is O(2n), O(n^2), or worse. >>

Yea, I can't really see going to all this trouble to switch case in a string faster

Argo · Apr 11, 2002

There isn't really a good way of doing this. With algorithms, your goal rarely is to make it run twice faster. Most of the time, you try to make it run faster by an order of magnitude. Otherwise, with today's computer speeds it's not really worth it.

Armitage · Apr 11, 2002

<< There isn't really a good way of doing this. With algorithms, your goal rarely is to make it run twice faster. Most of the time, you try to make it run faster by an order of magnitude. Otherwise, with today's computer speeds it's not really worth it. >>

I dunno ... I have some codes that I'd be very happy to run twice as fast. 1 week instead of two weeks would be great

Anything a faster processor can give me on top of that is just more gravy!

EagleKeeper · Apr 11, 2002

if it is a NULL terminated string then testing for NULL in a while loop and ignoring the string length will be a fraction faster.
I suspect that the speed gain will not be noticable by a human

O(char *string)
{
char *p = string;
while (*p != 0x00)
{
// if (*p is lowercase)
// convert to uppercase

p++;
}

CodeJockey · Apr 11, 2002

<< if it is a NULL terminated string then testing for NULL in a while loop and ignoring the string length will be a fraction faster. >>

Exactly. The strlen() function itself must iterate through the string, counting characters, until it finds the NULL terminator. Therefore, by eliminating or avoiding the use of strlen() you are speeding up the overall algorithm.

Speed improvements in the code are usually a good thing. The exception would be if it makes the code so complex that it cannot be maintained anymore.

Search

algorithm question..

holycow

Senior member

CodeJockey

Member

Armitage

Banned

Argo

Lifer

Armitage

Banned

EagleKeeper

Discussion Club Moderator<br>Elite Member

CodeJockey

Member

TRENDING THREADS