This problem has been tormenting us for over a year now, and nobody can find a solution. I have tried many things myself. We also have an external IT service provider who manages our network and servers. Three of their employees have spent hours and hours trying to solve the problem, at considerable cost, and it was all for nothing.
And this is what happens:
Applications often just stop working; the window fades out and the cursor turns into the rotating circle indicating that the computer is busy. Sometimes the computer returns to normal after a really long time (several minutes), but usually nothing can be done except killing the application. Often even that is impossible and the PC has to be rebooted. The application is usually Word or Excel, but it can also be Firefox or our email software Tobit David, so I don't think it's an MS Office thing. When the application hangs and you open "My Computer", the contents usually do not load, as if the network share were unavailable. The folder window usually shows the green loading bar indefinitely.
This happens mostly when loading or saving, though occasionally it happens when you just open a folder or sort the contents of one or just look at it. But it ONLY happens when the user is working with files on our network share. It NEVER happens when the user has only files opened that are on local drives. The network share is located on the domain controller.
Now more details:
As I said, the problem occurs when the user is working with files on the network share, but it does not affect all users, and the users it does affect are not all affected at the same time.
Some of our PCs are not affected at all; they always work fine. These are two older PCs that run XP and Vista, respectively. We also have a couple of DELL laptops running Win 7 or Vista, and they never hang either, even when the affected PCs are having the issue. So it's not a general problem with the network share.
The PCs that are affected have all been bought from our IT provider, but over a period of several years. They have different mainboards and NICs. All of them run Win 7, some 32-bit, some 64-bit. Some of them have been with us for years and ran smoothly until about 14-15 months ago, when the trouble started.
Sometimes several PCs have the problem at once or in short succession. Other times, one user can work a whole day without problems while another can hardly work at all because the PC hangs every time he tries to save his work. The next day it might be reversed. Some days nobody has a problem at all. We even had periods of a whole week without the problem, and then it was back worse than ever. We found no pattern, such as the same colleague always being on vacation when it didn't happen.
So we have no way to reliably reproduce the error. Many times we hoped we had found the cure because we changed something and the problem did not occur for several days, but then it was back.
More details about the hardware and network:
In the long time we have had this issue, a lot of our hardware has changed without any noticeable effect on the problem: Our network switch broke and we replaced it. We moved our whole business to another building, so all the cabling is brand new. I replaced several of the NICs in the PCs. We even have a new Lancom router. We also bought some new PCs; all of them are affected, but so are several PCs we already had before.
The physical network connection seems to be perfectly fine. For testing we had a ping to the server running on the affected machines. Even when the PC started to hang, the ping continued smoothly with <1ms latency.
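Since ping only tests the network layer, a file-level probe on the share might show whether file operations stall while ping stays fine. Below is a minimal sketch in Python of what such a probe could look like. It is not something we have run; the function names, the 4 KB payload, and the timeout values are all made up for illustration, and `share_dir` stands for whatever folder on the network share is used for testing.

```python
import os
import threading
import time
import uuid


def probe_once(share_dir):
    """Write, read back and delete a small file; return elapsed seconds."""
    # Hypothetical file name; a random suffix avoids clashes between PCs.
    path = os.path.join(share_dir, "probe_%s.tmp" % uuid.uuid4().hex)
    payload = b"x" * 4096  # arbitrary small payload (assumption)
    start = time.perf_counter()
    with open(path, "wb") as f:
        f.write(payload)
        f.flush()
        os.fsync(f.fileno())  # ask the OS to push the data out
    with open(path, "rb") as f:
        data = f.read()
    os.remove(path)
    if data != payload:
        raise OSError("read-back mismatch")
    return time.perf_counter() - start


def probe_with_timeout(share_dir, timeout_s):
    """Run probe_once in a worker thread; return (status, seconds).

    status is "ok", "error" or "timeout". On "timeout" the worker thread
    may still be stuck in the file operation; it is a daemon thread, so
    it does not keep the script alive.
    """
    result = {}

    def worker():
        try:
            result["elapsed"] = probe_once(share_dir)
        except OSError as exc:
            result["error"] = str(exc)

    thread = threading.Thread(target=worker, daemon=True)
    start = time.perf_counter()
    thread.start()
    thread.join(timeout_s)
    waited = time.perf_counter() - start
    if thread.is_alive():
        return ("timeout", waited)
    if "error" in result:
        return ("error", waited)
    return ("ok", result["elapsed"])


def monitor(share_dir, rounds, interval_s=5.0, timeout_s=10.0):
    """Probe repeatedly; print and return (timestamp, status, seconds)."""
    log = []
    for _ in range(rounds):
        status, secs = probe_with_timeout(share_dir, timeout_s)
        stamp = time.strftime("%Y-%m-%d %H:%M:%S")
        log.append((stamp, status, secs))
        print("%s  %-7s  %.3f s" % (stamp, status, secs))
        time.sleep(interval_s)
    return log
```

Running something like `monitor(share_dir, rounds=1000)` on an affected PC and on an unaffected one at the same time would give timestamps that can be compared against the moments users report a hang.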
When the problem occurs, the server (domain controller) never shows anything unusual. It's not overloaded or unresponsive. Our IT provider checked the event reports and never found anything. For a while the server often ran out of RAM, but we recently upgraded it, and memory usage now never goes above 40%. We also upgraded the read/write cache, which increased performance considerably in normal operation but did not eliminate the hang-ups.
Other short facts:
- the DC is an HP ProLiant running Windows Server 2003
- there's another backup DC (the old server) that runs Server 2000
- we have about a dozen workstations and about ten laptops
- McAfee is used for security
- Tobit David email server is running on the DC
- several MS SQL databases are also running on it (MSSQL 2008 Express)
- backup of the network share is done with Acronis
I'd greatly appreciate any suggestions, even if they sound improbable... we have probably tried all the probable things already. Or maybe not.