Brovane
Diamond Member
We had a really weird issue with a 4507R at a remote site yesterday. One of the LAN team members was in the switch making changes to a port when we got a syslog message from the supervisor engine that it had lost handshaking with the standby sup engine. The redundant engine refused to come back up remotely. I talked to the boss and we decided to remove and re-seat the standby sup engine early the next morning during the SLA window.
I go home, and about 4 hours later my boss calls me because the switch looks like it has gone down and the Junior Admin is freaking out. He is driving to the site with spare parts to get the switch back up. I look at my BB and, judging by the alerts, I don't think the switch is down. I go into our network monitoring gear: the switch has stopped responding to pings on its main IP and I cannot access it through SSH, but about 80% of the equipment connected to the switch is up, working fine and responding to pings. What the hell.
So I dig a little deeper and find that I can ping one of the local networks set up on the switch, but I still cannot ping the main IP on VLAN 1. I start looking over the config and I see a bad default gateway. The IP route for 0.0.0.0 0.0.0.0 was set correctly but the default gateway was not. However, this default gateway has been in the config for at least 3 months, so I am not even sure it caused the problem. When the Jr Admin arrived onsite, before he started ripping gear out, I had him remove both sup engines and then bring them up one at a time, and the switch came up fine.
I went in and corrected the default gateway. I opened a TAC case and Cisco cannot figure out what happened either. Has anybody else seen a bad default gateway work fine and then just stop working? Just kind of a weird issue.