My networks gone a bit stupid today. Last night I rebooted my exchange server and it didn't boot up right. I think it had something to do with a disk cleanup that was run which cleaned some files it didn't like (or I deleted some tmp files that shouldn't have). Anyway it wouldn't boot unless I removed the etherent cable (this is a rack server so it was a pain), and I managed to get it booted up and all the services working.
I went home happy(ish)
Today alot of the pupils can't log on. It hangs at loading personal settings and stays there. Now I've removed the GP and it loads straight up but I get a network error when it tries and writes the profile back to the server. So I thought it was a netowrk error. For some reason I thought it might be the exchange server, no it wasn't cos I removed that.
This seems to happen in rooms more than a few hops from the server which i would have hoped would have narrowed it down but it hasn't as I'm getting the error in different places. The weird thing is that 2 machines next to each other which have been cloned will load up at different speeds.
This is annoying the hell out of me cos I can't work out of its a hardware issue, or a software one.
I'm rebooting all the switches before I leave tonight hoping that it will be nice and smooth tommorow but if it isn't then can someone offer some advice.
I think I'll go home and see if I can get some more netowrk tools and see if there's anyting flying about that shouldn't.
Had a similar thing yesterday pm. Servers and clients hanging all over the network as if there was some massive broadcast storm going on... spent hours rebooting servers, switches, running AV and checking event logs with nothing untoward showing up - then it cleared and has been fine since with not a sniff of any related errors in logs anywhere....
makes ya wonder....
Ive had similar problems today as well. Im currently in the middle of migrating roles to new servers spreading the load by they are just running side by side at the moment where possible.
Basically one of the old servers is a DC/DNS and the file server. When this machine goes off it effects everything for some reason. Even down to freezing windows when its loaded!! I put this down to the redirected startmenus and default profile maybe which should be fine as its in the netlogon directory and theres a perfectly healthy (2 infact) DCs to serve the clients.
I changed DHCP so none of the clients are looking at that machine for DNS, I moved the Startmenus to a share on another machine and updated GP but the machines were still taking ages to log on!!!! It seems to me something is taking far too long to time out.
Anyway currently my file server is serving files in safe mode with networking :? while I try and figure out why it wont start properly.
Between that and wrestling a 42U rack into a room with a narrow coridoor I have had lots of fun today
Anyway I will step up my efforts and seize the roles from this server tommorow and probably reinstall server on it as a member server just serving files.
I just find it very strange in a so called multimaster server scenario everything goes tits up when that server isnt up. I mean I would expect a small timeout period and them to be able to log on with no start menu and not be connected to their home directory.
I know someone is going to scream its DNS but Im failing to see where when I have 2 other healthy DNS servers and that srver is no longer looked at for resolution.
We also had a similar problem today and yesterday, when a user logged in it was taking an age for the re-directed start menu to load. The problem seems to have corrected itself for now, i did have ethereal most of the day and didnt detect anything strange on the network.
Its all very strange. . . .
I rebooted everything, including the switches this morning at 7am and it looks okay now.
Very very weird.
Only thing that I noticed was unusual was in the event log when I rebooted the main server. A DCOM error to do with Sophos (again) which was running under the admin account which I fixed and the network card showed it turning on and off 4 times just before it loaded all the services. This has never happened before so I enabled the other network card int he server and switched that on too. No harm in running two. I don't fancy bridging them just yet though just in case its a faulty card.
Anyway, the funs over, back to resetting the kids passwords and finding work that they've 'lost'
Well ours went Tits up - big style - we lost our PDC completely, so had to seize all the roles to the BDC
Rebooted all the servers, then had problems with the kids PC no longer seeing the domain and unable to log in
Eventually rememebered that the PDC (deceased) had the DHCP scopes and DNS entries!! so had to recreate all them
Still couldn't get PC's to log in, eventually found that we had to unjoin them all from the domain, and re add them back again!
On top of that staff took this opportunity to fire stupid faults at us (ie SIMs not working, can you setup an email account, I;m out of Ink!! :twisted: )
I don't think it's Sophos, unless Moorside has changed its AV software. Does everyone who has posted to this site use server 2003 with XP Pro clients? Is there a common thread here? A new patch or update perhaps. All this within the space of just over 24 hours.
Pass me the meerschaum Watson, I think this is a 3 pipe problem.
XP Pro SP2, all latest updates through WSUS, Server 2003 SP1 and again all updates. Its freakey, but it didn't happen today I don't think, although I was in a meeting between 9:30 and 10:30 which is when it occured for me yesterday