+ Post New Thread
Page 1 of 2 12 LastLast
Results 1 to 15 of 16
Hardware Thread, Temperature Monitoring Software for Servers in Technical; Now I know all about the freebie ones (Motherboard Monitor, HDD Temperature), but does anyone know if there's anything out ...
  1. #1

    Join Date
    Oct 2005
    Location
    Anywhere but in a school ;o)
    Posts
    522
    Thank Post
    0
    Thanked 0 Times in 0 Posts
    Rep Power
    0

    Temperature Monitoring Software for Servers

    Now I know all about the freebie ones (Motherboard Monitor, HDD Temperature), but does anyone know if there's anything out there that's free which will monitor server temps - specifically hard drives and server ambient temps? The two I mention don't recognise our HP SCSIs.

    Reason I ask is that I'm thinking of putting a case together about HDD lifespans, money (potentially) wasted with the servers running too hot etc etc for the Governing Body to ask for efficient and correctly-sized air con for our server room (ex office).

  2. #2

    Geoff's Avatar
    Join Date
    Jun 2005
    Location
    Fylde, Lancs, UK.
    Posts
    11,803
    Thank Post
    110
    Thanked 583 Times in 504 Posts
    Blog Entries
    1
    Rep Power
    224

    Re: Temperature Monitoring Software for Servers

    Did you enable SMART support in the bios? IIRC monitoring software requires this to monitor the tempratures.

    Under Linux, you can use 'smartmontools' to monitor the hard drive(s). Here's a sample output from my desktop machine at home.

    Code:
    root@terror:/home/gking/iso# smartctl -a /dev/hdc
    smartctl version 5.34 [i686-pc-linux-gnu] Copyright (C) 2002-5 Bruce Allen
    Home page is http://smartmontools.sourceforge.net/
    
    === START OF INFORMATION SECTION ===
    Device Model:     SAMSUNG SP6003H
    Serial Number:    0441J1FTB05960
    Firmware Version: QV100-60
    User Capacity:    60,060,155,904 bytes
    Device is:        In smartctl database [for details use: -P show]
    ATA Version is:   6
    ATA Standard is:  ATA/ATAPI-6 T13 1410D revision 1
    Local Time is:    Mon Jul  3 16:20:25 2006 BST
    
    ==> WARNING: May need -F samsung or -F samsung2 enabled; see manual for details.
    
    SMART support is: Available - device has SMART capability.
    SMART support is: Enabled
    
    === START OF READ SMART DATA SECTION ===
    SMART overall-health self-assessment test result: PASSED
    
    General SMART Values:
    Offline data collection status:  (0x00) Offline data collection activity
                                            was never started.
                                            Auto Offline Data Collection: Disabled.
    Self-test execution status:      (   0) The previous self-test routine completed
                                            without error or no self-test has ever
                                            been run.
    Total time to complete Offline
    data collection:                 (2700) seconds.
    Offline data collection
    capabilities:                    (0x1b) SMART execute Offline immediate.
                                            Auto Offline data collection on/off support.
                                            Suspend Offline collection upon new
                                            command.
                                            Offline surface scan supported.
                                            Self-test supported.
                                            No Conveyance Self-test supported.
                                            No Selective Self-test supported.
    SMART capabilities:            (0x0003) Saves SMART data before entering
                                            power-saving mode.
                                            Supports SMART auto save timer.
    Error logging capability:        (0x01) Error logging supported.
                                            No General Purpose Logging support.
    Short self-test routine
    recommended polling time:        (   1) minutes.
    Extended self-test routine
    recommended polling time:        (  45) minutes.
    
    SMART Attributes Data Structure revision number: 16
    Vendor Specific SMART Attributes with Thresholds:
    ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
      1 Raw_Read_Error_Rate     0x000b   100   100   051    Pre-fail  Always       -       3
      4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       48
      5 Reallocated_Sector_Ct   0x0033   253   253   010    Pre-fail  Always       -       0
      7 Seek_Error_Rate         0x000b   253   253   051    Pre-fail  Always       -       0
      8 Seek_Time_Performance   0x0024   253   253   000    Old_age   Offline      -       0
      9 Power_On_Hours          0x0032   097   097   000    Old_age   Always       -       2138573
     12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       35
    194 Temperature_Celsius     0x0022   112   112   000    Old_age   Always       -       42
    197 Current_Pending_Sector  0x0033   253   253   010    Pre-fail  Always       -       0
    198 Offline_Uncorrectable   0x0031   253   253   010    Pre-fail  Offline      -       0
    199 UDMA_CRC_Error_Count    0x000a   100   100   000    Old_age   Always       -       0
    200 Multi_Zone_Error_Rate   0x000b   100   100   051    Pre-fail  Always       -       0
    201 Soft_Read_Error_Rate    0x000b   100   100   051    Pre-fail  Always       -       1
    
    SMART Error Log Version: 1
    No Errors Logged
    
    SMART Self-test log structure revision number 1
    No self-tests have been logged.  [To run self-tests, use: smartctl -t]
    
    
    Device does not support Selective Self Tests/Logging
    You can see it's reporting its temprature in that last table as 42C.

  3. #3

    Ric_'s Avatar
    Join Date
    Jun 2005
    Location
    London
    Posts
    7,592
    Thank Post
    109
    Thanked 769 Times in 598 Posts
    Rep Power
    181

    Re: Temperature Monitoring Software for Servers

    HP servers tend to come with their own software for monitoring these things... I just can't remember the name of it at the moment.

  4. #4

    Join Date
    Jul 2005
    Location
    Corby
    Posts
    1,056
    Thank Post
    12
    Thanked 20 Times in 18 Posts
    Rep Power
    24

    Re: Temperature Monitoring Software for Servers

    I use Dell OpenManage for our PowerEdge servers, and our other servers use Intel management tools for server boards. You will usually find that if you have a branded server then it comes with some kind of vendor stamped management/monitoring suite.

  5. #5


    Join Date
    Dec 2005
    Location
    In the server room, with the lead pipe.
    Posts
    4,652
    Thank Post
    275
    Thanked 780 Times in 607 Posts
    Rep Power
    224

    Re: Temperature Monitoring Software for Servers

    Our ml350 will dump errors to event logs if disk temperatures exceed safe levels and you could pipe this to a syslog server using Snare.

  6. #6

    john's Avatar
    Join Date
    Sep 2005
    Location
    London
    Posts
    10,362
    Thank Post
    1,499
    Thanked 1,053 Times in 922 Posts
    Rep Power
    303

    Re: Temperature Monitoring Software for Servers

    I just use the HP Systems Management software that gets installed as part of Smart Start when I set them up, its good for alsorts of things for viewing on them.

  7. #7
    eean's Avatar
    Join Date
    May 2006
    Location
    Kuala Lumpur
    Posts
    566
    Thank Post
    68
    Thanked 54 Times in 38 Posts
    Rep Power
    30

    Re: Temperature Monitoring Software for Servers

    Just thinking 'out of the box' here: how about borrowing some data logging equipment from the science department and hooking up some temperature probes. Should get you your ambient temperature at least.
    (I went ultra low tech and just stuck a thermometer in there!)

  8. #8
    tosca925's Avatar
    Join Date
    Aug 2005
    Location
    Midlands
    Posts
    1,547
    Thank Post
    4
    Thanked 4 Times in 4 Posts
    Rep Power
    22

    Re: Temperature Monitoring Software for Servers

    At my old place we used somthing call HHD Life, we used it to monitor the temps of the hard drives, it sounds an alarm when it reaches a set temp.

    I think you have topay for this though, i dont think its a free download, been a long time since we used it but its good to convince SLT on AIR Con

  9. #9


    Join Date
    Jan 2006
    Posts
    8,202
    Thank Post
    442
    Thanked 1,032 Times in 812 Posts
    Rep Power
    339

    Re: Temperature Monitoring Software for Servers

    lm-sensors is also useful, although I've not got it to work consistently on different hardware.
    this machine (nvidia/athlon3200) with Ubuntu outputs this:
    Code:
    sensors
    w83627hf-isa-0290
    Adapter: ISA adapter
    VCore 1:   +1.60 V  (min =  +1.57 V, max =  +1.73 V)
    VCore 2:   +2.62 V  (min =  +1.57 V, max =  +1.73 V)       ALARM
    +3.3V:     +3.33 V  (min =  +3.14 V, max =  +3.47 V)
    +5V:       +4.95 V  (min =  +4.76 V, max =  +5.24 V)
    +12V:     +11.98 V  (min = +10.82 V, max = +13.19 V)
    -12V:     -12.28 V  (min = -13.18 V, max = -10.80 V)
    -5V:       -5.30 V  (min =  -5.25 V, max =  -4.75 V)       ALARM
    V5SB:      +5.51 V  (min =  +4.76 V, max =  +5.24 V)       ALARM
    VBat:      +3.50 V  (min =  +2.40 V, max =  +3.60 V)
    fan1:     3970 RPM  (min = 84375 RPM, div = 4)              ALARM
    fan2:     3096 RPM  (min = 7031 RPM, div = 4)              ALARM
    fan3:     1776 RPM  (min = 9926 RPM, div = 4)              ALARM
    temp1:       +36°C  (high =   +66°C, hyst =   +48°C)   sensor = thermistor
    temp2:     +48.0°C  (high =   +75°C, hyst =   +70°C)   sensor = thermistor           (beep)
    temp3:     -48.0°C  (high =   +75°C, hyst =   +70°C)   sensor = thermistor
    vid:      +1.650 V  (VRM Version 9.0)
    alarms:
    beep_enable:
              Sound alarm enabled
    My masterplan (that is of the lowest priority ever) is/was to get the output and hook it up to a serial infrared transmitter that could adjust the temperature of the Aircon depending upon how hot the computer was. Somehow I doubt I'll get around to doing it.

  10. #10

    mac_shinobi's Avatar
    Join Date
    Aug 2005
    Posts
    9,811
    Thank Post
    3,327
    Thanked 1,056 Times in 977 Posts
    Rep Power
    365

    Re: Temperature Monitoring Software for Servers

    Everest from lavalys is pretty good :

    http://www.lavalys.com/

    You can get the home edition which you can get a trial version for about 30 days or so. Other then that I agree with kingswood with regards to the tools / software that originally came with the server.

  11. #11


    Join Date
    Jan 2006
    Posts
    8,202
    Thank Post
    442
    Thanked 1,032 Times in 812 Posts
    Rep Power
    339

    Re: Temperature Monitoring Software for Servers

    software that originally came with the server.
    ...or the OS

  12. #12

    Join Date
    Jun 2006
    Posts
    21
    Thank Post
    0
    Thanked 0 Times in 0 Posts
    Rep Power
    0

    Re: Temperature Monitoring Software for Servers

    We use Everest - I agree with the above, it is good.

    Interesting fact, normal server drive is rated at 800k hours MTBF. This assumes a max. operating temp of 52deg. For every hour spent at 1deg over this limit, MTBF reduces by 20k hours.

    So a bit of money spent on air con/air flow can save you money on new drives and support time!!

  13. #13

    Join Date
    Oct 2005
    Location
    Anywhere but in a school ;o)
    Posts
    522
    Thank Post
    0
    Thanked 0 Times in 0 Posts
    Rep Power
    0

    Re: Temperature Monitoring Software for Servers

    Where'd you get that information from? I've been trying to find that officially on the web (I did know it, I don't know how I knew it though) so that I can quote it to SMT and prove my point!

  14. #14

    mac_shinobi's Avatar
    Join Date
    Aug 2005
    Posts
    9,811
    Thank Post
    3,327
    Thanked 1,056 Times in 977 Posts
    Rep Power
    365

    Re: Temperature Monitoring Software for Servers

    I didnt know about the stats with regards to the MTBF of scsi hard drives and such like , also with regards to my above post I was supposed to say that they have a home edition and an ultimate edition and that the home edition is free ( and that I wasnt sure if it would run on a server because of it being on a domain and all that ) As home edition detects if it is running on a domain or not and it refuses to run on a domain. Where as the ultimate edition will run but only for the trial period which I believe is 30 days.

  15. #15

    Geoff's Avatar
    Join Date
    Jun 2005
    Location
    Fylde, Lancs, UK.
    Posts
    11,803
    Thank Post
    110
    Thanked 583 Times in 504 Posts
    Blog Entries
    1
    Rep Power
    224

    Re: Temperature Monitoring Software for Servers

    My masterplan (that is of the lowest priority ever) is/was to get the output and hook it up to a serial infrared transmitter that could adjust the temperature of the Aircon depending upon how hot the computer was. Somehow I doubt I'll get around to doing it.
    Well, you could install net-snmp and use cacti to at least monitor the tempratures.

SHARE:
+ Post New Thread
Page 1 of 2 12 LastLast

Similar Threads

  1. Replies: 10
    Last Post: 11th December 2007, 02:25 PM
  2. Server room temperature
    By netadmin in forum General Chat
    Replies: 30
    Last Post: 11th June 2007, 07:35 AM
  3. UPS Monitoring software
    By Dos_Box in forum Downloads
    Replies: 5
    Last Post: 12th October 2006, 10:41 AM
  4. Temperature /Air Con / Health and Safety
    By CyberNerd in forum Hardware
    Replies: 18
    Last Post: 30th July 2006, 12:00 PM
  5. Web traffic monitoring software.
    By Dos_Box in forum Wireless Networks
    Replies: 1
    Last Post: 5th July 2005, 09:33 AM

Thread Information

Users Browsing this Thread

There are currently 1 users browsing this thread. (0 members and 1 guests)

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •