![]() | Register | FAQ | Members | Social Groups | User Map | Calendar | Search | Today's Posts | Mark Forums Read |
| Blue Skies Somewhere for EduGeekers to discuss those far away ideals of major (or maybe minor) changes that will have a large impact on schools, the jobs of technical staff and learning. |
| | | LinkBack | Thread Tools | Search Thread | Language |
| Sponsored Links |
| | #1 |
![]() Join Date: Oct 2007 Location: Auckland, New Zealand
Posts: 1,825
Thanks: 60
Thanked 261 Times in 237 Posts
Blog Entries: 2 Rep Power: 58 | I just read this article on how the shear volume of data now available to organizations like Google will drive a new wave of science. Instead of running countless experiments and attempting to formulate a model that explains them this new method simply analyzes the vast amount of data available and looks for correlation. This effectively turns the whole world into the lab and society itself is running some of the experiments by people simply existing in their day to day lives. I don't think it will replace the traditional science lab anytime soon but eventually every science class may need access to a supercomputing cluster just to keep up. |
| |
| | #2 |
![]() Join Date: Aug 2005 Location: London
Posts: 1,180
Thanks: 1
Thanked 123 Times in 108 Posts
Rep Power: 31 | It's a fascinating idea and I'm sure there's value in it but it does need care. So much of the "data" on the internet is garbage that any kind of aggregation and analysis needs to take great care that it weights stuff in a sensible way (just because a million web sites say drug X is not safe does not mean that we should go with the majority view, for example) |
| |
| | #3 |
![]() Join Date: Jul 2006 Location: South Yorkshire
Posts: 2,807
Thanks: 108
Thanked 109 Times in 81 Posts
Rep Power: 34 | Eh? Surely the whole point of proper scientific research is determining a relevant set of criteria, applying it to the data set (which you keep to a specific level) and ensuring the data set isn't polluted with erroneous rubbish that will skew it. By definition there's already a huge amount of data available by "just watching people go about their every day lives" that doesn't involve the net at all but it still isn't done that way because the data collection is a nightmare of contradictory rubbish and skewed cr*p. Any scientist with half a brain is going to see a search algorithm as skewing the results before you've even started or am I just being a layperson with attitude? |
| |
| | #4 | ||
![]() Join Date: Aug 2005 Location: Alton, Hampshire
Posts: 1,513
Thanks: 128
Thanked 99 Times in 94 Posts
Rep Power: 30 | Quote:
Quote:
-- David Hicks | ||
| |
| | #5 | |||
![]() Join Date: Oct 2007 Location: Auckland, New Zealand
Posts: 1,825
Thanks: 60
Thanked 261 Times in 237 Posts
Blog Entries: 2 Rep Power: 58 | Quote:
The point is that running the right types of algorithms on this kind of data will turn up things that are significant that we weren't specifically looking for. Effectively it shows us patterns in the data that can point out mechanisms that we are not aware of. You are right about the amount of rubbish data produced on the internet but there is rubbish data (interference) in any data source and more effective algorithms can be used to compensate but there may even be useful patterns in the interference that we are unable to see. An algorithm looking at a certain type of language may show up some unexpected results but these in themselves could be useful patterns for a different problems, SQL vulnerabilities etc. The other thing to take into consideration is that the absence of data on such a large data set can also be a strong indication of a pattern. Quote:
Quote:
| |||
| |
| | #6 | |
![]() Join Date: Aug 2005 Location: Alton, Hampshire
Posts: 1,513
Thanks: 128
Thanked 99 Times in 94 Posts
Rep Power: 30 | Quote:
-- David Hicks | |
| |
| |
| | ||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| AQA e-Science How Science Works Plug-In | NickJones | Educational Software | 0 | 10-10-2007 08:01 AM |
| Best server 2003 backup method? | starscream | How do you do....it? | 3 | 27-06-2007 02:26 PM |
| Scientific Notation in office 2007 | wesleyw | How do you do....it? | 7 | 12-06-2007 06:42 PM |
| What scripting method would you recommend | SimpleSi | Scripts | 24 | 15-11-2006 11:25 AM |
| Environment variables or other method | HodgeHi | Scripts | 3 | 21-06-2006 09:05 AM |
| Tags |
| cluster, google, science, statistics, triple blind |
| Currently Active Users Viewing This Thread: 1 (0 members and 1 guests) | |
| Thread Tools | Search Thread |
|
|






