As opposed to Chris Anderson ’s viewpoint, theory is not quite dead yet… but science is indeed changing. Engineering and new innovations have always accelerated new science, but today this is happening faster than ever. Besides the fact that computers are increasingly used to store and process scientific data, instrumentation to measure more things, more accurately, and more often are rapidly developed.

The instruments provide masses of new data where storage capacity and processing capacity are amazingly capable of keeping up with the pace of data generation. Hence, computational capacity is not appearing to be a limiting factor for progress towards new scientific discoveries. Google Research Datasets is one new example. New data analyses and visualization tools are needed to handle the masses of the new multi-dimensional datasets from across disciplines, and that is where Peter Norvig and other machine learning campers appeared to have the most rational approach for an answer. Creative visualization solutions are also emerging, for example, a demo of trendalyzer , a way to visualize multi-dimensional data was given to those who went on the Googleplex tour.
Different sciences are at different stages when it comes to the imbalance (The Gap ) between data and theory. A heated discussing chaired by KK about the future of the scientific method generated many interesting ideas: Generalizing about too much data destroying theory might not be applicable in all cases. In physics, for example, theory is ahead. Most theories in physics cannot be tested experimentally. Experimental physics is going to receive a boost from the CERN collider , projected to produce petabytes of new data. In biology, on the other hand, instrumentation like fast DNA sequencing (of everything) are producing data more rapidly than we can fully computationally digest (at least right away). Hence, in biology, theory is lagging behind the data. In climate science, where theory has been traditionally ahead of the data, a shift towards the opposite is soon coming.

The tools used to analyze and visualize data across disciples seem to have commonalities. This should be exploited so efforts are not duplicated and progress can be achieved more rapidly. Peter Fox suggestion of X-informatics offers to build a community that will be developing a library of data analysis tools and resources that can be used across disciplines to narrow the divided between data and theory.
With all this optimism and excitement, and the observation that more things are discovered faster than ever before, time is of the essence. Global warming and the destruction of our own environment are of great concern. Our inability to increase the drug discovery rate and develop new and more successful treatments for diseases such as HIV and cancer (although great progress is being made) is another issue. Another concern was demonstrated through a 3D visualization of real-time Google searches around the world. It shows that Africa is left behind in this information age (the Information Divide). Cell phone technology that would deliver applications and access to everyone is a potential solution suggested by Joel Selanikio . This is in accordance with Google’s mission to provide computational resources for all.
Chris Anderson’s and Peter Norvig’s idea that theory will not be needed when everything will be measured is an interesting perspective. But in my opinion new theories will emerge when we will be able to fully understand those masses of new data. Not just through machine learning approaches or static network analyses but also by using advanced computer simulations and other tools that will track network dynamics. The theories that will emerge from looking at these massive and diverse datasets, collected from natural and man-made systems, are those complex systems design principles that emerge when you take a step back and look at common features shared among different systems. Speaking of man-made systems, hardware and robotics, as well as synthetic biology seems to be the next two frontiers. When we get to a point that we almost fully understand natural systems, we will always have to continually study the complex systems that we are creating.
I am lucky that my current boss and Ph.D. thesis mentor Ravi could not go and kindly suggested to the organizers to invite me. My first blog post was mainly a result of reflections from the first SciFoo in ’06, and just like the first time, this time around I learned a lot and had a great time.