Data is growing exponentially everywhere - in business, web, finance,
government, science, and in the world of sensors and smart grids.
Speaking earlier this week at OSBC, Tim O'Reilly said "The future will be
all about who has most data, and who is able to extract meaning from it and
deliver it in real time". He noted that the IT industry is now in the
process of being reinvented around the idea of realtime analysis of "Big
Data" in the cloud, as a must-have adjunct to the much more limited kinds of
data processing and analytics that can be performed on desktop PCs or mobile
Today, every organization needs to become as expert at exploiting "Big Data"
as leading edge companies such as Google and Facebook have become. So how are
they going to get in the game? How is the ordinary business professional, for
example, going to handle the explosively growing volu... (more)
Two weeks ago I wrote about "The Need for Speed" in cloud computing, and
asked "Who is going to build the low-latency cloud for enterprise
customers?". Today Werner Vogels and his team at Amazon announced their
Cluster Compute Instances offering.
This is a very important step forward towards the kind of realtime, high
performance cloud that customers such as Cloudscale require to deliver the
next generation of cloud services. In our case, it means we now have three
distinct alternatives for deployment of our massively parallel realtime data
warehouse architecture: standard public... (more)
In big data computing, and more generally in all commercial highly parallel
software systems, speed matters more than just about anything else. The
reason is straightforward, and has been known for decades.
Put very simply, when it comes to massively parallel software of the kind
need to handle big data, fast is both better AND cheaper. Faster means lower
latency AND lower cost.
At first this may seem counterintuitive. A high-end sports car will be much
faster than a standard family sedan, but the family sedan may be much
cheaper. Cheaper to buy, and cheaper to run. But massively ... (more)
Cloudcel on Ulitzer
At the recent Hadoop World conference, Doug Cutting, Hadoop Project Founder,
remarked that "The Dream" was to provide non-programmers with the power of
parallel cloud computing tools such as MapReduce and Hadoop, via simple,
easy-to-use spreadsheet-like interfaces.
With Cloudcel, the non-programmers of the world (and the programmers too!)
can "live that dream" today.
Not only can you develop and launch massively parallel MapReduce/Hadoop-style
cloud computations, simply and seamlessly from within the standard Excel
interface, you can also go way beyond tools ... (more)
Cloudcel on Ulitzer
Back in 1985, the world was pre-web, data volumes were small, and no one was
grappling with information overload. Relational databases and the shiny new
SQL query language were just about perfect for this era. At work, 100% of the
data required by employees was internal business data, the data was highly
structured, and was organized in simple tables. Users would pull data from
the database when they realized they needed it.
Fast forward to 2010. Today, everyone is grappling constantly with
information overload, both in their work and in their social life. Most ... (more)