Cleversafe provides dispersed storage solutions that give infinite scale
and cost-effective data storage/protection/access. Apache Hadoop and the CDH4
distribution provides all the required software for implementing MapReduce
and the other chores associated with analysis over massive quantities of
data. What if these two capabilities could be combined in a smart, well
engineered way? The potential impact on analysis of Big Data capabilities
would be huge if this were to happen.
Cleversafe engineers have been working on this very thing and today are
announcing a dispersed storage solution can run Hadoop. This is way cool.
I had an opportunity to speak with Cleversafe’s VP of Product Strategy,
Russ Kennedy and their Director of Federal Solutions, Bobby Caudill about
this announcement. Here is what I took away from the conversation:
This new ability to combine computat... (more)
By the way, if Ellen Degeneres wants to respond in a reasonable and
constructive way to the lawsuits over her use of song snippets to dance to,
she could always start using Creative Commons-licensed music, with a nice
plug for the open-hearted musicians making our lives more tuney.
As a former enterprise CTO and current technology watcher, I was struck at
the incredible brilliance of yesterday’s announcement by Dell and
Cloudera. In a move that will help enterprises of all sizes serve a very wide
range of missions, those two organizations have announced a new relationship
that will enable something that has never really existed before. Enterprises
can now buy a Hadoop-centric solution that has been validated from end-to-end
and is available from a single vendor. And they can do this on systems that
have hardware designed for performing Big Data analytics.
Since their formation, Cloudera has been working to bring clarity and unity
and support to the entire stack of software around Apache Hadoop. A key
benefit of the Cloudera Distribution Including Apache Hadoop (CDH) is that it
makes enterprise support in production environments possible. Every ... (more)
I spent some time last week with several vendors and users of Hadoop, the
formless data repository that is the current favorite of many dot coms and
the darling of the data nerds. It was instructive. Moms and Dads, tell your
kids to start learning this technology now. The younger the better.
I still know relatively little about the Hadoop ecosystem, but it is a big
tent and getting bigger. To grok it, you have to cast aside several long-held
tech assumptions. First, that you know what you are looking for when you
build your databases: Hadoop encourages pack rats to store every log entry,
every Tweet, every Web transaction, and other Internet flotsam and jetsam.
The hope is that one day some user will come with a question that can’t be
answered in any way other than to comb through this morass. Who needs to
spend months on requirements documents and data dictionaries ... (more)
I saw a conversation today on Twitter that asked why we don’t just embed
proper security into Hadoop instead of suggesting the API gateway approach
to Hadoop security that my colleague Blake proposed. The same could be
asked about any number of applications and services, but the bottom line is
that we believe that a two-pronged approach is best.
Internally, we have dramatically improved Hadoop’s security capabilities
via Project Rhino. This enables best security practices like encryption at
rest, which cannot be implemented anywhere else. We are also working to
standardize the authorization framework and implement token based
authentication with single sign-on. These are all core capabilities that
absolutely need to be added to Hadoop’s code base.
The gateway approach addresses something else – the API layer. While I
agree that any application should protect ag... (more)
Social media has become an integral part of our PR campaigns at BLASTmedia,
especially Twitter. Clients often ask us, "Who is using Twitter and why
should it matter to us?" eMarketer recently published an article that answers
both of those questions, and many more. Want to know the demographics and
usage habits of the microblogging crowd? Read this article!
Here are a few highlights:
More than one-half of all Twitter users (53%) are women, and the majority are
75% of all Twitter activity comes from just 5% of users.
Not surprisingly, the more followers a user has, the more that person tweets.
Funambol, the leading provider of open source mobile cloud sync and push
email for billions of phones, today announced that its new version 8 software
is generally available. Funambol v8 makes it easy to keep the data and
content on billions of mobile phones in sync with social networks, email
systems and personal computers via the Internet cloud.
Funambol v8 has a completely redesigned AJAX web portal. The portal's
striking interface makes it simple for users to access mobile data and rich
media in the cloud, using any modern browser
"With the exploding popularity of smartphones, people are carrying more of
their lives on their phones than ever before," said Fabrizio Capobianco,
Funambol CEO. "People increasingly expect to access their data and content in
the cloud and other systems. The new Funambol Portal makes it super easy to
access mobile data and content, any... (more)
A friend of mine from my IBM days (an expert in Data Warehousing, BI, etc.)
told me about the Hadoop conference he attended in San Jose few weeks back.
When he attended the same conference two years ago in New York, there were
hardly 200 attendees whereas this time, the number exceeded 2000 and it was a
sold out event. This just proves how fast Hadoop has generated interest. He
said that one theme in every presentation was the need for Hadoop skills as
almost every presentation had a slide, “we are hiring”.
Hadoop offers a massively scalable data management and analysis environment
that can handle many different data types without the complicated
transformation and schema changes required to load diverse data into a
conventional RDBMS. Remember the days of ETL (Extraction, Transformation,
Loading) when data massaging and cleansing preceded the creation of the Data
CTOlabs.com, a subsidiary of the technology research, consulting and services
firm Crucial Point LLC and a peer site of CTOvision.com, has just published a
white paper providing context and use cases on Hadoop For Law Enforcement, an
important mission-focused domain ripe for the application of more Big Data
From the report:
Big Data, the data too large and complex for your current information
infrastructure to store and analyze, has changed every sector in government
and industry. Today’s sensors and devices produce an overwhelming amount of
information that is often unstructured, and solutions developed to handle Big
Data now allowing us to track more information and run more complex analytics
to gain a level of insight once thought impossible.
The dominant Big Data solution is the Apache Hadoop ecosystem which provides
an open source platform for reli... (more)
Shrinking your Url’s is all the rage nowadays. If you are on Twitter,
then odds are you have used one. Despite CodingHorror’s distaste for them
in his recent blog post on Url Shorteners: Destroying the Web since 2002,
they are a fact of life when we live in a world of 140 character status
So what’s a URL shrinking service anyway? Well, to put it simply, you
supply them with a URL, they then supply you with a shorter URL containing a
lookup “key”. When future requests are made to this shorter URL,
connections are routed to that services website where they convert the short
URL to the original URL and issue a HTTP Redirect back to your browser to
send you off to the original long url website.
So, what’s a guy, or gal, to do if they want to set their status
programmatically on Twitter, Facebook, FriendFeed, or the other gazillion
social networking sites ... (more)
"We believe that the presence of Cloudera KK will significantly propel Hadoop
deployments and its application in Big Data forward in Japan," said Mr.
Hiroshi Endo, Sr. VP Senior Executive Manager System Platforms Sector, NTT
DATA, as Cloudera today announced that it has established a Japanese
subsidiary, and an office in Japan.
To provide Hadoop enterprise support services in Japan, Cloudera KK is
working with NTT DATA and Nippon Steel Solutions Corporation.
"We are excited to work closely together to deliver superior support services
and solutions for enterprises, and also to promote open source software such
as Apache Hadoop-based solutions," Endo continued. "NTT DATA looks forward to
the global development of the Hadoop platform. By partnering with the
industry leader, Cloudera, we are able to address the growing demand of
enterprise customers for a more effectiv... (more)