Author Archives: Big Data Perspectives

About Big Data Perspectives

Erik Ottem has over 25 years of technology experience with IBM, Seagate, Gadzoox Networks and Agilent. Observations and comments about Big Data are presented for your review.

Bridging the Gap Between Structured and Unstructured Data

Relational databases like structured data, tables of columns and rows in a defined schema so everyone knows what to expect in every place.  Unstructured data like text, data or numeric values are a different story.  Hadoop certainly fills a need … Continue reading

Posted in Uncategorized | 1 Comment

The Problem with Machine Learning

Machine learning is widely perceived as getting its start with chess.  When the skills of the program exceeded the skills of the programmer, the logic went,  you’ve created machine learning.  The machine now has capabilities that the programmer didn’t.  Of … Continue reading

Posted in Uncategorized | Tagged | Leave a comment

Hadoop Summit 2013

The Hadoop Summit for 2013 has just concluded in San Jose.  There were a few themes that seemed to recur throughout the two-day summit with over 2,500 people.  The overall story is the continued progress to take Hadoop out of … Continue reading

Posted in Uncategorized | Leave a comment

Why Would I Use NoSQL?

In any job, it helps when you use the right tool for the job. In the Big Data universe there can be many different kinds of data. Structured data in tables. Text from email, tweets, facebook, or other sources. Log … Continue reading

Posted in Uncategorized | Tagged , , , , | Leave a comment

Managing a Flood of Data

Managing a Flood of Data With increasing connectedness of devices and people, the data just keeps coming. What to do with all that data is becoming an increasing problem, or opportunity if you have the right mindset. In general there … Continue reading

Posted in Uncategorized | Tagged , , , , , | Leave a comment

Information Extraction- Ready for Prime Time?

Oren Etzioini of the University of Washington held a talk at Adobe in March, and gave a rundown on the current state of the art in IE.  We’ll get to that in a minute, but what is IE?  Information Extraction … Continue reading

Posted in Uncategorized | Leave a comment

“Active Flash” for Big Data Analytics on SSD-based Systems

FAST13 USENIX Conference on File and Storage Technologies February 12–15, 2013 in San Jose, CA If you’re not familiar with the geekfest called USENIX and their file and storage technology conference, it is a very scholarly affair. Papers are submitted on … Continue reading

Posted in Uncategorized | Tagged , , , , , | Leave a comment