The Way from Data to Information

Data Mining

Top Stories

The term Data Unification is new in the Big Data lexicon, pushed by a variety of companies such as Talend, 1010Data, and TamR. Data unification deals with the domain known as ETL (Extraction, Transformation, Loading), which emerged during the 1990s when Data Warehousing was gaining relevance. ETL refers to the process of extracting data from inside or outside sources (multiple applications typically developed and supported by different vendors or hosted on separate hardware), transforming it to fit operational needs (based on business rules), and loading it into end target databases, more specifically, an operational data store, data mart, or a data warehouse. These are read-only databases for analytics. Initially the analytics was mostly retroactive (e.g. how many shoppers between age 25-35 bought this item between May and July?). This was like driving a car looking at the ... (more)
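
As a rough illustration of the extract, transform, load sequence described in the teaser above, here is a minimal Python sketch. It is not taken from the article: the two source layouts, the field names, the `purchases` table, and the `count_buyers` helper are all hypothetical, and an in-memory SQLite database stands in for the data mart or warehouse.

```python
import sqlite3

# Hypothetical source data: two applications with different record layouts.
store_rows = [
    {"shopper_age": "29", "item": "tent", "month": "06"},
    {"shopper_age": "41", "item": "tent", "month": "06"},
]
web_rows = [
    {"age": 33, "sku": "TENT", "purchased": "2016-07-15"},
    {"age": 27, "sku": "STOVE", "purchased": "2016-05-02"},
    {"age": 31, "sku": "TENT", "purchased": "2016-09-01"},
]


def extract():
    """Extract: pull the raw records from each source unchanged."""
    return store_rows, web_rows


def transform(store, web):
    """Transform: map both layouts onto one (age, item, month) schema."""
    unified = [
        (int(r["shopper_age"]), r["item"].lower(), int(r["month"]))
        for r in store
    ]
    unified += [
        (r["age"], r["sku"].lower(), int(r["purchased"][5:7]))
        for r in web
    ]
    return unified


def load(rows):
    """Load: write the unified rows into the (assumed) analytics table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE purchases (age INTEGER, item TEXT, month INTEGER)"
    )
    conn.executemany("INSERT INTO purchases VALUES (?, ?, ?)", rows)
    conn.commit()
    return conn


def count_buyers(conn, item, min_age, max_age, first_month, last_month):
    """Retroactive query: how many shoppers in an age band bought an item."""
    cur = conn.execute(
        "SELECT COUNT(*) FROM purchases "
        "WHERE item = ? AND age BETWEEN ? AND ? AND month BETWEEN ? AND ?",
        (item, min_age, max_age, first_month, last_month),
    )
    return cur.fetchone()[0]


warehouse = load(transform(*extract()))
# Shoppers aged 25-35 who bought a tent between May (5) and July (7).
print(count_buyers(warehouse, "tent", 25, 35, 5, 7))
```

The final query mirrors the retroactive question quoted in the teaser; a real pipeline would read from live systems and apply far richer business rules in the transform step.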

Demystifying #DataScience | @CloudExpo #BigData #AI #ArtificialIntelligence

[Opening Scene]: Billy Dean is pacing the office. He’s struggling to keep his delivery trucks at full capacity and on the road. Random breakdowns, unexpected employee absences, and unscheduled truck maintenance are impacting bookings, revenues and ultimately customer satisfaction. He keeps hearing from his business customers how they are leveraging data science to improve their business operations. Billy Dean starts to wonder if data science can help him. As he contemplates what data science can do for him, he slowly drifts off to sleep, and visions of Data Science start dancing in his head… [Poof! Suddenly Wizard Wei appears]: Hi, I’m your data science wizard to help alleviate your data science concerns. I don’t understand why folks try to make the data science discussion complicated. Let’s start simple with a simple definition of data science: Data science is a... (more)

The Future Is Intelligent Apps | @ThingsExpo #IoT #M2M #BigData #Analytics

I have seen the future! Of course, I seem to say that every other month (maybe that’s because the future keeps changing?), but this is a good one. The future is a collision between big data (and data science) and application development that will yield a world of “intelligent apps.” These “intelligent apps” combine customer, product and operational insights (uncovered with predictive and prescriptive analytics) with modern application development tools and user-centric design to create a more compelling, more prescriptive user experience. These intelligent apps not only know how to support or enable key user decisions, but they continually learn from the user interactions to become even more relevant and valuable to those users. Several developments and posts by industry leaders over the past few weeks have started to add some substance to this intelligent apps tre... (more)

A Hybrid Data Pipeline | @CloudExpo @ProgressSW #BigData #AI #DataLake

Building a Hybrid Data Pipeline for Salesforce and Hadoop My team embarked on building a data lake for our sales and marketing data to better understand customer journeys. This required building a hybrid data pipeline to connect our cloud CRM with the new Hadoop Data Lake. One challenge is that IT was not in a position to provide support until we proved value, and marketing did not have the experience, so we embarked on the journey ourselves within the product marketing team for our line of business within Progress. In his session at @BigDataExpo, Sumit Sarkar, Product Marketing Engineer at Progress, will discuss how the key to delivering on this was using standard interfaces in a bi-directional data pipeline to connect the systems. On the Salesforce side, we were able to get frictionless access to the data lake using clicks-not-code via OData. On the Hadoop side,... (more)

Citizen Data Scientist, Jumbo Shrimp | @CloudExpo @Schmarzo #BigData

Citizen Data Scientist, Jumbo Shrimp, and Other Descriptions That Make No Sense Okay, let me get this out there: I find the term “Citizen Data Scientist” confusing. Gartner defines a “citizen data scientist” as “a person who creates or generates models that leverage predictive or prescriptive analytics but whose primary job function is outside of the field of statistics and analytics.” While we teach business users to “think like a data scientist” in their ability to identify those variables and metrics that might be better predictors of performance, I do not expect that the business stakeholders are going to be able to create and generate analytic models. I do not believe, nor do I expect, that the business stakeholders are going to be proficient enough with tools like SAS or R or Python or Mahout or MADlib to 1) create or generate the models, and then 2) be profi... (more)

$1B Opportunity: #DarkData | @CloudExpo #BigData #BI #AI #ML #DataScience

Every organization collects, stores and retains portions of dark data. It's the digital equivalent of emotional baggage which hangs around after every user interaction, transaction, and customer engagement. In fact, not using data effectively is costing United Airlines almost $1 Billion annually in lost revenue. Gartner Inc. describes dark data as "information assets that organizations collect, process and store in the course of their regular business activity, but fail to use for other purposes." For travel companies with a strong online presence, dark data represents a sizable portion of all data stored. Such examples might include: how many times a user resets their password; IP address when a user logs into your website/app; last email communication date to your customers; mobile handset type, or web browser version; free text feedback on a hotel stay or recent flig... (more)

Putting Things to Work in the "Internet of Things"

Connected cars, factory equipment and household products communicating over the Internet is increasingly becoming a reality – one that might soon elicit headlines like “Is the Internet of Things a big bust?” That’s because it’s one thing to connect a device to the Internet and direct data back to the manufacturer or service provider. It’s another to derive new information from those data streams. The ability to analyze data in the IoT is critical to designing better products, predicting maintenance issues, and even improving quality of life. Understanding the Internet of Things The Internet is no longer just a network of people using computers and smart devices to communicate with each other. In the not too distant future, everything from the factory floor to a city street will be connected to the Internet. Three out of four global business leaders are exploring the... (more)

Keys to the Kingdom

According to various history sites, the earliest known key-operated lock was from Egypt, some 4000 years ago. It was wooden and actually used moveable pegs that fell into holes to secure the ‘bolt.’ The wooden key would move the pins back into place to allow the lock to be opened. And, of course, Caesar is credited with inventing the first cipher. Ahh, love history and always fun to know where some of today’s technologies came from. In security, specifically cryptography, a key is a specific number value that when used with an algorithm can encrypt and decrypt a block of data – usually text. The key length or size, typically in bits or bytes, determines how strong the encryption is and thus how difficult it might be to decrypt. There are two kinds of encryption: symmetric and asymmetric. With symmetric encryption, only one ‘secret key’ is used on both ends to encrypt... (more)
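
To make the idea of a single shared secret key concrete, here is a minimal Python sketch using the Caesar shift mentioned in the teaser above. It is a toy for illustration only, not a secure cipher and not code from the article; the function names `caesar`, `encrypt`, and `decrypt` are assumptions introduced here.

```python
import string

ALPHABET = string.ascii_lowercase


def caesar(text, key):
    """Shift each letter by `key` positions; other characters pass through."""
    shift = key % 26
    shifted = ALPHABET[shift:] + ALPHABET[:shift]
    table = str.maketrans(
        ALPHABET + ALPHABET.upper(),
        shifted + shifted.upper(),
    )
    return text.translate(table)


def encrypt(plaintext, key):
    """Symmetric encryption: the sender applies the shared key."""
    return caesar(plaintext, key)


def decrypt(ciphertext, key):
    """Symmetric decryption: the receiver reverses it with the same key."""
    return caesar(ciphertext, -key)


secret_key = 3  # the one key both ends must know
message = "Attack at dawn"
ciphertext = encrypt(message, secret_key)
print(ciphertext)                       # Dwwdfn dw gdzq
print(decrypt(ciphertext, secret_key))  # Attack at dawn
```

Because only 25 shifts actually change the text, an attacker can simply try them all, which is a small-scale view of why key length determines how hard the encryption is to break.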

Lori MacVittie Interview at Cloud Connect

I got a chance to sit down with another member of the Technical Marketing Team at F5, Lori MacVittie, at the Cloud Connect conference in Santa Clara this week. We chat about Web 2.0, Infrastructure 2.0, dynamic networks, cloud interoperability standards, what 3.0 looks like and a few other things. Thanks Lori! ... (more)

Big Data Business Model Maturity Index and IoT | @ThingsExpo #BigData #IoT #M2M #API #Wearables

Big Data Business Model Maturity Index and the Internet of Things (IoT) Antonio Figueiredo (@afigueiredo) recently challenged me on Twitter with an interesting question: How would the Big Data Business Model Maturity Index (BDBMMI) change to support the Internet of Things (IoT)? My hope is that the BDBMMI would not need to change to support IoT. It is my hope that the BDBMMI could be used to guide any industry that is going through a data and analytics-driven transformation, such as what is happening to many industries due to IoT. Let’s see how one could use the BDBMMI to help organizations exploit the IoT. But before we start that exercise, let’s start with some key definitions: The Big Data Business Model Maturity Index (BDBMMI) is a framework to measure how effective an organization is at leveraging data and analytics to power the business (see Figure 1). We ... (more)

Cogensia Continues to Expand with the Announcement of 8 New Hires for Customer-Centric Marketing

CHICAGO, Aug. 21, 2014 /PRNewswire/ -- In order to continue to deliver on the promise of customer-centric marketing, Cogensia welcomes 8 new hires to its staff. "Marketing services - in particular, data mining, data sciences, and the management of big data - require growing technical expertise, and we are pleased to continue our leadership in the industry with this expansion," said Cogensia President and CEO, Brad Rukstales. Mindi Barber joins Cogensia as an Account Manager. Before joining Cogensia, Mindi worked on Social CRM campaigns for Hyundai Motor America, and also spent 4 years at Harte-Hanks. Ina Rajewski joins Cogensia as an Account Coordinator. Before joining Cogensia, Ina spent 5 years at Experian as an Account Manager and Marketing Services Account Director. Thomas Schar joins Cogensia as Director, Information Technology with experience in back-end pro... (more)