The Way from Data to Information

Data Mining

Subscribe to Data Mining: eMailAlertsEmail Alerts newslettersWeekly Newsletters
Get Data Mining: homepageHomepage mobileMobile rssRSS facebookFacebook twitterTwitter linkedinLinkedIn


Top Stories

What Tomorrow's Business Leaders Need to Know About Machine Learning Sometimes I write a blog just to formulate and organize a point of view, and I think it’s time that I pull together the bounty of excellent information about Machine Learning. This is a topic with which business leaders must become comfortable, especially tomorrow’s business leaders (tip for my next semester University of San Francisco business students!). Machine learning is a key capability that will help organizations drive optimization and monetization opportunities, and there have been some recent developments that will place basic machine learning capabilities into the hands of the lines of business. By the way, there is an absolute wealth of freely-available material on machine learning, so I’ve included a sources section at the end of this blog for folks who want more details on machine lea... (more)

AI Is Not “Fake” Intelligence | @ExpoDX @Schmarzo #DX #ArtificialIntelligence

Quick quiz! What’s the first thing that comes to mind when you hear the following phrases? Artificial grass Artificial sweeteners Artificial flavors Artificial plants Artificial flowers Artificial diamonds and jewelry Artificial (fake) news These phrases probably evoke thoughts such as “fake,” “not real,” or even “shabby.” Artificial is such a harsh adjective. The word “artificial” is defined as “imitation; simulated; sham” with synonyms such as fake, false, mock, counterfeit, bogus, phony and factitious. The word “artificial” may not be the right term to use to describe “Artificial Intelligence,” because “artificial intelligence” is anything but fake, false, phony, or a sham. Maybe a better term is “Augmented Human Intelligence,” or a phrase that highlights both the importance of augmenting the human’s intelligence as well as to alleviate the fears that AI means ... (more)

Big Data Isn’t a Thing; Big Data is a State of Mind

“Big Data is dead.” “Big Data is passé.” “We no longer need Big Data; we need Machine Learning now.” As we end 2017 and look forward to big (data) things in 2018, the most important lessons of 2017 – in fact, maybe the most important lesson going forward – is that Big Data is NOT a thing. Big Data isn’t about the volume, variety or velocity of data any more than car racing is about the gasoline. Big Data is a state of mind. Big Data is about becoming more effective at leveraging data and analytics to power your business models (see Figure 1). Figure 1: Becoming More Effective at Leveraging Big Data to Power your Business   Big Data is a State of Mind Big Data is about improving an organization’s ability to leverage data and analytics to power their business models; to optimize key business and operational use cases; reduce security and compliance risk; to uncover n... (more)

In-Stream Processing | @CloudExpo @robinAKAroblimo #BigData #AI #BI #DX

Most of us have moved our web and e-commerce operations to the cloud, but we are still getting sales reports and other information we need to run our business long after the fact. We sell a hamburger on Tuesday, you might say, but don't know if we made money selling it until Friday. That's because we still rely on Batch processing, where we generate orders, reports, and other management-useful pieces of data when it's most convenient for the IT department to process them, rather than in real time. That was fine when horse-drawn wagons made our deliveries, but it is far too slow for today's world, where stock prices and other bits of information circle the world (literally) at the speed of light. It's time to move to In-Stream Processing. You can't - and shouldn't - keep putting it off. [Figure 1, courtesy of the Grid Dynamics Blog] This diagram may look complicate... (more)

Data Mining Taken to a New Level

By Marcus Williams Some hot topics we are tracking: Data Mining Taken to a New Level During a recent expo Raytheon, the 5th largest defense contractor, displayed how their Rapid Information Overlay Technology could collect data a on user.  RIOT was designed to search through well known social media sites such as Facebook, Twitter, and Foursquare to gather information that could be  linked to a person’s everyday activity by the hour. Read More Budget Cuts that target Data Centers Across the Nation The imitative to close 400 data centers by October is predicted to save 5 billion dollars by 2015.  Consolidating  government data centers and utilizing the cloud could decrease in cost and increase in productivity over the next two years.  Read More Department of Defense and the Intelligence Community Searching For A Similar Solution Both entities are constructing a compreh... (more)

Demystifying #DataScience | @CloudExpo #BigData #AI #ArtificialIntelligence

[Opening Scene]: Billy Dean is pacing the office. He’s struggling to keep his delivery trucks at full capacity and on the road. Random breakdowns, unexpected employee absences, and unscheduled truck maintenance are impacting bookings, revenues and ultimately customer satisfaction. He keeps hearing from his business customers how they are leveraging data science to improve their business operations. Billy Dean starts to wonder if data science can help him. As he contemplates what data science can do for him, he slowly drifts off to sleep, and visions of Data Science starts dancing in his head… [Poof! Suddenly Wizard Wei appears]: Hi, I’m your data science wizard to help alleviate your data science concerns. I don’t understand why folks try to make the data science discussion complicated. Let’s start simple with a simple definition of data science: Data science is a... (more)

The Real Time Infrastructure Ultimatum

Infrastructure 2.0 Journal For months the infrastructure 2.0 blog has talked about the automation of IT from a network perspective, including the automation of the network itself. While few may question the need for network automation most businesses today still run their networks like they ran their “supply chains” decades ago, before the network. This great irony is about to change. Here’s why: As virtualization entered the data center it became an accidental standard bearer for network automation. The power of virtualization helped to drive a cultural (including x as a service) shift in expectations, just as Nicholas Carr was declaring war on traditional “old world” IT with the help of Google, Amazon and a host of other cloud (and not so cloud) players. IT directors watched operations pros create VMs in seconds while network teams could take hours (or days) to si... (more)

US Government Saves $5.5B From Cloud

Watch CloudViews Unplugged Watch the latest edition of CloudViews Unplugged – a monthly video blog analyzing the top cloud news stories. Cloud News A recent study revealed that the government is saving around $5.5 billion a year since the shift to cloud, according to this CRN article. The study was created from interviews with 108 federal IT managers and CIOs and was published by MeriTalk Cloud Computing Exchange. The study also found that if they had been more aggressive in cloud adoption, the government could have saved $12 billion. The cloud storage wars are heating up with Google’s entry into the market. Google announced Google Drive last week and since then, competing services, such as Dropbox and Microsoft SkyDrive, have announced additional features to draw customers to their service, according to this eWeek article. VMware hypervisor source code was released... (more)

Revolution Analytics: R Language Features

R is an incredibly comprehensive statistics package. Even if you just look at the standard R distribution (the base and recommended packages), R can do pretty much everything you need for data manipulation, visualization, and statistical analysis. And for everything else, there's more than 5000 packages on CRAN and other repositories, and the big-data capabilities of Revolution R Enterprise. As a result, trying to make a list of everything R can do is a difficult task. But we've made an effort in this list of R Language Features, a new section on the Revolution Analytics website. It's broken up into four main sections (analytics, graphics and visualization, R applications and extensions, and programming language features), each with their own subsections: ANALYTICS Basic Mathematics Basic Statistics Probability Distributions Big Data Analytics * Machine Learning Opt... (more)

Application Performance Management Done Right

What is Application Performance Management (APM)? Like a lot of good questions, it depends on your business needs.  What is the goal of an ideal APM?  Does it mean 99.999% availability?  Perhaps it is a favorable overall end user experience when using the application but, as compared to what? My point is that Application Performance Management / Monitoring means different things to different businesses and it can even depend on the application involved. What is the Goal of APM “Begin with the goal in mind.” I wish I could take credit for that quote.  What is the goal of the APM? Have you listed out the objectives you hope to obtain from your APM strategy?  This approach will help your team ensure satisfaction with the final solution chosen.  Here are some examples. Minimum of 99.999% availability with lower Mean Time To Know (MTTK) and Mean Time To Repair (MTTR) Less ... (more)

The CTOvision Disruptive IT List: Firms we believe all enterprise technologists should be tracking

By BobGourley Disruptive IT List The CTOvision.com Disruptive IT List is our assessment of the technology firms with the greatest potential for virtuous disruption of enterprise IT. Our goal is to provide enterprise CTOs with advanced notice of firms they should be evaluating now for use in transforming their technology base. We believe the firms here meet a threshold of significance that warrants special attention.   The CTOvision.com Disruptive IT List includes:   10gen: Production support for MongoDB 10gen’s comprehensive range of services enable you to get the most out of commercial-grade deployments of MongoDB. 10gen develops MongoDB, and offers production support, … [Read More...] Actifio: Radically Simple Actifio solutions are deployed in physical, virtual or hybrid IT environments in enterprise IT organizationsacross all vertical markets and in managed or cl... (more)