Home Technology Data Mining: For Gold

Data Mining: For Gold

Data Mining: For Gold
Fig. 8: Big Data being processed at Google’s data centres (Source: www.whatsthebigdata.com)

Law and order. In the last few years there has been a sharp increase in crime rate in India, a country that is as vast as it is diverse. The police have a very challenging role in terms of thinking more intelligently than the criminals and staying ahead of them. One key result area is higher investigative effectiveness and the need for user-friendly interactive data interfaces when looking for clues and verifying records, among other activities.

Fig. 6: Data mining—a useful tool for various types of complex financial analysis (Source: www.marketoracle.co.uk)
Fig. 6: Data mining—a useful tool for various types of complex financial analysis (Source: www.marketoracle.co.uk)
Fig. 7: UPS, the world’s largest courier company receives 39.5 million tracking requests per day (Source: www.mb.com.ph)
Fig. 7: UPS, the world’s largest courier company receives 39.5 million tracking requests per day (Source: www.mb.com.ph)

National Crime Record Bureau (NCRB) maintains a large crime database and uses crime data mining techniques such as clustering. In earlier times, extracting and exchanging information among police agencies was a very time-consuming process and often data was not available in the time of need.

The government of India made use of IT and implemented a government-to-government (G2G) model called crime criminal information system (CCIS). This system is designed to create computerised storage, analysis and retrieval of criminal records. For example, if a person has been robbed of his or her mobile phone, the police can use data mining tools to search the IMEI number on the basis of the mobile phone number and get it blocked.

Healthcare. Healthcare transactions generate a vast amount of data, and this too is multi-faceted and large to be processed and analysed by conventional means. Data mining can advance decision making by discovering patterns and trends in large amounts of complex data.

In healthcare, major application areas include evaluation of treatment effectiveness, management of healthcare, customer relationship management (CRM), and detection of fraud and abuse.

By comparing and contrasting causes, symptoms and courses of treatments, data mining can provide an analysis of the courses of action that can prove to be successful. For example, results of patient groups treated with various drug regimens for the same ailment can be compared to establish which treatment is most effective and offers higher value for money.

Financial applications. Data mining is a very useful tool in this sector, and forecasting stock markets, currency exchange rates, bank bankruptcies, understanding and managing financial risks, trading futures, credit ratings, loan management, bank customer profiling and money-laundering analyses are some core tasks.

Cross-selling refers to selling a range of related products to a single customer profitably, based on certain assumptions. These assumptions are based on understanding products that have synergy with customer profiles and their requirements.

Fig. 8: Big Data being processed at Google’s data centres (Source: www.whatsthebigdata.com)
Fig. 8: Big Data being processed at Google’s data centres (Source: www.whatsthebigdata.com)

Banks often attempt to cross-sell credit cards, pensions and life insurance policies. Standard Chartered Grindlays Bank faced severe competition from other banks due to poaching, similarity in products and services and other factors. Its business intelligence unit using data analysis and a test-and-learn culture was able to find out the likelihood of customers to take on a new product. For example, it knew which of its card members were more likely to take an auto loan, resulting in more focused marketing campaigns and reduced costs with improved customer satisfaction. As a result, the marketing department was empowered with information to increase cross-holding, target most valuable customers and also help in the next best product strategy for a customer.

It is expected that there will be a high growth of hybrid methods, that is, those that merge diverse models and deliver better performance vis-à-vis individuals in the area of data mining in finance.

In this integrative approach, individual models work like trained artificial experts. Hence, their combinations can be oriented in the same manner as consultation by human experts. Also, these artificial experts can be successfully pooled with human experts. In times to come, these artificial experts will be configured as autonomous intelligent software agents.

Sales and marketing applications. Data mining is used to arrive at a customer’s value, also known as lifetime value (LTV), which is a useful concept in measuring customer retention. Data mining can also be used to predict a customer’s likelihood to switch to competition. Probability scores for each customer can be calculated based on certain given inputs, for example, a churn score of 0.85 can be read as an 85 per cent chance of cancelling service.

While talking about data mining, we cannot ignore the popular buzzword of recent times—Big Data. Big Data slowly came to be differentiated from small data since it was not generated purely by a firm’s internal transaction systems. It was externally sourced as well, coming from the Internet, sensors of different kinds, public data initiatives such as the human genome project, and captures of audio and video recordings.

Imagine this, UPS, the world’s largest courier company, captures information on as many as 16.3 million packages, on average, that it delivers daily, and it receives 39.5 million tracking requests a day. The most recent source of Big Data at UPS is the use of telematics sensors in almost 50,000 company trucks that track parameters such as speed, direction and braking.

It is said that it is almost like there are two eras for organisations: Before Big Data, or BBD, and After Big Data, or ABD.

In BBD era, analysts spent much of their time preparing data for analysis and relatively lesser time on the more important part, that is, the analysis itself. Most business intelligence activity catered to only what had happened in the past and offered no enlightenments or predictions.

The ABD phase began around 2005 when we saw several dotcoms or Internet based and social network firms come up, mainly in the Silicon Valley. These firms such as Google and eBay started gathering and dissecting new type of data.

LinkedIn, for example, today offers several value-added data products such as people you may know, jobs you may be interested in and so on. This is all a result of data mining.

As the Internet penetration increases and customers become more demanding, greater will be the demand on more sophisticated infrastructure, that is, hardware, software and apps.

Also, there will be a need for sharp and smartdata scientists who can slice and dice the data in a number of ways.

We should not forget that companies like Google, LinkedIn, Facebook and Amazon became what they are today not by giving customers mere information but by giving them shortcuts to key decisions and actions that helped make their life better.