Skip to main content


Swipe to navigate through the articles of this issue

01-12-2014 | Research | Issue 1/2014 Open Access

Journal of Big Data 1/2014

A big data methodology for categorising technical support requests using Hadoop and Mahout

Journal of Big Data > Issue 1/2014
Arantxa Duque Barrachina, Aisling O’Driscoll
Important notes

Electronic supplementary material

The online version of this article (doi:10.​1186/​2196-1115-1-1) contains supplementary material, which is available to authorized users.

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

ADB designed the study, developed the methodology, implemented the system, performed the analysis and evaluation and wrote the manuscript. AOD supervised the project. Both authors read and approved the final manuscript.


Technical Support call centres frequently receive several thousand customer queries on a daily basis. Traditionally, such organisations discard data related to customer enquiries within a relatively short period of time due to limited storage capacity. However, in recent years, the value of retaining and analysing this information has become clear, enabling call centres to identify customer patterns, improve first call resolution and maximise daily closure rates. This paper proposes a Proof of Concept (PoC) end to end solution that utilises the Hadoop programming model, extended ecosystem and the Mahout Big Data Analytics library for categorising similar support calls for large technical support data sets. The proposed solution is evaluated on a VMware technical support dataset.
Supplementary Material
About this article

Other articles of this Issue 1/2014

Journal of Big Data 1/2014 Go to the issue

Premium Partner

    Image Credits