skip to main content
10.1145/2837060.2837076acmotherconferencesArticle/Chapter ViewAbstractPublication PagesbigdasConference Proceedingsconference-collections
research-article

Big Data Mining Applications and Services

Authors Info & Claims
Published:20 October 2015Publication History

ABSTRACT

Data mining and analytics aims to analyze valuable data and extract implicit, previously unknown, and potentially useful information from the data. Due to advances in technology, high volumes of valuable data are generated at a high velocity in high varieties of data sources in various real-life business, scientific and engineering applications. Due to their high volumes, the quality and accuracy of these data depend on their veracity (uncertainty of data). This leads us into the new era of Big Data. This paper presents some works on big data mining and computing, especially on an important task of frequent pattern mining, which computes and mines from big data for interesting knowledge in the forms of frequently occurring sets of merchandise items in shopping markets, interesting co-located events, and/or popular individuals in social networks. The paper also shows how big data mining contributes to real-life applications and services.

References

  1. R. Agrawal, T. Imieliński & A. Swami A. Mining association rules between sets of items in large databases. In Proc. ACM SIGMOD 1993, pp. 207--216. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. R. Agrawal & J.C. Shafer. Parallel mining of association rules. IEEE TKDE, 8(6): 962--969 (1996) Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. R. Agrawal & R. Srikant. Fast algorithms for mining association rules in large databases. In Proc. VLDB 1994, pp. 487--499. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. A. Ceglar & J.F. Roddick. Association mining. ACM Computing Surveys, 38(2): art. 5 (2006) Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. J. Dean & S. Ghemawat, S. MapReduce: a flexible data processing tool. CACM, 53(1): 72--77 (2010) Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. J. Han, J. Pei & Y. Yin. Mining frequent patterns without candidate generation. In Proc. ACM SIGMOD 2000, pp. 1--12. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. Hipp, U. Güntzer, & G. Nakhaeizadeh. Algorithms for association rule mining -- a general survey and comparison. ACM SIGKDD Explorations, 2(1): 58--64 (2000) Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. F. Jiang, K. Kawagoe & C.K. Leung. Big social network mining for "following" patterns. In Proc. C3S2E 2015, pp. 28--37. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. F. Jiang & C.K. Leung. A business intelligence solution for frequent pattern mining on social networks. In Proc. IEEE ICDM Workshops 2014, pp. 789--796.Google ScholarGoogle ScholarCross RefCross Ref
  10. F. Jiang & C.K. Leung. A data analytic algorithm for managing, querying, and processing uncertain big data in cloud environments. Algorithms, 8(4): 1175--1194 (2015)Google ScholarGoogle ScholarCross RefCross Ref
  11. F. Jiang, C.K. Leung & R.K. MacKinnon. BigSAM: mining interesting patterns from probabilistic databases of uncertain big data. In Proc. PAKDD Workshops 2014, pp. 780--792.Google ScholarGoogle ScholarCross RefCross Ref
  12. Y. Kim, K. Shim, M.-S. Kim & J.S. Lee. DBCURE-MR: an efficient density-based clustering algorithm for large data using MapReduce. Information. Systems, 42: 15--35 (2014) Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. H. Lee, S.-T. Hong, H.J. Kim & J.-W. Chang. A travel time prediction algorithm using rule-based classification on MapReduce. In Proc. DEXA 2015, Part II, pp. 440--452.Google ScholarGoogle ScholarCross RefCross Ref
  14. C.K. Leung. Constraint-based association rule mining. In Encyclopedia of data warehousing and mining, 2nd ed., pp. 307--312 (2009)Google ScholarGoogle Scholar
  15. C.K. Leung. Mining frequent itemsets from probabilistic datasets. In Proc. EDB 2013, pp. 137--148.Google ScholarGoogle Scholar
  16. C.K. Leung. Uncertain frequent pattern mining. In C.C. Aggarwal & J. Han (eds.), Frequent pattern mining, pp. 339--367 (2014)Google ScholarGoogle Scholar
  17. C.K. Leung. A. Cuzzocrea & F. Jiang. Discovering frequent patterns from uncertain data streams with time-fading and landmark models. Transactions on Large-Scale Data- and Knowledge-Centered Systems, 8: 174--196 (2013)Google ScholarGoogle Scholar
  18. C.K. Leung & Y. Hayduk. Mining frequent patterns from uncertain data with MapReduce for big data analytics. In Proc. DASFAA 2013, Part I, pp. 440--455.Google ScholarGoogle ScholarCross RefCross Ref
  19. C.K. Leung & F. Jiang. A data science solution for mining interesting patterns from uncertain big data. In Proc. IEEE BDCloud 2014, pp. 235--242. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. C.K. Leung, M.A.F. Mateo & D.A. Brajczuk. A tree-based approach for frequent pattern mining from uncertain data. In Proc. PAKDD 2008, pp. 653--661. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. C.K. Leung, R.K. MacKinnon & S.K. Tanbeer. Fast algorithms for frequent itemset mining from uncertain data. In Proc. IEEE ICDM 2014, pp. 893--898. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. C.K. Leung, I.J.M. Medina & S.K. Tanbeer. Analyzing social networks to mine important friends. In G. Xu & L. Li (eds.), Social media mining and social network analysis: Emerging research, pp. 90--104 (2013)Google ScholarGoogle Scholar
  23. H. Li, Y. Wang, D. Zhang, M. Zhang & E.Y. Chang. PFP: parallel FP-growth for query recommendation. In Proc. ACM RecSys 2008, pp. 107--114. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. C. Liao & A.C. Squicciarini. Towards provenance-based anomaly detection in MapReduce. In Proc. IEEE/ACM CCGrid 2015, pp. 647--656.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. J. Lin & C. Dyer. Data-intensive text processing with MapReduce (2010) Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. M.-Y. Lin, P.-Y. Lee & S.-C. Hsueh. Apriori-based frequent itemset mining algorithms on MapReduce. In Proc. ICUIMC 2012, art. 76. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. S. Madden. From databases to big data. IEEE Internet Computing, 16(3): 4--6 (2012) Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. J. Pei, J. Han, H. Lu, S. Nishio, S. Tang & D. Yang. H-Mine: hyper-structure mining of frequent patterns in large databases. In Proc. IEEE ICDM 2001, pp. 441--448. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. A. Rajaraman & J.D. Ullman. Mining of massive datasets (2011) Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. A. Savasere, E. Omiecinski & S. Navathe. An efficient algorithm for mining association rules in large databases. In Proc. VLDB 1995, pp. 432--444. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. P. Shenoy, J.R. Bhalotia, M. Bawa & D. Shah. Turbo-charging vertical mining of large databases. In Proc. ACM SIGMOD 2000, pp. 22--33. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. K. Shim. MapReduce algorithms for big data analysis. PVLDB, 5(12): 2016--2017 (2012) Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Y. Tong, L. Chen, Y. Cheng & P.S. Yu. Mining frequent itemsets over uncertain databases. PVLDB, 5(11): 1650--1661 (2012) Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. J.D. Ullman. A survey of association-rule mining. In Proc. DS 2000, pp. 1--14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. K. Wang, L. Tang, J. Han & J. Liu. Top down FP-growth for association rule mining. In Proc. PAKDD 2002, pp 334--340. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. M.J. Zaki. Parallel and distributed association mining: a survey. IEEE Concurrency, 7(4): 14--25 (1999) Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. M.J. Zaki. Scalable algorithms for association mining. IEEE TKDE, 12(3): 372--390. Google ScholarGoogle ScholarDigital LibraryDigital Library
  1. Big Data Mining Applications and Services

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      BigDAS '15: Proceedings of the 2015 International Conference on Big Data Applications and Services
      October 2015
      321 pages
      ISBN:9781450338462
      DOI:10.1145/2837060

      Copyright © 2015 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 20 October 2015

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader