Skip to main content
Top

2023 | OriginalPaper | Chapter

Sparse Transformer Hawkes Process for Long Event Sequences

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Large quantities of asynchronous event sequence data such as crime records, emergence call logs, and financial transactions are becoming increasingly available from various fields. These event sequences often exhibit both long-term and short-term temporal dependencies. Variations of neural network based temporal point processes have been widely used for modeling such asynchronous event sequences. However, many current architectures including attention based point processes struggle with long event sequences due to computational inefficiency. To tackle the challenge, we propose an efficient sparse transformer Hawkes process (STHP), which has two components. For the first component, a transformer with a novel temporal sparse self-attention mechanism is applied to event sequences with arbitrary intervals, mainly focusing on short-term dependencies. For the second component, a transformer is applied to the time series of aggregated event counts, primarily targeting the extraction of long-term periodic dependencies. Both components complement each other and are fused together to model the conditional intensity function of a point process for future event forecasting. Experiments on real-world datasets show that the proposed STHP outperforms baselines and achieves significant improvement in computational efficiency without sacrificing prediction performance for long sequences.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Bacry, E., Mastromatteo, I., Muzy, J.F.: Hawkes processes in finance. Market Microstruct. Liquidity 1(01), 1550005 (2015)CrossRef Bacry, E., Mastromatteo, I., Muzy, J.F.: Hawkes processes in finance. Market Microstruct. Liquidity 1(01), 1550005 (2015)CrossRef
2.
go back to reference Bai, T., et al.: CTRec: a long-short demands evolution model for continuous-time recommendation. In: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 675–684 (2019) Bai, T., et al.: CTRec: a long-short demands evolution model for continuous-time recommendation. In: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 675–684 (2019)
4.
go back to reference Child, R., Gray, S., Radford, A., Sutskever, I.: Generating long sequences with sparse transformers. arXiv preprint arXiv:1904.10509 (2019) Child, R., Gray, S., Radford, A., Sutskever, I.: Generating long sequences with sparse transformers. arXiv preprint arXiv:​1904.​10509 (2019)
5.
go back to reference Deshpande, P., Marathe, K., De, A., Sarawagi, S.: Long horizon forecasting with temporal point processes. In: Proceedings of the 14th ACM International Conference on Web Search and Data Mining, pp. 571–579 (2021) Deshpande, P., Marathe, K., De, A., Sarawagi, S.: Long horizon forecasting with temporal point processes. In: Proceedings of the 14th ACM International Conference on Web Search and Data Mining, pp. 571–579 (2021)
6.
go back to reference Du, N., Dai, H., Trivedi, R., Upadhyay, U., Rodriguez, M., Song, L.: Recurrent marked temporal point process. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pp. 447–456 (2016) Du, N., Dai, H., Trivedi, R., Upadhyay, U., Rodriguez, M., Song, L.: Recurrent marked temporal point process. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pp. 447–456 (2016)
7.
go back to reference Du, N., Dai, H., Trivedi, R., Upadhyay, U., Gomez-Rodriguez, M., Song, L.: Recurrent marked temporal point processes: embedding event history to vector. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pp. 1555–1564 (2016) Du, N., Dai, H., Trivedi, R., Upadhyay, U., Gomez-Rodriguez, M., Song, L.: Recurrent marked temporal point processes: embedding event history to vector. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pp. 1555–1564 (2016)
8.
go back to reference Farajtabar, M., Du, N., Rodriguez, M.G., Valera, I., Zha, H., Song, L.: Shaping social activity by incentivizing users. In: Proceedings of the Annual Conference on Neural Information Processing Systems (NeurIPS), pp. 2474–2482 (2014) Farajtabar, M., Du, N., Rodriguez, M.G., Valera, I., Zha, H., Song, L.: Shaping social activity by incentivizing users. In: Proceedings of the Annual Conference on Neural Information Processing Systems (NeurIPS), pp. 2474–2482 (2014)
9.
go back to reference Farajtabar, M., Wang, Y., Rodriguez, M.G., Li, S., Zha, H., Song, L.: Coevolve: a joint point process model for information diffusion and network co-evolution. In: Proceedings of the Annual Conference on Neural Information Processing Systems (NeurIPS), pp. 1954–1962 (2015) Farajtabar, M., Wang, Y., Rodriguez, M.G., Li, S., Zha, H., Song, L.: Coevolve: a joint point process model for information diffusion and network co-evolution. In: Proceedings of the Annual Conference on Neural Information Processing Systems (NeurIPS), pp. 1954–1962 (2015)
12.
go back to reference Jaszczur, S., et al.: Sparse is enough in scaling transformers. Adv. Neural. Inf. Process. Syst. 34, 9895–9907 (2021) Jaszczur, S., et al.: Sparse is enough in scaling transformers. Adv. Neural. Inf. Process. Syst. 34, 9895–9907 (2021)
13.
go back to reference Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: Proceedings International Conference on Learning Representations (ICLR) (2015) Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: Proceedings International Conference on Learning Representations (ICLR) (2015)
14.
go back to reference Liu, Z., et al.: Swin transformer: hierarchical vision transformer using shifted windows. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 10012–10022 (2021) Liu, Z., et al.: Swin transformer: hierarchical vision transformer using shifted windows. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 10012–10022 (2021)
15.
go back to reference Mei, H., Eisner, J.M.: The neural Hawkes process: a neurally self-modulating multivariate point process. In: Proceedings of the Annual Conference on Neural Information Processing Systems (NeurIPS), pp. 6754–6764 (2017) Mei, H., Eisner, J.M.: The neural Hawkes process: a neurally self-modulating multivariate point process. In: Proceedings of the Annual Conference on Neural Information Processing Systems (NeurIPS), pp. 6754–6764 (2017)
16.
go back to reference Mohler, G., Porter, M.D., Carter, J., LaFree, G.: Learning to rank spatio-temporal event hotspots. In: Proceedings of the 7th International Workshop on Urban Computing (2018) Mohler, G., Porter, M.D., Carter, J., LaFree, G.: Learning to rank spatio-temporal event hotspots. In: Proceedings of the 7th International Workshop on Urban Computing (2018)
17.
go back to reference Mohler, G., Raje, R., Carter, J., Valasik, M., Brantingham, J.: A penalized likelihood method for balancing accuracy and fairness in predictive policing. In: Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 2454–2459 (2018) Mohler, G., Raje, R., Carter, J., Valasik, M., Brantingham, J.: A penalized likelihood method for balancing accuracy and fairness in predictive policing. In: Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 2454–2459 (2018)
18.
go back to reference Ross, S.M., et al.: Stochastic Processes, vol. 2. Wiley, New York (1996)MATH Ross, S.M., et al.: Stochastic Processes, vol. 2. Wiley, New York (1996)MATH
19.
go back to reference Shang, J., Sun, M.: Geometric Hawkes processes with graph convolutional recurrent neural networks. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 4878–4885 (2019) Shang, J., Sun, M.: Geometric Hawkes processes with graph convolutional recurrent neural networks. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 4878–4885 (2019)
20.
go back to reference Shelton, C.R., Qin, Z., Shetty, C.: Hawkes process inference with missing data. In: Proceedings of the AAAI Conference on Artificial Intelligence (2018) Shelton, C.R., Qin, Z., Shetty, C.: Hawkes process inference with missing data. In: Proceedings of the AAAI Conference on Artificial Intelligence (2018)
21.
go back to reference Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30 (2017) Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30 (2017)
22.
go back to reference Wang, L., Zhang, W., He, X., Zha, H.: Supervised reinforcement learning with recurrent neural network for dynamic treatment recommendation. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 2447–2456 (2018) Wang, L., Zhang, W., He, X., Zha, H.: Supervised reinforcement learning with recurrent neural network for dynamic treatment recommendation. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 2447–2456 (2018)
24.
go back to reference Xiao, S., Farajtabar, M., Ye, X., Yan, J., Song, L., Zha, H.: Wasserstein learning of deep generative point process models. In: Advances in Neural Information Processing Systems, vol. 30 (2017) Xiao, S., Farajtabar, M., Ye, X., Yan, J., Song, L., Zha, H.: Wasserstein learning of deep generative point process models. In: Advances in Neural Information Processing Systems, vol. 30 (2017)
25.
go back to reference Xu, H., Farajtabar, M., Zha, H.: Learning granger causality for Hawkes processes. In: Proceedings of the International Conference on Machine Learning (ICML), pp. 1717–1726 (2016) Xu, H., Farajtabar, M., Zha, H.: Learning granger causality for Hawkes processes. In: Proceedings of the International Conference on Machine Learning (ICML), pp. 1717–1726 (2016)
26.
go back to reference Yan, X., Lin, L., Mitra, N.J., Lischinski, D., Cohen-Or, D., Huang, H.: ShapeFormer: transformer-based shape completion via sparse representation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6239–6249 (2022) Yan, X., Lin, L., Mitra, N.J., Lischinski, D., Cohen-Or, D., Huang, H.: ShapeFormer: transformer-based shape completion via sparse representation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6239–6249 (2022)
27.
go back to reference Zhang, Q., Lipani, A., Kirnap, O., Yilmaz, E.: Self-attentive Hawkes process. In: International Conference on Machine Learning, pp. 11183–11193. PMLR (2020) Zhang, Q., Lipani, A., Kirnap, O., Yilmaz, E.: Self-attentive Hawkes process. In: International Conference on Machine Learning, pp. 11183–11193. PMLR (2020)
28.
go back to reference Zhou, H., et al.: Informer: beyond efficient transformer for long sequence time-series forecasting. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 11106–11115 (2021) Zhou, H., et al.: Informer: beyond efficient transformer for long sequence time-series forecasting. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 11106–11115 (2021)
29.
go back to reference Zuo, S., Jiang, H., Li, Z., Zhao, T., Zha, H.: Transformer Hawkes process. In: International Conference on Machine Learning, pp. 11692–11702. PMLR (2020) Zuo, S., Jiang, H., Li, Z., Zhao, T., Zha, H.: Transformer Hawkes process. In: International Conference on Machine Learning, pp. 11692–11702. PMLR (2020)
Metadata
Title
Sparse Transformer Hawkes Process for Long Event Sequences
Authors
Zhuoqun Li
Mingxuan Sun
Copyright Year
2023
DOI
https://doi.org/10.1007/978-3-031-43424-2_11

Premium Partner