Skip to main content
Erschienen in: Journal of Intelligent Information Systems 3/2020

20.06.2019

IncompFuse: a logical framework for historical information fusion with inaccurate data sources

verfasst von: Jiawei Xu, Vladimir Zadorozhny, John Grant

Erschienen in: Journal of Intelligent Information Systems | Ausgabe 3/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

We propose a novel framework, called IncompFuse, that significantly improves the accuracy of existing methods for reconstructing aggregated historical data from inaccurate historical reports. IncompFuse supports efficient data reliability assessment using the incompatibility probability of historical reports. We provide a systematic approach to define this probability based on properties of the data and relationships between the reports. Our experimental study demonstrates high utility of the proposed framework. In particular, we were able to detect noisy historical reports with very high detection accuracy.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Almutairi, F.M., Yang, F., Song, H.A., Faloutsos, C., Sidiropoulos, N., Zadorozhny, V. (2018). Homerun: scalable sparse-spectrum reconstruction of aggregated historical data. Journal Proceedings of the VLDB Endowment, 11(11), 1496–1508.CrossRef Almutairi, F.M., Yang, F., Song, H.A., Faloutsos, C., Sidiropoulos, N., Zadorozhny, V. (2018). Homerun: scalable sparse-spectrum reconstruction of aggregated historical data. Journal Proceedings of the VLDB Endowment, 11(11), 1496–1508.CrossRef
Zurück zum Zitat Askarizade, M., Nematbakhsh, M.A., Davoodi Jam, E. (2012). Data conflict resolution among same entities in web of data. In: 2012 2nd International eConference on Computer and Knowledge Engineering (ICCKE) (pp. 278–282). Askarizade, M., Nematbakhsh, M.A., Davoodi Jam, E. (2012). Data conflict resolution among same entities in web of data. In: 2012 2nd International eConference on Computer and Knowledge Engineering (ICCKE) (pp. 278–282).
Zurück zum Zitat Bohannon, P., Fan, W., Flaster, M., Rastogi, R. (2005). A cost-based model and effective heuristic for repairing constraints by value modification. In: Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data (pp. 143–154). ACM. Bohannon, P., Fan, W., Flaster, M., Rastogi, R. (2005). A cost-based model and effective heuristic for repairing constraints by value modification. In: Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data (pp. 143–154). ACM.
Zurück zum Zitat Dong, X.L., Berti-Equille, L., Srivastava, D. (2009). Integrating conflicting data: the role of source dependence. Journal Proceedings of the VLDB Endowment, 2 (1), 550–561.CrossRef Dong, X.L., Berti-Equille, L., Srivastava, D. (2009). Integrating conflicting data: the role of source dependence. Journal Proceedings of the VLDB Endowment, 2 (1), 550–561.CrossRef
Zurück zum Zitat Dong, X.L., & Naumann, F. (2009). Data fusion: resolving data conflicts for integration. Journal Proceedings of the VLDB Endowment, 2(2), 1654–1655.CrossRef Dong, X.L., & Naumann, F. (2009). Data fusion: resolving data conflicts for integration. Journal Proceedings of the VLDB Endowment, 2(2), 1654–1655.CrossRef
Zurück zum Zitat Dong, X.L., Saha, B., Srivastava, D. (2012). . Less is More:, Selecting Sources Wisely for Integration, 6(2), 37–48. Dong, X.L., Saha, B., Srivastava, D. (2012). . Less is More:, Selecting Sources Wisely for Integration, 6(2), 37–48.
Zurück zum Zitat Galland, A., Abiteboul, S., Marian, A., Senellart, P. (2010). Corroborating information from disagreeing views. In: Proceedings of the third ACM international conference on Web search and data mining (pp. 131–140). ACM. Galland, A., Abiteboul, S., Marian, A., Senellart, P. (2010). Corroborating information from disagreeing views. In: Proceedings of the third ACM international conference on Web search and data mining (pp. 131–140). ACM.
Zurück zum Zitat Grant, J., & Martinez, M.V. (2018). Measuring Inconsistency in Information. College Publications. Grant, J., & Martinez, M.V. (2018). Measuring Inconsistency in Information. College Publications.
Zurück zum Zitat Levien, R. (2009). Attack-Resistant Trust Metrics, (pp. 121–132). Berlin: Springer. Levien, R. (2009). Attack-Resistant Trust Metrics, (pp. 121–132). Berlin: Springer.
Zurück zum Zitat Li, X., Dong, X.L., Lyons, K., Meng, W., Srivastava, D. (2012). . Truth Finding on the Deep Web:, Is the Problem Solved?, 6, 97–108. Li, X., Dong, X.L., Lyons, K., Meng, W., Srivastava, D. (2012). . Truth Finding on the Deep Web:, Is the Problem Solved?, 6, 97–108.
Zurück zum Zitat Liu, Z., Song, H.A., Zadorozhny, V., Faloutsos, C., Sidiropoulos, N. (2017). Hfuse: Efficient fusion of aggregated historical data. In: Proceedings of SIAM International Conference on Data Mining. Liu, Z., Song, H.A., Zadorozhny, V., Faloutsos, C., Sidiropoulos, N. (2017). Hfuse: Efficient fusion of aggregated historical data. In: Proceedings of SIAM International Conference on Data Mining.
Zurück zum Zitat Page, L., Brin, S., Motwani, R., Winograd, T. (1999). The pagerank citation ranking: Bringing order to the Web. Report, Stanford InfoLab. Page, L., Brin, S., Motwani, R., Winograd, T. (1999). The pagerank citation ranking: Bringing order to the Web. Report, Stanford InfoLab.
Zurück zum Zitat Pasternack, J., & Roth, D. (2010). Knowing what to believe (when you already know something). In: Proceedings of the 23rd International Conference on Computational Linguistics (pp. 877–885). Association for Computational Linguistics. Pasternack, J., & Roth, D. (2010). Knowing what to believe (when you already know something). In: Proceedings of the 23rd International Conference on Computational Linguistics (pp. 877–885). Association for Computational Linguistics.
Zurück zum Zitat Resnick, P., Kuwabara, K., Zeckhauser, R., Friedman, E. (2000). Reputation systems. Communications of the ACM, 43(12), 45–48.CrossRef Resnick, P., Kuwabara, K., Zeckhauser, R., Friedman, E. (2000). Reputation systems. Communications of the ACM, 43(12), 45–48.CrossRef
Zurück zum Zitat Sharma, D. (2010). Efficient information access in data-intensive sensor networks. PhD dissertation, University of Pittsburgh. Sharma, D. (2010). Efficient information access in data-intensive sensor networks. PhD dissertation, University of Pittsburgh.
Zurück zum Zitat Staworko, S., & Chomicki, J. (2010). Consistent query answers in the presence of universal constraints. Information Systems, 35(1), 1–22.CrossRef Staworko, S., & Chomicki, J. (2010). Consistent query answers in the presence of universal constraints. Information Systems, 35(1), 1–22.CrossRef
Zurück zum Zitat Thimm, M. (2018). On the evaluation of inconsistency measures. In Grant, J., & Martinez, M.V. (Eds.) Measuring Inconsistency in Information. College Publications, London, UK. Thimm, M. (2018). On the evaluation of inconsistency measures. In Grant, J., & Martinez, M.V. (Eds.) Measuring Inconsistency in Information. College Publications, London, UK.
Zurück zum Zitat Yi, R., Zadorozhny, V., Oleshchuk, V., Li, F. (2014). A novel approach to trust management in unattended wireless sensor networks. IEEE Transactions on Mobile Computing, 13(7), 1409–1423.CrossRef Yi, R., Zadorozhny, V., Oleshchuk, V., Li, F. (2014). A novel approach to trust management in unattended wireless sensor networks. IEEE Transactions on Mobile Computing, 13(7), 1409–1423.CrossRef
Zurück zum Zitat Yin, X., Han, J., Philip, S.Y. (2008). Truth discovery with multiple conflicting information providers on the Web. IEEE Transactions on Knowledge and Data Engineering, 20(6), 796–808.CrossRef Yin, X., Han, J., Philip, S.Y. (2008). Truth discovery with multiple conflicting information providers on the Web. IEEE Transactions on Knowledge and Data Engineering, 20(6), 796–808.CrossRef
Zurück zum Zitat Yin, X., & Tan, W. (2011). Semi-supervised truth discovery. In: Proceedings of the 20th International Conference on World Wide Web (pp. 217–226). ACM. Yin, X., & Tan, W. (2011). Semi-supervised truth discovery. In: Proceedings of the 20th International Conference on World Wide Web (pp. 217–226). ACM.
Zurück zum Zitat Zadorozhny, V., & Grant, J. (2016). A systematic approach to reliability assessment in integrated databases. Journal of Intelligent Information Systems, 46(3), 409–424.CrossRef Zadorozhny, V., & Grant, J. (2016). A systematic approach to reliability assessment in integrated databases. Journal of Intelligent Information Systems, 46(3), 409–424.CrossRef
Zurück zum Zitat Zadorozhny, V., & Hsu, Y.-F. (2011). Scalable Uncertainty Management. Fifth International Conference Proceedings. In Benferhat, S., & Grant, J. (Eds.) (pp. 331–345). Berlin: Springer. Zadorozhny, V., & Hsu, Y.-F. (2011). Scalable Uncertainty Management. Fifth International Conference Proceedings. In Benferhat, S., & Grant, J. (Eds.) (pp. 331–345). Berlin: Springer.
Zurück zum Zitat Zadorozhny, V., Krishnamurthy, P., Abdelhakim, M., Pelechrinis, K., Xu, J. (2017). Data credence in iot: Vision and challenges. Open Journal of Internet of Things (OJIOT), 3(1), 114–126. Special Issue:, Proceedings of the International Workshop on Very Large Internet of Things (VLIoT 2017) in conjunction with the VLDB 2017 Conference., 3(1):114–126. Zadorozhny, V., Krishnamurthy, P., Abdelhakim, M., Pelechrinis, K., Xu, J. (2017). Data credence in iot: Vision and challenges. Open Journal of Internet of Things (OJIOT), 3(1), 114–126. Special Issue:, Proceedings of the International Workshop on Very Large Internet of Things (VLIoT 2017) in conjunction with the VLDB 2017 Conference., 3(1):114–126.
Zurück zum Zitat Zadorozhny, V., & Lewis, M. (2013). Information fusion for usar operations based on crowdsourcing. In: 2013 16th International Conference on Information Fusion (FUSION) (pp. 1450–1457). Zadorozhny, V., & Lewis, M. (2013). Information fusion for usar operations based on crowdsourcing. In: 2013 16th International Conference on Information Fusion (FUSION) (pp. 1450–1457).
Zurück zum Zitat Zadorozhny, V., Manning, P., Bain, D.J., Mostern, R. (2013). . Journal of World-Historical Information: JWHI, 1(1), 1.CrossRef Zadorozhny, V., Manning, P., Bain, D.J., Mostern, R. (2013). . Journal of World-Historical Information: JWHI, 1(1), 1.CrossRef
Zurück zum Zitat Zadorozhny, V., & Raschid, L. (2007). Alternative path selection in resilient web infrastructure using performance dependencies. Journal of Web Engineering, 6(2), 121–130. Zadorozhny, V., & Raschid, L. (2007). Alternative path selection in resilient web infrastructure using performance dependencies. Journal of Web Engineering, 6(2), 121–130.
Zurück zum Zitat Ziegler, C.-N., & Lausen, G. (2004). Spreading activation models for trust propagation. In: EEE’04. 2004 IEEE International Conference on e-Technology, e-Commerce and e-Service, 2004 (pp. 83–97). Ziegler, C.-N., & Lausen, G. (2004). Spreading activation models for trust propagation. In: EEE’04. 2004 IEEE International Conference on e-Technology, e-Commerce and e-Service, 2004 (pp. 83–97).
Metadaten
Titel
IncompFuse: a logical framework for historical information fusion with inaccurate data sources
verfasst von
Jiawei Xu
Vladimir Zadorozhny
John Grant
Publikationsdatum
20.06.2019
Verlag
Springer US
Erschienen in
Journal of Intelligent Information Systems / Ausgabe 3/2020
Print ISSN: 0925-9902
Elektronische ISSN: 1573-7675
DOI
https://doi.org/10.1007/s10844-019-00569-6

Weitere Artikel der Ausgabe 3/2020

Journal of Intelligent Information Systems 3/2020 Zur Ausgabe