Skip to main content
Top
Published in: Journal on Data Semantics 1/2018

05-02-2018 | Original Article

Multi-Query Optimization on RSS Feeds

Authors: Fekade Getahun, Richard Chbeir

Published in: Journal on Data Semantics | Issue 1/2018

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

RSS feeds are text-content rich, semantically heterogeneous, and contain dynamic XML elements streamed in asynchronous and pull strategies. Hence, for efficient retrieval of RSS feeds, semantic-aware querying operators have been proposed in the literature (Getahun and Chbeir in Inf Sci 237(237):313–342, 2013). However, it is commonly admitted that the use of semantic information would improve, on one hand, the relevance of query result but, on the other hand, at the cost of degrading the efficiency and the performance of the system. To benefit from query execution on semantic information while keeping the efficiency of the system, we propose here a multi-query optimization approach for semantic RSS feed queries. Our approach processes queries by examining the semantic relationship between them and their corresponding windows. It generates a multi-query chain for queries using their window relations for faster execution at runtime. In addition, we propose an operator called quickDrop for semantic load shedding to gracefully decrease irrelevant data load. To validate the proposed approach, we developed a prototype and conducted a set of experiments. The obtained results show that the use of our approach significantly improves the performance of the system.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
A window boundary is defined in this example using time in GMT to represent the user need for a specific time of a day.
 
2
A semantic query [34] is inherently fuzzy, and the user typically expects only a subset of the full results.
 
3
Atomic query is a simple query having a source, a predicate with attribute, operator and value.
 
4
However, the building process of the value knowledge base is out of the scope of this paper.
 
5
WordNet is an online lexical reference system (taxonomy), where nouns, verbs, adjectives and adverbs are organized into synonym sets, each representing a lexical concept [Miller 1990], [WordNet 2005].
 
6
\(\ge \) and \(\le \) are considered as single operators put together using the Boolean operator OR.
 
7
Relations will be detailed in the next subsection.
 
8
RSS 0.92 is upward compatible to RSS 0.91, User land specification http://​backend.​userland.​com/​rss09x.
 
9
RSS 1.0 is also called RDF Site Summary, and it is a lightweight multipurpose extensible metadata description and syndication format conforms to the W3C’s RDF specification and is extensible via XML namespace and/or RDF-based modularization. http://​web.​resource.​org/​rss/​1.​0/​spec.
 
10
Precision level, e, is 1-confidence interval, with 95% confidence level and 50% degree of variability [30].
 
Literature
1.
go back to reference Getahun F, Chbeir R (2013) RSS query algebra: towards a better news management. Inf Sci 237(237):313–342CrossRefMATH Getahun F, Chbeir R (2013) RSS query algebra: towards a better news management. Inf Sci 237(237):313–342CrossRefMATH
3.
go back to reference Fabret F, Jacobsen HA, Llirbat F, Pereira J, Ross KA, Shasha D (2001) Filtering algorithms and implementation for very fast publish/subcribe. In: SIGMOD, pp 115–126 Fabret F, Jacobsen HA, Llirbat F, Pereira J, Ross KA, Shasha D (2001) Filtering algorithms and implementation for very fast publish/subcribe. In: SIGMOD, pp 115–126
4.
go back to reference Hammad MA, Franklin MJ, Aref WG, Elmagarmid AK (2003) Scheduling for shared window joins over data streams. In: VLDB, pp 297–308 Hammad MA, Franklin MJ, Aref WG, Elmagarmid AK (2003) Scheduling for shared window joins over data streams. In: VLDB, pp 297–308
5.
go back to reference Madden SR, Shah MA, Hellerstein JM, Raman V (2002) Continuously adaptive continuous queries over streams. In: SIGMOD, pp 49–60 Madden SR, Shah MA, Hellerstein JM, Raman V (2002) Continuously adaptive continuous queries over streams. In: SIGMOD, pp 49–60
6.
go back to reference Zhang R, Koudas N, Ooi BC, Srivastava D (2005) Multiple aggregations over data streams. In: SIGMOD, pp 299–310 Zhang R, Koudas N, Ooi BC, Srivastava D (2005) Multiple aggregations over data streams. In: SIGMOD, pp 299–310
7.
go back to reference Chi Y, Wang H, Yu PS, Muntz RR (2005) Loadstar: a load shedding scheme for classifying data streams. In: SIAM conference on data mining, pp 1302–1305 Chi Y, Wang H, Yu PS, Muntz RR (2005) Loadstar: a load shedding scheme for classifying data streams. In: SIAM conference on data mining, pp 1302–1305
8.
go back to reference Garofalakis M, Gibbons P (2001) Approximate query processing: taming the megabytes. In: VLDB, Rome Garofalakis M, Gibbons P (2001) Approximate query processing: taming the megabytes. In: VLDB, Rome
9.
go back to reference Hellerstein J, Haas P, Wang H (1997) Online aggregation. In: SIGMOD, Tucson, pp 171–182 Hellerstein J, Haas P, Wang H (1997) Online aggregation. In: SIGMOD, Tucson, pp 171–182
10.
go back to reference SELLIS TK (1988) Multiple-query optimization. ACM Trans Database Syst 13(1):23–52CrossRef SELLIS TK (1988) Multiple-query optimization. ACM Trans Database Syst 13(1):23–52CrossRef
11.
go back to reference Jarke M (1985) Common subexpression isolation in multiple query optimization. Springer, Berlin, pp 191–205 Jarke M (1985) Common subexpression isolation in multiple query optimization. Springer, Berlin, pp 191–205
12.
go back to reference Chakravarthy, US, Minker J (1986) Multiple query processing in deductive databases using query graphs. In: Proceedings of the 12th international conference on very large data bases, San Francisco, CA, pp 384–391 Chakravarthy, US, Minker J (1986) Multiple query processing in deductive databases using query graphs. In: Proceedings of the 12th international conference on very large data bases, San Francisco, CA, pp 384–391
13.
14.
go back to reference Arvind A, Jennifer W (2004) Resource sharing in continuous sliding-window aggregates. Technical Report Arvind A, Jennifer W (2004) Resource sharing in continuous sliding-window aggregates. Technical Report
15.
go back to reference Song W, Elke R, Samrat G, Sudeept B (2006) StateSlice: new paradigm of multi-query optimization of window based stream queries. In: VLDB, pp 619–630 Song W, Elke R, Samrat G, Sudeept B (2006) StateSlice: new paradigm of multi-query optimization of window based stream queries. In: VLDB, pp 619–630
16.
go back to reference Mingsheng H, Alan D, Johannes G (2007) Massively multi-query join processing in pub-lish/subscribe systems. In: SIGMOD, pp 761–772 Mingsheng H, Alan D, Johannes G (2007) Massively multi-query join processing in pub-lish/subscribe systems. In: SIGMOD, pp 761–772
17.
go back to reference Krishnamurthy S, Wu C, Franklin M (2006) On-the-fly sharing for streamed aggregation. In: SIGMOD, pp 623–634 Krishnamurthy S, Wu C, Franklin M (2006) On-the-fly sharing for streamed aggregation. In: SIGMOD, pp 623–634
18.
go back to reference Li J, David M, Kristin T, Vassilis P, Peter A (2005) No pane, no gain: efficient evaluation of sliding window aggregates over data streams. In: SIGMOD, pp 39–44 Li J, David M, Kristin T, Vassilis P, Peter A (2005) No pane, no gain: efficient evaluation of sliding window aggregates over data streams. In: SIGMOD, pp 39–44
19.
go back to reference Shenoda G, Mohamed A, Panos K, Alexandros L (2011) Optimized processing of multiple aggregate continuous queries. In: CIKM, pp 1515–1524 Shenoda G, Mohamed A, Panos K, Alexandros L (2011) Optimized processing of multiple aggregate continuous queries. In: CIKM, pp 1515–1524
20.
go back to reference Moustafa A, Michael J, Walid G, Ahmed K (2003) Scheduling for shared window joins over data streams. In: VLDB, pp 297–308 Moustafa A, Michael J, Walid G, Ahmed K (2003) Scheduling for shared window joins over data streams. In: VLDB, pp 297–308
21.
go back to reference Nesime T, Uger C, Stan Z (2003) Load shedding on data streams. In: VLDB, pp 674–683 Nesime T, Uger C, Stan Z (2003) Load shedding on data streams. In: VLDB, pp 674–683
22.
go back to reference Reiss F, Hellerstein J (2005) Data triage: an adaptive architecture for load shedding in telegraphcq. In: IEEE ICDE, Tokyo, pp 155–156 Reiss F, Hellerstein J (2005) Data triage: an adaptive architecture for load shedding in telegraphcq. In: IEEE ICDE, Tokyo, pp 155–156
23.
go back to reference Brian B, Mayur D, Rajeev M (2004) Load shedding for aggregation queries over data streams. In: ICDE, pp 155–156 Brian B, Mayur D, Rajeev M (2004) Load shedding for aggregation queries over data streams. In: ICDE, pp 155–156
25.
go back to reference Getahun F, Tekli J, Atnafu S, Chbeir R (2007) Towards efficient horizontal multimedia database fragmentation using semantic-based predicates implication. In: SBBD 2007, pp 68–82 Getahun F, Tekli J, Atnafu S, Chbeir R (2007) Towards efficient horizontal multimedia database fragmentation using semantic-based predicates implication. In: SBBD 2007, pp 68–82
26.
27.
go back to reference Brill E (1992) A simple rule based part of speech tagger. In: Applied natural language processing (ACL), pp 152–155 Brill E (1992) A simple rule based part of speech tagger. In: Applied natural language processing (ACL), pp 152–155
28.
go back to reference Getahun F, Tekli J, Viviani M, Chbeir R, Yetongnon K (2009) Towards semantic-based RSS merging. In: International symposium on intelligent interactive multimedia systems and services, pp 53–64 Getahun F, Tekli J, Viviani M, Chbeir R, Yetongnon K (2009) Towards semantic-based RSS merging. In: International symposium on intelligent interactive multimedia systems and services, pp 53–64
29.
go back to reference Getahun F, Tekli J, Chbeir R, Viviani M, Yétongnon K (2009) Relating RSS news/items. In: 9th international conference on web engineering ICWE 2009, San Sebastian, Spain, pp 442–45 Getahun F, Tekli J, Chbeir R, Viviani M, Yétongnon K (2009) Relating RSS news/items. In: 9th international conference on web engineering ICWE 2009, San Sebastian, Spain, pp 442–45
30.
go back to reference Yamane T (1967) Statistics an introductory analysis, 2nd edn. Harper and Row, New YorkMATH Yamane T (1967) Statistics an introductory analysis, 2nd edn. Harper and Row, New YorkMATH
Metadata
Title
Multi-Query Optimization on RSS Feeds
Authors
Fekade Getahun
Richard Chbeir
Publication date
05-02-2018
Publisher
Springer Berlin Heidelberg
Published in
Journal on Data Semantics / Issue 1/2018
Print ISSN: 1861-2032
Electronic ISSN: 1861-2040
DOI
https://doi.org/10.1007/s13740-018-0085-3

Premium Partner