No abstract available.
Proceeding Downloads
The web of linked data: a global public dataspace on the web: WebDB 2010 keynote
In 2005, Michael Franklin, Alon Halevy, and David Maier coined the term dataspaces as a new abstraction and target architecture for data management. In 2006, Tim Berners-Lee introduced the Linked Data principles a set of best practices for publishing ...
An agglomerative query model for discovery in linked data: semantics and approach
Data on the Web is increasingly being used for discovery and exploratory tasks. Unlike traditional fact-finding tasks that require only the typical single-query and response paradigm, these tasks involve a multistage search process in which bits of ...
XML-based RDF data management for efficient query processing
The Semantic Web, which represents a web of knowledge, offers new opportunities to search for knowledge and information. To harvest such search power requires robust and scalable data repositories that can store RDF data and support efficient evaluation ...
Querying Wikipedia documents and relationships
Wikipedia has become an important source of information which is growing very rapidly. However, the existing infrastructure for querying this information is limited and often ignores the inherent structure in the information and links across documents. ...
WikiAnalytics: disambiguation of keyword search results on highly heterogeneous structured data
Wikipedia infoboxes is an example of a seemingly structured, yet extraordinarily heterogenous dataset, where any given record has only a tiny fraction of all possible fields. Such data cannot be queried using traditional means without a massive a priori ...
Find your advisor: robust knowledge gathering from the web
We present a robust method for gathering relational facts from the Web, based on matching generalized patterns which are automatically learned from seed facts for relations of interest. Our approach combines these generalized patterns for high recall ...
Redundancy-driven web data extraction and integration
A large number of web sites publish pages containing structured information about recognizable concepts, but these data are only partially used by current applications. Although such information is spread across a myriad of sources, the web scale ...
Using latent-structure to detect objects on the web
An important requirement for emerging applications which aim to locate and integrate content distributed over the Web is to identify pages that are relevant for a given domain or task. In this paper, we address the problem of identifying pages that ...
Popularity-guided top-k extraction of entity attributes
Recent progress in information extraction technology has enabled a vast array of applications that rely on structured data that is embedded in natural-language text. In particular, the extraction of concepts from the Web---with their desired attributes--...
Manimal: relational optimization for data-intensive programs
The MapReduce distributed programming framework is very popular, but currently lacks the optimization techniques that have been standard with relational database systems for many years. This paper proposes Manimal, which uses static code analysis to ...
Learning topical transition probabilities in click through data with regression models
The transition of search engine users' intents has been studied for a long time. The knowledge of intent transition, once discovered, can yield a better understanding of how different topics are related and be used in many applications, such as building ...
Improved recommendations via (more) collaboration
We consider in this paper a popular class of recommender systems that are based on Collaborative Filtering (CF for short). CF is the process of predicting customer ratings to items based on previous ratings of (similar) users to (similar) items, and is ...
Concurrent one-way protocols in around-the-clock social networks
We introduce and study concurrent One-Way Protocols in social networks. The model is motivated by the rise of online social networks and the fast development of automation features in them. In a One-Way architecture, used, e.g., by Twitter, participants ...
Reconciling two models of multihierarchical markup
For documents with complex or atypical annotations, multihierarchical structures play the role of the document tree in traditional XML documents. We define a model of overlapping or multihierarchical markup that has both a graph-based and a text-and-...
Tree patterns with full text search
Tree patterns with full text search form the core of both XQuery Full Text and the NEXI query language. On such queries, users expect a relevance-ranked list of XML elements as an answer. But this requirement may lead to undesirable behavior of XML ...
Index Terms
- Procceedings of the 13th International Workshop on the Web and Databases