Proceedings of the international workshop on Very-large-scale multimedia corpus, mining and retrieval

VLS-MCMR '10: Proceedings of the international workshop on Very-large-scale multimedia corpus, mining and retrieval

October 2010

2010 Proceeding

Program Chairs:
Benoit Huet, EURECOM, France
Tat-Seng Chua, National University of Singapore, Singapore
Alexander Hauptmann, Carnegie Mellon University, USA

Publisher:

Association for Computing Machinery, New York, NY, United States

Conference:

MM '10: ACM Multimedia Conference, Firenze, Italy, 29 October 2010

ISBN:

978-1-4503-0166-4

Published:

29 October 2010

Sponsors:

SIGMM

Abstract

Welcome to the International Workshop on Very-Large-Scale Multimedia Corpus, Mining and Retrieval (VLS-MCMR'10). The purpose of this workshop is to bring together researchers interested in the construction and analysis of very-large-scale multimedia corpora, as well as in the methodologies to mine and retrieve information from them. The workshop will provide a forum to consolidate key issues related to research on very-large-scale multimedia datasets, such as the construction of datasets, the creation of ground truth, and the sharing and extension of such resources in terms of ground truth, features, algorithms and tools. The workshop will also discuss and formulate an action plan towards these goals.

This workshop welcomes contributions on the following topics:

Construction, Unification and Evolution of Corpus
Framework for sharing of dataset, ground truth, features, algorithms and tools
Indexing and retrieval for large multimedia collections (including images, video, audio and other multi-modal systems)
Large-scale video event and temporal analysis over diverse sources
Automatic machine tagging, semantic annotation and object recognition on massive multimedia collections
Interfaces for exploring, browsing and visualizing large multimedia collections
Scalable and distributed machine learning and data mining methods for multimedia data
Performance evaluation methodologies and standards
Large-scale copy detection and near-duplicate detection
Web-scale combined analysis of social and content networks
Scalable and distributed systems for multimedia content analysis

Large-scale multimedia applications are among the potential topics for the "Multimedia Grand Challenge" hosted by ACM Multimedia 2010. The availability of large-scale corpora would effectively boost research in this direction and foster many new applications for years to come.

The call for papers attracted 26 submissions from Asia, North America, Europe and Africa. The program committee accepted 10 high-quality papers. In addition, the program includes a panel on the topics addressed by the workshop and a keynote speech. We hope that the proceedings will serve as a valuable reference for multimedia researchers and developers and will encourage new research directions and results.

Looking over the papers accepted for the workshop, we observe three major trends. First, there are approaches that attempt to benefit from user-contributed data in order to facilitate modeling, mining and retrieval. Second, there are studies that focus on the algorithmic issues related to the use of massively parallel computing facilities. Finally, there is work that addresses the scalability issues that arise when going very-large-scale.

Tong et al. tackle large-scale image annotation using user-contributed annotations (tags, etc.) from social media networks, where scalability is achieved through the use of the GRID'5000 computation resources. Zhou et al. identify relevant text terms from the text blocks that surround web images in order to improve the accuracy of web image annotation on a 5-million-image dataset. Wang et al. propose a deep model-based and data-driven (DMD) hybrid architecture for annotating images. It is shown that DMD can scale up well, thanks to its sparse regularization and scalable supervised learning steps. Creating a corpus is both expensive and time-consuming. Liu and Huet propose a technique to automatically augment the training set for concept detector refinement. Two kinds of information are used to select the training data: visual features, where video shots with high confidence scores are selected, and tags, which are used to filter out video shots not tagged with the concept.

Wu et al. present an unsupervised, fully automatic algorithm for detecting commercials in broadcast TV. Their solution is scalable and efficient for fast, large-scale, unsupervised commercial detection. Gudmundsson et al. revisit a cluster pruning algorithm, considering factors such as CPU/IO cost and memory constraints for large-scale copy detection. The method shows interesting clustering and retrieval computational costs when scaling up. Kosch presents processing and optimization strategies, along with a cost model, for integrating a similarity-based image join in a multimedia database.

Nagy and Meyer-Wegener address the scalability issues that arise for visual-vocabulary-based image annotation algorithms as new object categories are added. To this end, a hierarchical approach is proposed based on class-specific vocabularies and a scoring function. Qiu and Cui report an extensive analysis of user behavior in online video streaming based on a large-scale trace database of online video access sessions. The study of the statistical characteristics of user behavior patterns shows that user behavior in a video access session is not only related to the content of the video, but is also strongly correlated with the behavior in previous access sessions. Wang and Merialdo propose an approach to boost the performance of video concept detection based on Bag-of-Words by assigning different weights to the visual words according to their informativeness for the detection of different concepts.

SESSION: Session 1

section

Session details: Session 1

Tat-Seng Chua

https://doi.org/10.1145/3258355

Total Citations: 0

research-article

Incremental multi-classifier learning algorithm on grid'5000 for large scale image annotation

Yubing Tong, Bahjat Safadi, Georges Quénot

pp 1–6, https://doi.org/10.1145/1878137.1878139

With our previous research, active learning with multi-classifier showed considering performance in large scale data but much calculation was involved. In this paper, we proposed an incremental multi-classifier (SVM classifiers were used) learning ...

Total Citations: 1
Total Downloads: 119
Last 12 Months: 0
Last 6 weeks: 0

research-article

Automatic image annotation by using relevant keywords extracted from auxiliary text documents

Ning Zhou, Yi Shen, Jianping Fan

pp 7–12, https://doi.org/10.1145/1878137.1878140

In this paper, a novel algorithm is developed to enable automatic image annotation by aligning web images with their most relevant auxiliary text terms. First, large-scale web pages are crawled and automatic web page segmentation is performed to extract ...

Total Citations: 1
Total Downloads: 207
Last 12 Months: 1
Last 6 weeks: 0

research-article

A deep-learning model-based and data-driven hybrid architecture for image annotation

Zhiyu Wang, Dingyin Xia, Edward Y. Chang

pp 13–18, https://doi.org/10.1145/1878137.1878141

Does adding more training data always help improve the effectiveness of a machine-learning or pattern-recognition task? Recent evidences in machine translation and speech recognition seem to suggest that the data-driven approach outperforms the ...

Total Citations: 4
Total Downloads: 392
Last 12 Months: 1
Last 6 weeks: 0

research-article

Concept detector refinement using social videos

Xueliang Liu, Benoit Huet

pp 19–24, https://doi.org/10.1145/1878137.1878142

The explosion of social video sharing sites gives new challenges on video search and indexing techniques. Because of the concept diversity in social videos, it is very hard to build a well annotated dataset that provides good coverage over the whole ...

Total Citations: 3
Total Downloads: 98
Last 12 Months: 0
Last 6 weeks: 0

SESSION: Session 2

section

Session details: Session 2

Alexander Hauptmann

https://doi.org/10.1145/3258356

Total Citations: 0

research-article

Commercial film detection and identification based on a dual-stage temporal recurrence hashing algorithm

Xiaomeng Wu, Narongsak Putpuek, Shin'ichi Satoh

pp 25–30, https://doi.org/10.1145/1878137.1878144

This paper proposes a dual-stage temporal recurrence hashing algorithm for fully unsupervised and super-fast Commercial Film (CF) mining in large-scale broadcast video archives. The first-stage hashing algorithm converts a large amount of video segments ...

Total Citations: 4
Total Downloads: 163
Last 12 Months: 1
Last 6 weeks: 0

research-article

A large-scale performance study of cluster-based high-dimensional indexing

Gylfi Þór Gudmundsson, Björn Þór Jónsson, Laurent Amsaleg

pp 31–36, https://doi.org/10.1145/1878137.1878145

High-dimensional clustering is used by some content-based image retrieval systems to partition the data into groups; the groups (clusters) are then indexed to accelerate processing of queries. Recently, the Cluster Pruning approach was proposed as a ...

Total Citations: 24
Total Downloads: 253
Last 12 Months: 3
Last 6 weeks: 2

research-article

Optimizing similarity-based image joins in a multimedia database

Harald Kosch

pp 37–42, https://doi.org/10.1145/1878137.1878146

Commonly used content-based image retrieval systems focus on the problem of finding similar images for a given single query object out of a database of media objects. We consider a similarity-based image join of two image tables, where the image data ...

Total Citations: 1
Total Downloads: 247
Last 12 Months: 4
Last 6 weeks: 0

SESSION: Session 3

research-article

Towards extensible automatic image annotation with the bag-of-words approach

Robert Nagy, Klaus Meyer-Wegener

pp 43–48, https://doi.org/10.1145/1878137.1878148

Visual-word-based image categorization has proven to be very effective in several publications and contests. Recently, various approaches have been proposed to address the need for scalability and computational performance of classification based on Bag ...

Total Citations: 1
Total Downloads: 161
Last 12 Months: 0
Last 6 weeks: 0

research-article

An analysis of user behavior in online video streaming

Fan Qiu, Yi Cui

pp 49–54, https://doi.org/10.1145/1878137.1878149

Understanding user behavior in online video streaming is essential to designing streaming systems which provide user-oriented service. However, it is challenging to gain insightful knowledge of the characteristics of user behavior due to its high ...

Total Citations: 10
Total Downloads: 790
Last 12 Months: 40
Last 6 weeks: 8

research-article

Weighting informativeness of bag-of-visual-words by kernel optimization for video concept detection

Feng Wang, Bernard Merialdo

pp 55–58, https://doi.org/10.1145/1878137.1878150

Bag-of-Visual-Words (BoW) feature has been demonstrated effective and widely used in video concept detection due to its discriminative ability by capturing the local information in images. In the current approaches, all the words in the visual ...

Total Citations: 2
Total Downloads: 117
Last 12 Months: 0
Last 6 weeks: 0
