skip to main content
10.1145/1273496acmotherconferencesBook PagePublication PagesicmlConference Proceedingsconference-collections
ICML '07: Proceedings of the 24th international conference on Machine learning
ACM2007 Proceeding
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
Conference:
ICML '07 & ILP '07: The 24th Annual International Conference on Machine Learning held in conjunction with the 2007 International Conference on Inductive Logic Programming Corvalis Oregon USA June 20 - 24, 2007
ISBN:
978-1-59593-793-3
Published:
20 June 2007
Sponsors:
Machine Learning Journal

Bibliometrics
Skip Abstract Section
Abstract

This volume contains the papers accepted to the 24th International Conference on Machine Learning (ICML 2007), which was held at Oregon State University in Corvalis, Oregon, from June 20th to 24th, 2007. ICML is the annual conference of the International Machine Learning Society (IMLS), and provides a venue for the presentation and discussion of current research in the field of machine learning. These proceedings can also be found online at: http://www.machinelearning.org.

This year there were 522 submissions to ICML. There was a very thorough review process, in which each paper was reviewed by three program committee (PC) members. Authors were able to respond to the initial reviews, and the PC members could then modify their reviews based on online discussions and the content of this author response. For the first time this year there were two discussion periods led by the senior program committee (SPC), one just before and one after the submission of author responses. At the end of the second discussion period, the SPC members gave their recommendations and provided a summary review for each of their papers. Also for the first time, authors were asked to submit a list of changes with their final accepted papers, which was checked by the SPCs to ensure that reviewer comments had been addressed. Apart from the length restrictions on papers and the compressed time frame, the review process for ICML resembles that of many journal publications. In total, 150 papers were accepted to ICML this year, including a very small number of papers which were initially conditionally accepted, yielding an overall acceptance rate of 29%.

ICML attracts submissions from machine learning researchers around the globe. The 150 accepted papers this year were geographically distributed as follows: 66 papers had a first author from the US, 32 from Europe, 19 from China or Hong Kong, 11 from Canada, 6 from India, 5 each from Australia and Japan, 3 from Israel, and 1 each from Korea, Russia and Taiwan.

In addition to the main program of accepted papers, which includes both a talk and poster presentation for each paper, the ICML program included 3 workshops and 8 tutorials on machine learning topics which are currently of broad interest. We were also extremely pleased to have David Heckerman (Microsoft Research), Joshua Tenenbaum (Massachussetts Institute of Technology), and Bernhard Schölkopf (Max Planck Institute for Biological Cybernetics) as the invited speakers this year. Thanks to sponsorship by the Machine Learning Journal, we were able to award a number of outstanding student paper prizes.

We were fortunate this year that ICML was co-located with the International Conference on Inductive Logic Programming (ILP 2007). ICML and ILP held joint sessions on the first day of ICML 2007.

Article
Quantum clustering algorithms

By the term "quantization", we refer to the process of using quantum mechanics in order to improve a classical algorithm, usually by making it go faster. In this paper, we initiate the idea of quantizing clustering algorithms by using variations on a ...

Article
Learning random walks to rank nodes in graphs

Ranking nodes in graphs is of much recent interest. Edges, via the graph Laplacian, are used to encourage local smoothness of node scores in SVM-like formulations with generalization guarantees. In contrast, Page-rank variants are based on Markovian ...

Article
Uncovering shared structures in multiclass classification

This paper suggests a method for multiclass learning with many classes by simultaneously learning shared characteristics common to the classes, and predictors for the classes in terms of these characteristics. We cast this as a convex optimization ...

Article
Two-view feature generation model for semi-supervised learning

We consider a setting for discriminative semi-supervised learning where unlabeled data are used with a generative model to learn effective feature representations for discriminative training. Within this framework, we revisit the two-view feature ...

Article
Scalable training of L1-regularized log-linear models

The L-BFGS limited-memory quasi-Newton method is the algorithm of choice for optimizing the parameters of large-scale log-linear models with L2 regularization, but it cannot be used for an L1-regularized loss due to its non-differentiability whenever ...

Article
Multiclass core vector machine

Even though several techniques have been proposed in the literature for achieving multiclass classification using Support Vector Machine(SVM), the scalability aspect of these approaches to handle large data sets still needs much of exploration. Core ...

Article
The rendezvous algorithm: multiclass semi-supervised learning with Markov random walks

We consider the problem of multiclass classification where both labeled and unlabeled data points are given. We introduce and demonstrate a new approach for estimating a distribution over the missing labels where data points are viewed as nodes of a ...

Article
Focused crawling with scalable ordinal regression solvers

In this paper we propose a novel, scalable, clustering based Ordinal Regression formulation, which is an instance of a Second Order Cone Program (SOCP) with one Second Order Cone (SOC) constraint. The main contribution of the paper is a fast algorithm, ...

Article
Learning distance function by coding similarity

We consider the problem of learning a similarity function from a set of positive equivalence constraints, i.e. 'similar' point pairs. We define the similarity in information theoretic terms, as the gain in coding length when shifting from independent ...

Article
Structural alignment based kernels for protein structure classification

Structural alignments are the most widely used tools for comparing proteins with low sequence similarity. The main contribution of this paper is to derive various kernels on proteins from structural alignments, which do not use sequence information. ...

Article
Discriminative learning for differing training and test distributions

We address classification problems for which the training instances are governed by a distribution that is allowed to differ arbitrarily from the test distribution---problems also referred to as classification under covariate shift. We derive a solution ...

Article
Solving multiclass support vector machines with LaRank

Optimization algorithms for large margin multiclass recognizers are often too costly to handle ambitious problems with structured outputs and exponential numbers of classes. Optimization algorithms that rely on the full gradient are not effective ...

Article
Efficiently computing minimax expected-size confidence regions

Given observed data and a collection of parameterized candidate models, a 1 -- α confidence region in parameter space provides useful insight as to those models which are a good fit to the data, all while keeping the probability of incorrect exclusion ...

Article
Multiple instance learning for sparse positive bags

We present a new approach to multiple instance learning (MIL) that is particularly effective when the positive bags are sparse (i.e. contain few positive instances). Unlike other SVM-based MIL methods, our approach more directly enforces the desired ...

Article
Cluster analysis of heterogeneous rank data

Cluster analysis of ranking data, which occurs in consumer questionnaires, voting forms or other inquiries of preferences, attempts to identify typical groups of rank choices. Empirically measured rankings are often incomplete, i.e. different numbers of ...

Article
Feature selection in a kernel space

We address the problem of feature selection in a kernel space to select the most discriminative and informative features for classification and data analysis. This is a difficult problem because the dimension of a kernel space may be infinite. In the ...

Article
Learning to rank: from pairwise approach to listwise approach

The paper is concerned with learning to rank, which is to construct a model or a function for ranking objects. Learning to rank is useful for document retrieval, collaborative filtering, and many other applications. Several methods for learning to rank ...

Article
Local similarity discriminant analysis

We propose a local, generative model for similarity-based classification. The method is applicable to the case that only pairwise similarities between samples are available. The classifier models the local class-conditional distribution using a maximum ...

Article
Direct convex relaxations of sparse SVM

Although support vector machines (SVMs) for binary classification give rise to a decision rule that only relies on a subset of the training data points (support vectors), it will in general be based on all available features in the input space. We ...

Article
Minimum reference set based feature selection for small sample classifications

We address feature selection problems for classification of small samples and high dimensionality. A practical example is microarray-based cancer classification problems, where sample size is typically less than 100 and number of features is several ...

Article
Learning to compress images and videos

We present an intuitive scheme for lossy color-image compression: Use the color information from a few representative pixels to learn a model which predicts color on the rest of the pixels. Now, storing the representative pixels and the image in ...

Article
Magnitude-preserving ranking algorithms

This paper studies the learning problem of ranking when one wishes not just to accurately predict pairwise ordering but also preserve the magnitude of the preferences or the difference between ratings, a problem motivated by its key importance in the ...

Article
Full regularization path for sparse principal component analysis

Given a sample covariance matrix, we examine the problem of maximizing the variance explained by a particular linear combination of the input variables while constraining the number of nonzero coefficients in this combination. This is known as sparse ...

Article
Kernel selection forl semi-supervised kernel machines

Existing semi-supervised learning methods are mostly based on either the cluster assumption or the manifold assumption. In this paper, we propose an integrated regularization framework for semi-supervised kernel machines by incorporating both the ...

Article
Boosting for transfer learning

Traditional machine learning makes a basic assumption: the training and test data should be under the same distribution. However, in many cases, this identical-distribution assumption does not hold. The assumption might be violated when a task from one ...

Article
Intractability and clustering with constraints

Clustering with constraints is a developing area of machine learning. Various papers have used constraints to enforce particular clusterings, seed clustering algorithms and even learn distance functions which are then used for clustering. We present ...

Article
Information-theoretic metric learning

In this paper, we present an information-theoretic approach to learning a Mahalanobis distance function. We formulate the problem as that of minimizing the differential relative entropy between two multivariate Gaussians under constraints on the ...

Article
An integrated approach to feature invention and model construction for drug activity prediction

We present a new machine learning approach for 3D-QSAR, the task of predicting binding affinities of molecules to target proteins based on 3D structure. Our approach predicts binding affinity by using regression on substructures discovered by relational ...

Article
Percentile optimization in uncertain Markov decision processes with application to efficient exploration

Markov decision processes are an effective tool in modeling decision-making in uncertain dynamic environments. Since the parameters of these models are typically estimated from data, learned from experience, or designed by hand, it is not surprising ...

Article
Unsupervised prediction of citation influences

Publication repositories contain an abundance of information about the evolution of scientific research areas. We address the problem of creating a visualization of a research area that describes the flow of topics between papers, quantifies the impact ...

Cited By

  1. Li X, Sun L, Chen S, Wang H, Wen Q, Zhang X, Ding X and Loskot P (2023). Behavior sequence aggregation and attention mechanism-based interest recommendation Third International Conference on Advanced Algorithms and Neural Networks (AANN 2023), 10.1117/12.3005141, 9781510668355, (130)
  2. Jaya Mabel Rani A, Pravin A and Sahin B (2022). Clustering by Hybrid K-Means-Based Rider Sunflower Optimization Algorithm for Medical Data, Advances in Fuzzy Systems, 2022, Online publication date: 1-Jan-2022.
  3. ACM
    Zerveas G, Rekabsaz N, Cohen D and Eickhoff C Mitigating Bias in Search Results Through Contextual Document Reranking and Neutrality Regularization Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, (2532-2538)
  4. Immel P, Desai M, Moody D, Messinger D and Velez-Reyes M (2019). Optimizing deep learning model selection for angular feature extraction in satellite imagery Algorithms, Technologies, and Applications for Multispectral and Hyperspectral Imagery XXV, 10.1117/12.2519373, 9781510626379, (47)
  5. De Bari B, Vallati M, Gatta R, Lestrade L, Manfrida S, Carrie C and Valentini V (2016). Development and validation of a machine learning-based predictive model to improve the prediction of inguinal status of anal cancer patients: A preliminary report, Oncotarget, 10.18632/oncotarget.10749, 8:65, (108509-108521), Online publication date: 12-Dec-2017.
  6. Sadjadi F, Mahalanobis A, Ding Z, Nasrabadi N and Fu Y (2016). Deep transfer learning for automatic target classification: MWIR to LWIR SPIE Defense + Security, 10.1117/12.2228378, , (984408), Online publication date: 12-May-2016.
  7. Egiazarian K, Agaian S, Gotchev A, Bak S, Martins F and Bremond F (2015). Person re-identification by pose priors IS&T/SPIE Electronic Imaging, 10.1117/12.2083862, , (93990H), Online publication date: 16-Mar-2015.
  8. Bastos M, Mercea D and Charpentier A Tents, Tweets, and Events: The Interplay between Ongoing Protests and Social Media, SSRN Electronic Journal, 10.2139/ssrn.2575268
  9. He J, Ding L, Jiang L, Li Z and Hu Q (2014). Intrinsic dimensionality estimation based on manifold assumption, Journal of Visual Communication and Image Representation, 10.1016/j.jvcir.2014.01.006, 25:5, (740-747), Online publication date: 1-Jul-2014.
  10. Vuksanovic B, Zhou J, Verikas A, Tran M and Le N (2013). An analysis of inhibitory pseudo-interconnections in unsupervised neural networks Sixth International Conference on Machine Vision (ICMV 13), 10.1117/12.2052652, , (90671R), Online publication date: 24-Dec-2013.
Contributors
  • University of Cambridge
  1. Proceedings of the 24th international conference on Machine learning

      Recommendations

      Acceptance Rates

      Overall Acceptance Rate140of548submissions,26%
      YearSubmittedAcceptedRate
      ICML '0654814026%
      Overall54814026%