Abstract
One of the main characteristics of the Internet era is the free online availability of extremely large image collections hosted on distributed, heterogeneous platforms across the web. The proliferation of millions of shared photographs has spurred new image retrieval techniques that rely not only on the visual content of images but also on geo-location tags and camera EXIF data. These huge visual collections provide a unique opportunity for cultural heritage documentation and 3D reconstruction. The main difficulty, however, is that internet image datasets are unstructured and contain many outliers. For this reason, this paper proposes a new content-based image filtering method that discards image outliers which either confuse or significantly delay subsequent e-documentation tools, such as the 3D reconstruction of a cultural heritage object. The presented approach exploits and fuses two unsupervised clustering techniques: DBSCAN and spectral clustering. DBSCAN removes outliers from the initially retrieved dataset, and spectral clustering partitions the noise-free image dataset into categories, each representing a characteristic geometric view of a cultural heritage object. To discard the image outliers, we treat images as points on a multi-dimensional manifold and adopt the multi-dimensional scaling algorithm to relate the space of image distances to the space of Gram matrices, from which the image coordinates are computed. Finally, structure from motion is used for the 3D reconstruction of cultural heritage landmarks. Evaluation on a dataset of about 31,000 cultural heritage images, retrieved from internet collections and containing many outliers, indicates that the proposed method is more robust and cost-effective than existing state-of-the-art techniques for reliable, just-in-time 3D reconstruction.
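As a rough illustration of two steps named in the abstract, the Python sketch below shows (a) how a matrix of pairwise distances can be turned into a Gram matrix by double centring, which is the first step of classical multi-dimensional scaling, and (b) how a DBSCAN-style density rule marks isolated points as noise. It is a minimal sketch, not the authors' implementation: the function names, the toy one-dimensional data, and the values of `eps` and `min_pts` are all assumptions made for illustration, and the image distance measure used in the paper is not specified here.

```python
from collections import deque

NOISE = -1  # label given to points that belong to no dense region


def gram_from_distances(dist):
    """Double-centre squared distances: B = -1/2 * J * D^2 * J.

    `dist` is a symmetric n x n list of pairwise distances. The result
    holds inner products of the points about their centroid; classical
    MDS would then eigendecompose B to recover coordinates.
    """
    n = len(dist)
    sq = [[d * d for d in row] for row in dist]
    row_mean = [sum(row) / n for row in sq]  # equals column means (symmetry)
    grand_mean = sum(row_mean) / n
    return [[-0.5 * (sq[i][j] - row_mean[i] - row_mean[j] + grand_mean)
             for j in range(n)] for i in range(n)]


def dbscan_labels(dist, eps, min_pts):
    """DBSCAN on a precomputed distance matrix.

    A point is a core point if at least `min_pts` points (itself
    included) lie within `eps`. Clusters grow outwards from core
    points; anything never reached keeps the NOISE label.
    """
    n = len(dist)
    neighbours = [[j for j in range(n) if dist[i][j] <= eps] for i in range(n)]
    core = [len(nb) >= min_pts for nb in neighbours]
    labels = [NOISE] * n
    cluster = 0
    for start in range(n):
        if not core[start] or labels[start] != NOISE:
            continue
        labels[start] = cluster
        queue = deque([start])
        while queue:
            p = queue.popleft()
            if not core[p]:
                continue  # border points join a cluster but do not expand it
            for q in neighbours[p]:
                if labels[q] == NOISE:
                    labels[q] = cluster
                    queue.append(q)
        cluster += 1
    return labels


# Toy data (hypothetical): two dense groups on a line and one stray point.
positions = [0.0, 0.1, 0.2, 0.3, 5.0, 10.0, 10.1, 10.2]
dist = [[abs(a - b) for b in positions] for a in positions]

labels = dbscan_labels(dist, eps=0.15, min_pts=3)
kept = [i for i, lab in enumerate(labels) if lab != NOISE]
print(labels)  # [0, 0, 0, 0, -1, 1, 1, 1]
print(kept)    # indices that survive the outlier filter
```

In this toy run the stray point at position 5.0 has no neighbours within `eps`, so it keeps the noise label and is dropped, while the two dense groups survive as separate clusters. In a pipeline like the one the abstract describes, the surviving images would then be passed to spectral clustering and structure from motion.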
Acknowledgments
The research leading to these results has been supported by the Marie Curie IAPP project 4D-CH-World: Four Dimensional Cultural Heritage World, grant agreement number 324523.
Cite this article
Makantasis, K., Doulamis, A., Doulamis, N. et al. In the wild image retrieval and clustering for 3D cultural heritage landmarks reconstruction. Multimed Tools Appl 75, 3593–3629 (2016). https://doi.org/10.1007/s11042-014-2191-z
DOI: https://doi.org/10.1007/s11042-014-2191-z