Skip to main content
Top
Published in: Artificial Life and Robotics 2/2020

Open Access 27-01-2020 | Original Article

Evaluation of visual image for remotely controlled ship

Authors: Ai Hoshino, Ayako Umeda, Takumi Nishina, Hidemasa Kimura, Katsuya Hakozaki, Tsuyoshi Ode, Etsuro Shimizu

Published in: Artificial Life and Robotics | Issue 2/2020

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In recent years, automation technology for navigation support is widely accepted not only in the automotive industry but also in the maritime industry. Particularly, the research related to autonomous and remotely controlled ships has become the major topic in the maritime field. At Tokyo University of Marine Science and Technology (TUMSAT), the research for the operation of remotely controlled ships has been started since 2015. In this paper, we report the results of experiments on the image evaluation method required for the remote-control system using wireless communication. The remote-control system is required to install sensors that replace merchant marines’ visual, hearing and other means in manned ship. The camera image is a substitutable method for the merchant marines’ vision and is thought to be one of the most necessary information for the operator of the land station who carries out remote ship maneuvering.
Notes
This work was presented in part at the 24th International Symposium on Artificial Life and Robotics, Beppu, Oita, January 23–25, 2019.
The original version of this article was revised due to open access cancellation request.
A correction to this article is available online at https://​doi.​org/​10.​1007/​s10015-020-00588-5.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

1 Introduction

Contrary to the increase in transportation volume in recent years on a global scale, the merchant marines’ shortage and the price competition are serious situation. Therefore, the research related to autonomous ships has become the major topic in the maritime field for the purpose of making up for merchant marines’ shortage by improving working environment and reducing the cost by reducing personnel expenses. According to the report by the Japan Coast Guard, approximately 74% of maritime accidents in the past 5 years are attributed to human factors [1]. Figure 1 shows the sorting of the cause of marine accident in the coastal area of Japan in 2017.
It is expected that the realization of autonomous navigation will contribute to the reduction of such maritime accidents. Studies related to the realization of autonomous navigation are progressing in the world as well. Maritime Unmanned Navigation through Intelligence in Networks (The MUNIN) project was held for technical, economic and legal review of ships controlled by a system that combines autonomous operation and remote ship maneuvering from land between 2012 and 2015. According to the final report of the project, it was evaluated that the unmanned ship is one order of magnitude lower both sinking risk and collision risk than manned ship [2]. Advanced autonomous waterborne applications initiative (AAWA) project from 2015 to 2016 by Rolls-Royce Holdings investigated the concept of an unmanned vessel. The project assumed a system that remotely monitors autonomous navigation vessels and examined optimal methods and legal and regulation issues for efficiently combining technologies. The final report had been published in June 2016 [3].
Recently, Kongsberg Maritime announced jointly with YARA International to realize a fully autonomous maneuvering container ship connecting the three ports in Norway by 2022 [4].
It was decided to start consideration on the safety of autonomous shipping vessels with the aim of future unmanned ship in the Maritime Safety Committee (MSC) 98th of the International Maritime Organization (IMO) held in June 2017 [5]. In subsequent MSC 99 of May 2018, the consideration had been actually started and the autonomous ships were classified into following four degrees:
Degree 1: the ship is steered by merchant marines including some automated operation systems,
Degree 2: the ship is remotely controlled with ships merchant marines on board,
Degree 3: the ship is remotely controlled ships without merchant marines s on board,
Degree 4: the ship is fully autonomous ships.
However, Degree 3 and 4 are not permitted under Japan’s existing law [6] which requires merchant marines on board. From this reason, it was agreed to consider international rules that require amendment according to each level of automation, as revision of regulations is indispensable for unmanned operations [7]. At the meeting of the IMO, however, many delegates declared that ships with remote control from shore or with partially autonomous functions would be widely used for a long time before the realization of “unmanned operation” [8]. Therefore, it seems that the maneuvering form combining autonomous operation and remote control is the immediate objective. The boat is basically operated autonomously and controlled remotely by an onshore operator in only emergency situations. Researches on remote-controlled ship with autonomous function will be actively conducted.
In this study, the visual image for remote-controlled ships is discussed. A bandwidth of at least 3000 kbps for data streaming of HD video images is shown during Guidelines for Autonomous Shipping by Bureau Veritas in December 2017 [9]. The minimum condition of the camera image used as the vision has been investigated by actual remote-control experiments. The effect of resolutions, angle of view, bit rate, etc. are examined. In Sect. 3, we capture a visual acuity chart with cameras of various specifications and consider the minimum necessary resolution with the obtained image data. Also, we compare various angles of view using captured image data taken during actual voyages. In Sect. 4, we investigate the lowest bit rate values required for remote navigation in experiments (1)–(3) using wireless communication and consider the optional bit rate of the transmission of the data through the wireless communication channel (Fig. 2).

2 Experimental system

2.1 Battery-powered boat “RAICHO-I”

The outline of the battery-powered boat “RAICHO-I” which equips a remote-control system is shown in Fig. 3 and Table 1. A propeller of RAICHO-I is rotated by an electric motor with the lithium-ion battery. Less noise and good response compared with an internal combustion engine have been realized.
Table 1
Specifications of RAICHO-I
Length
10 m
Width
2.3 m
Depth
1.2 m
Gross tonnage
3.5 t
Motor output
45 kW
Battery capacity
40 kWh

2.2 Remote-control system of “RAICHO-I”

The outline of the remote-control system of RAICHO-I is shown in Fig. 4. In general, a land operator and a boat are connected by the wireless communication. The boat transmits sensor information such as a camera image and position data. The land operator returns a boat maneuvering command to the boat based on the obtained information. The remote-control system of RAICHO-I forms a Local Area Network (LAN) using long-distance Wi-Fi, connects boat side PC and the land station PC and exchanges data and information with each other. RAICHO-I has two network cameras in the cockpit to reproduce the view of the merchant marines and broadcast live on the land station and satellite compasses for obtaining the boat position information, the heading, and the boat speed.
In this research, the long-distance Wi-Fi is used as the communication method. Wi-Fi is an unlicensed band, can be constructed by anyone and is inexpensive. The system shown in Fig. 4 is used for experiments in Sect. 4 and later. Cameras 1–3 shown here are network cameras, and cameras used for experiments in Sect. 4 and subsequent sections are network cameras. They are used to monitor video data at other locations in real time.

3 Evaluation of resolution and angle of view

3.1 Purpose of experiment

Conditions for image capturing are examined.To examine how much image quality should be obtained, it is necessary to consider the angle of the view of the camera and the resolution of the image. According to the current regulation of Japan, merchant marines are required to have visual acuity in both eyes 6/12 or more vision [6]. Therefore, the first target of obstacle recognition level using a camera is that whether the Landolt C of 6/12 vision can be recognized or not. According to the international standard, the gap length of the Landolt C of 6/6 vision is 1.454 mm [10]. Since the size of Landolt C in other visual acuities are converted by multiplying the coefficient, 6/12 vision is twice as 6/6 vision. This means that the gap length for 6/12 vision is 3.0 mm. It is necessary to have 2 pixels or more in the 3 mm gap to identify 6/12 vision Landolt C. In other words, it is assumed that the width of 1 pixel is 1.5 mm or less.

3.2 Experimental method

In this experiment, the Landolt C simple visual acuity test paper (Fig. 5a) [11] is used. Image recordings are made at a distance of 5 m using each five camera types shown in Table 2. We observe focal length, focus type, focal number, resolution and calculated pixel width of them. The angle of view can be derived from the focal length. Calculated pixel width (mm/pixel) is determined from the angle of view and the resolution, assuming that the size of the captured image is the same. After the image capture, we evaluate the captured images to determine if we can recognize 6/12 vision Landolt C of each image taken with each of the five cameras or not.
Table 2
Camera specification
Camera
Focal length (mm)
Focus
Focal number
Resolution
Pixel width (mm/pixel)
Network
f = 1.83
Fixed
F = 2.0
1280 × 720
2.60
360°
A: f = 1.633
B: f = 1.257
Fixed
F = 2.4
3840 × 2160
2.60
DSLR
f = 18–55
Auto
F = 3.5–5.6
1920 × 1080
1.06
HD video
f = 2.9–34.8
Auto
F = 1.8–3.4
1920 × 1080
1.12
4K video
f = 4.4–88
Auto
F = 2.0–3.8
3840 × 2160
0.56

3.3 Experimental results

Only images taken with a 4K video camera could recognize the Landolt C at the required 6/12 vision. Camera images other than 4K video cameras could recognize Landolt C with 6/60, but not 6/12 vision. The images taken with a network and HD video cameras are shown in Fig. 5b, c. The result with the 360° camera is equivalent to the network camera and the DSLR equivalent to the HD video camera, because the width of each single pixel is almost equal.

3.4 Consideration

It is necessary to use a camera with a specification of 0.56 mm/pixel at the maximum if the camera also has the same inspection standard as merchant marines. For the look-out in all directions, it is necessary to use at least six 4K video cameras. In remote-controlled ships, it is necessary to transmit camera images from the ship to the land station where the ship operator is located. At this time, we think that it is difficult to maintain a wireless communication that has a bandwidth that can transmit 4K video. In addition, we need to consider the importance of the real-time performance of the video for look-out for preventing collisions. On the other hand, when using only one camera, it is necessary to prepare a camera that can capture 360° and has a resolution much higher than 4K. It does seem to be such a camera is not realistic. We think that there is a limit to the wide-angle camera that can be used even if it does not necessarily require 0.56 mm/pixel. The images in Fig. 6 were captured by cameras with an angle of view of (a) about 230° and (b) about 80°. Since these images were captured at the same position, the distance to the building in the white frame in Fig. 6 is equal. The images with wide-angle lens are felt farther the distance to the structures compared to narrow-angle lens due to lens curvature. At present, web supposes that the person who remotely controls the ship by watching camera video is a license holder who has practical experience on board. It is considered that the difference in distance sense between the visual observation and the distortion of the videos has an effect on the operator who has been watching visually on board. Although it is possible to correct image distortion due to the lens, we have not considered the correction requirements, because the expected impact to network transmission latency may impair the real-time performance of the system.

4 Evaluation of bit rate

4.1 Purpose of experiments

To transmit images by the wireless communication, it is necessary to encode and decode images captured by the camera. Encoding includes the space compression that compresses frames one by one, and the time compression that compresses the amount of data by removing only the difference of images between the previous frame and the rest of the frame. The compression method is different depending on the type of codec. Typical codecs include MPEG-2 adopted in Blu-ray, H.264 recommended for YouTube, H.265 with a higher compression rate than the H.264 successor standard and video with lossless compression. In all the experiments in Sect. 4.1 and subsequently, the H.264 is used according to the network camera specification. When the video is decoded, the playback quality may decrease in the case of low bit rate. To investigate the minimum bit rate that is not affected the obstacle recognition, following experiments are carried out. The resolution is set to 1280 × 720. The obstacle recognition is not affected by the boat speed, even if the set value of the frame rate is 10 fps. In order not to increase the necessary communication capacity depending on the size of the frame rate, the frame rate is set to 10 fps. In Sect. 4.2, we describe the method and results when investigating the minimum bit rate value required to recognize a moving object with a fixed camera as in Experiment (1). In Sect. 4.3, we explain the method and results when we actually use remote maneuvering and observe the minimum bit rate value that appears to be satisfactory and sufficient for maneuvering in Experiment (2). In Sect. 4.4, we describe the method and result of measuring the amount of data required to transmit the captured video via wireless communication channel in Experiment (3). Finally, in Sect. 4.5, we summarize our observations results of experiments (1)–(3).

4.2 Experiment (1): wired communication

4.2.1 Experimental method

It is assumed that the ship navigation can be recognized even at a bit rate of 64 kbps which is the minimum value of the network camera used in the experiment. A video of boats sailing from land is recorded. The bit rates are 64 kbps, 512 kbps, 1000 kbps, 3000 kbps and 8000 kbps.

4.2.2 Experimental results

Figure 7 shows results obtained for the bit rates of (a) 64 kbps and (b) 512 kbps. The two images are recorded at the same timing. A boat is located in each red circle. This boat is a general small boat (less than 20 m in length) that is commonly used in rivers and cruises at 6 knots. At this experiment, this boat cruises at 300 m ahead from us. At a bit rate of (a) 64 kbps, the captured image is coarse and unclear. The boat is difficult to recognize accurately by the image, although the boat is moving up and down due to its forward motion. On the other hand, at a bit rate of (b) 512 kbps can clearly recognize the boat hull and stern wave. In addition, as the boat moves up and down it becomes easy to recognize due to the vertical motion. There is no problem to recognize general boats of this size in rivers, provided a change in the image quality is achieved with the higher bit rates of 512 kbps or more (512 kbps, 1000 kbps, 3000 kbps, 8000 kbps). Our observations demonstrate that a bit rate of (b) 512 kbps or more is necessary for object recognition.

4.3 Experiment (2): wireless communication

4.3.1 Experimental method

In Sect. 4.2, we evaluate the video displayed on the PC connected to the capture camera via wired communication. In Sect. 4.3, we evaluate the video displayed on the PC connected to the capture camera via wireless communication. In this scenario, we investigate the possibility of image degradation due to compression and decoding of the video signal through wireless communication channel. The equipment setup used for the experiment is equivalent to Sect. 4.2.

4.3.2 Experimental results

Although there are some differences between the respective setting conditions, there is no influence on the recognition of the object by the camera image in the remote control regardless of the setting of any bit rate. From this, it can be said that the delay in only the compression and decompression of the video does not affect the look-out by visual observation. On the other hand, we thought that the video transmission would have a big impact on maneuvering.

4.4 Experiment (3): communication capacity

4.4.1 Experimental method

If the video bit rate is increased in remote control, the required communication capacity is also increased and the communication state becomes unstable gradually. However, considering the occurrence of accidents, it is necessary to transmit as high-resolution images as possible. This is because the H.264 is a codec that performs lossy compression, and obstacles may be removed when the video is compressed. Even if there is a part removed by irreversible compression, if the original video has a high resolution, the influence on obstacle recognition can be suppressed. We also believe that high-resolution images are psychologically necessary for people who recognize obstacles. For these reasons, experiments will be conducted even at a setting of maximum bit rate (8000 kbps), which can be set depending on the specifications of the camera, to examine whether it can be used.
The reachability and round-trip time (RTT) are examined using Ping command. It uses the fact that a reply comes back by sending a packet to the host on the network using Internet Control Message Protocol (ICMP). A ping command is transmitted every second. In addition, we investigate the amount of communication data transmitted from the boat to the land station using a network analyzer.

4.4.2 Experimental results

When the selected bit rates were from 512 kbps to 3000 kbps, there were almost no delay in the video, and the communication appeared to be stable. However, when the bit rate was changed to 8000 kbps, the video that was monitored at the land station was stopped about 10 s. Figure 8 shows the communication capacity for 8000 kbps transmission data rate. The horizontal axis shows time, the vertical axis on the left shows the data amount (kbps), and the vertical axis on the right shows response time per 1 ms (RTT). The blue line is RTT, indicating that the higher the value, the slower the arrival. The gray and orange lines are the amount of data measured on the boat and the amount of data transmitted to the land station via the wireless communication. That is, they show the amount of data to be transmitted and the amount of data actually transmitted. The following can be read from Fig. 8. For some reason, such as the communication line becoming unstable, the amount of communication data decreases and approaches 0 kbps. After this, the amount of communication data increases rapidly after the RTT increases or times out. Although this phenomenon occurs even when the set bit rate value is 512 kbps, 1000 kbps, 3000 kbps, it rapidly increases at 8000 kbps. From this, the following can be considered. This time, this communication uses TCP communication according to the camera specifications. For this reason, after RTT increases or times out, real-time data and the previous data that could not be transmitted are sent together (data is being retransmitted). As the result, the amount of communication data (orange line) increases. On the other hand, data retransmission occurred at any setting. From this result, retransmission does not necessarily affect video delay. It can be said that the problem is the frequent occurrence of retransmissions.

5 Conclusions

Remote control can be realized by combining existing technologies. To carry out look-out by technologies, sight can be replaced by cameras and hearing can be replaced by microphones. However, if the angle of view, the resolution, etc. are not selected appropriately, the necessary information cannot be obtained. When transmitting information by the wireless communication, necessary information may be lost by compression. Real-time performance is very important in remote control. If the amount of information transmitted by the wireless communication is too large, it may affect it.
In this paper, it was confirmed that obstacles can be recognized if the resolution is 1280 × 720, the frame rate is 10 fps, and the bit rate is 512 kbps or more using the network camera with the field angle of 120°. We assume constant luminance in the field of view and the object is a standard type of boat generally used in rivers (length: less than 20 m, speed: about 6 knots) observer at a distance of 300 m from observing cameras. Remote boat maneuvering at the coastal area is possible if the Wi-Fi is used. However, not only the camera images but also other sensor information must be transmitted and received via the wireless communication in the remote control.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Literature
6.
go back to reference Umeda A, Shimizu E, Minami K, Miyoshi T (2018) Countermeasures for legal issues to realize autonomous shipping. Transaction of the JSME, Tokyo, p 6 Umeda A, Shimizu E, Minami K, Miyoshi T (2018) Countermeasures for legal issues to realize autonomous shipping. Transaction of the JSME, Tokyo, p 6
Metadata
Title
Evaluation of visual image for remotely controlled ship
Authors
Ai Hoshino
Ayako Umeda
Takumi Nishina
Hidemasa Kimura
Katsuya Hakozaki
Tsuyoshi Ode
Etsuro Shimizu
Publication date
27-01-2020
Publisher
Springer Japan
Published in
Artificial Life and Robotics / Issue 2/2020
Print ISSN: 1433-5298
Electronic ISSN: 1614-7456
DOI
https://doi.org/10.1007/s10015-020-00584-9

Other articles of this Issue 2/2020

Artificial Life and Robotics 2/2020 Go to the issue