Photonic machine learning implementation for signal recovery in optical communications

Argyris, Apostolos; Bueno, Julián; Fischer, Ingo

doi:10.1038/s41598-018-26927-y

Published: 31 May 2018

Scientific Reports volume 8, Article number: 8487 (2018)

Abstract

Machine learning techniques have proven very efficient in assorted classification tasks. Nevertheless, processing time-dependent high-speed signals can turn into an extremely challenging task, especially when these signals have been nonlinearly distorted. Recently, analogue hardware concepts using nonlinear transient responses have been gaining significant interest for fast information processing. Here, we introduce a simplified photonic reservoir computing scheme for data classification of severely distorted optical communication signals after extended fibre transmission. To this end, we convert the direct bit detection process into a pattern recognition problem. Using an experimental implementation of our photonic reservoir computer, we demonstrate an improvement in bit-error-rate by two orders of magnitude, compared to directly classifying the transmitted signal. This improvement corresponds to an extension of the communication range by over 75%. While we do not yet reach full real-time post-processing at telecom rates, we discuss how future designs might close the gap.

Introduction

Recent developments in neuro-inspired information processing using recurrent neural networks (RNNs), cognitive computing approaches, machine learning techniques and deep learning^1,2 architectures have had a major impact on solving classification and pattern recognition tasks with remarkable efficiency^3,4,5,6,7. However, there are hardly any solutions available if the task is time-dependent, the speed requirements are very demanding and the signals to be processed are of high complexity. To this end, analogue hardware implementations of these information processing tools have been gaining increasing recognition⁸. In recent years, implementations of feed-forward and recurrent neural networks based on extreme learning machines (ELM)^9,10 and reservoir computing (RC) approaches^11,12,13,14 have been presented in optoelectronic^15,16,17,18 and photonic^19,20,21,22,23,24,25 hardware. These implementations were in some cases assisted by field programmable gate array (FPGA) modules^25,26. So far, they have only been employed for standard benchmark tasks such as pattern classification, speech recognition, nonlinear time series prediction and wireless channel equalization. Evolving these hardware implementations towards minimal conceptual complexity and maximal speed would make it possible to address signal processing tasks in critical technological fields. An excellent example with ultra-fast post-processing requirements can be found in contemporary fibre-optic communication networks, which now operate even beyond the Tb/s scale²⁷. The technological advances in this field target the highest data throughput over the longest distances with energy-efficient, low-complexity designs. However, transmission impairments²⁸, such as chromatic dispersion, the Kerr effect and four-wave mixing, put strict limitations on communication speed and distance in fibre-optic communication systems. Current research aims at extending these limits by focusing mainly on the two ends of the communication links. 
At the transmitter side, major efforts focus on optimizing the emitter^29,30, as well as the encoding scheme, by using multi-level formats and signal shaping^28,31,32,33,34. At the receiver side, high-speed digital signal processing (DSP) algorithms^35,36,37,38,39 with low complexity have improved signal recovery by mitigating linear and nonlinear signal distortions. The aforementioned approaches in fibre-based communication systems currently shape the status quo of the field, but they also face challenges with respect to future trends. For example, the current DSP methods are efficient as long as nonlinear signal distortions do not become too complicated. For this reason, optimal designs of various transmission systems dictate that the launched optical power in standard single mode fibres (SSMF) should always be restricted to moderate levels (around or below 1 mW). Inevitably though, these power levels limit the signal-to-noise ratio (SNR) of the received signal, given the standard detection capabilities of fast photoreceivers. A reasonable suggestion would be to increase the optical power launched into the fibre. Numerous semiconductor laser emitters are available that offer tens of mW of emission at telecom wavelengths. Such signals exhibit a higher optical SNR that could lead to increased transmission distance, but at the expense of enhancing the nonlinear behaviour of the transmission line. Travelling signals will undergo a more complex nonlinear transformation, and eventually it will be too difficult to identify and interpret them at the receiver. Only lately have machine learning algorithms been in the spotlight of the optical communications community⁴⁰. They are being considered for optical network monitoring and optimization^41,42,43, optical header recognition⁴⁴ and mitigation of transmission effects^45,46,47,48,49,50,51,52,53,54. 
Still, the drawback of applying these standard tools in ultrafast systems is that they are computationally expensive and far from reaching real-time processing at telecom data rates.

In the present work, we provide a first validation that neuro-inspired information processing based on photonic implementations can address critical issues in the field of signal processing for high-speed communications. Specifically, we demonstrate that techniques like ELM and RC can offer solutions to data recovery of distorted signals from extended fibre transmission. We introduce a simplified RC approach with a sequential data processing architecture that allows for a high-speed hardware implementation. Our focus in this work is to efficiently classify signals that have undergone a significant nonlinear transformation with time dependencies. The concept we demonstrate here is generic and powerful, and it can be applied to signals that may originate from any optical communication system configuration (different information modulation formats and communication speeds). The results obtained with RC-based processing show very promising performance, even if they do not yet reach the status of well-established methodologies in signal processing, such as maximum-likelihood sequence detection and back-propagation DSP. However, the approach represents a revolutionary tool for fast signal processing of problems with increased complexity. RC implementations have already achieved performance comparable to digital algorithms based on Volterra series filters, applied in equalization tasks for nonlinear satellite communication channels⁵⁵, albeit at much slower speeds. Future works that compare cognitive computing approaches with current DSP technology will also be of significant interest.

A simplified reservoir computing concept

The originally proposed RC concept^11,12 for data processing consists of three layers: the input, the reservoir and the output layer. The reservoir is described as a recurrent network with randomly connected nonlinear nodes. Its role is to nonlinearly transform the input and, at the same time, to generate a mapping of the input onto a high-dimensional state space. Recently it was shown¹³ that a single nonlinear element with time-delayed feedback can emulate a recurrent network by defining multiple nodes within the feedback loop with delay time τ. These nodes –also denoted as virtual nodes – form a unidirectional ring topology. The output of the single nonlinear element, sampled at time intervals separated by θ, results in a vector of the virtual node values, which is interpreted as the state of the virtual network. When θ < T, with T being the characteristic time of the nonlinear element, the state of each virtual node has, due to inertia, cross-talk with the state of its neighbours, increasing the connectivity among the nodes. Due to the delayed feedback topology, the network states are additionally influenced by their past states, one time-delay before. The final network size N is defined as the number of virtual nodes along the reservoir loop (N = τ/θ). The structure of the input layer defines how the initial information is fed into the reservoir. For temporal information processing, the simplest way to configure the input interface is to have time-multiplexed inputs. For scalar data inputs, one commonly injects sequentially each input value into the N virtual nodes, within one time interval τ. In order to obtain a large number of different transient responses for this input value, temporal masking is applied. The masking sequence is a length N vector of usually random values and is repeated for every interval τ. This mask defines the input weights for each virtual node. In this way, one masked input value is represented by a 1xN vector within one interval τ. 
This procedure generates a representation of the input in a state space of increased dimensionality before injecting it into the reservoir. When going from scalar data inputs to data vectors with multiple components, the mask is usually replaced by a random connectivity matrix. This matrix defines how the vector components of the input data are fed into the reservoir’s nodes.

The information processing technique we propose is shown in Fig. 1, illustrating the simplified RC concept we will explain below. In our implementation, the input information is a one-dimensional data vector, constituted by a number of samples within one bit duration. The motivation is that in all physical communication systems, the digital information of each bit bⁱ includes a pattern that can be described by a vector aⁱ = {a₁ⁱ a₂ⁱ a₃ⁱ … a_jⁱ} with j samples within the bit period, as shown in Fig. 1a. The actual sampling rate of the communication system defines the dimension of the vector aⁱ. Thus, j input values represent one bit of digital information per time interval τ. If we followed the original approach, we would need to inject these data vectors into the delay reservoir after multiplying them with the random connectivity matrix. This represents a significant complication when implementing the scheme in hardware and at high speeds. Therefore, we introduce a conceptual simplification which still yields excellent results. We choose j to be equal to the number of virtual nodes k. Then we take the elements of the input vector a_jⁱ in the order defined by the input data stream and multiply them by the random mask m = {m₁, m₂, m₃, … m_k}. This is illustrated in Fig. 1b. The dimension of the space spanned by the components of the input samples is the same as the dimension of the space defined by the nonlinear responses of the reservoir within one interval τ. As we will show, the concept works very well since the dimensionality of the representation of the data in these two spaces can differ significantly. We note that the flexibility of our approach allows also that the value k can be chosen differently from the number of samples j; k can even be different from the total number of nodes N which are available within one reservoir’s time delay τ. The importance of selecting these values will be discussed in detail in the results section.
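The simplified input layer described above amounts to an element-wise product between the j = k samples of each bit and a fixed mask. The following minimal sketch illustrates this step only; the function names `make_mask` and `mask_bit_patterns` are hypothetical, and drawing the mask values uniformly from [0, 1) is an assumption, since the text only states that the mask is random.

```python
import numpy as np

def make_mask(k, seed=0):
    """Fixed random mask m = {m_1, ..., m_k}.

    Assumption: values are drawn uniformly from [0, 1); the text only
    says that the mask is random.
    """
    return np.random.default_rng(seed).uniform(0.0, 1.0, size=k)

def mask_bit_patterns(patterns, mask):
    """Multiply every bit pattern a^i (one row, j = k samples) by the mask.

    patterns: array of shape (n_bits, k)
    mask:     array of shape (k,)
    Returns the masked patterns, one row per reservoir time delay.
    """
    patterns = np.asarray(patterns, dtype=float)
    mask = np.asarray(mask, dtype=float)
    if patterns.ndim != 2 or patterns.shape[1] != mask.size:
        raise ValueError("each bit pattern needs exactly k = len(mask) samples")
    # Broadcasting applies the same mask to every bit (the mask repeats every tau).
    return patterns * mask

# Usage: three bits with k = 66 samples each, as in the experiment.
mask = make_mask(66)
masked = mask_bit_patterns(np.ones((3, 66)), mask)
```

Because the same mask is reused for every bit, no connectivity matrix has to be stored or applied, which is the simplification the scheme relies on.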

The output layer performs the readout of the reservoir and is implemented in the originally proposed way (Fig. 1a). The responses of the virtual nodes are linearly combined using a set of optimal readout weights (Methods). The latter are determined by a training process (ridge regression algorithm) that minimizes the mean square error between the obtained and the desired output of an initially defined training sequence. Once the optimal readout weights have been determined, they are kept unaltered during the complete information processing task. This means that the training process is applied only once. Finally, a linear summation of the weighted reservoir responses generates a single computational result per time delay^12,20, which is the prediction output. In the following, the output layer is trained to provide an estimate $\tilde{b}^{i}$ of the initial binary input.
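The readout training can be sketched as a standard ridge regression on the recorded node responses. The sketch below is illustrative only: the function names, the regularization value `ridge=1e-3`, the added bias column and the decision threshold of 0.5 are assumptions and are not taken from the Methods.

```python
import numpy as np

def train_readout(states, targets, ridge=1e-3):
    """Ridge-regression readout weights.

    states:  (n_bits, n_features) reservoir responses, one row per bit
    targets: (n_bits,) desired binary outputs
    ridge:   regularization strength (hypothetical value)
    """
    X = np.asarray(states, dtype=float)
    y = np.asarray(targets, dtype=float)
    X1 = np.hstack([X, np.ones((X.shape[0], 1))])   # bias column (assumption)
    # Solve (X^T X + ridge * I) w = X^T y; the bias is regularized as well.
    A = X1.T @ X1 + ridge * np.eye(X1.shape[1])
    return np.linalg.solve(A, X1.T @ y)

def predict_bits(states, weights, threshold=0.5):
    """Weighted sum of the responses followed by a hard decision."""
    X = np.asarray(states, dtype=float)
    X1 = np.hstack([X, np.ones((X.shape[0], 1))])
    return (X1 @ weights > threshold).astype(int)

def bit_error_rate(predicted, reference):
    """Fraction of bits that differ from the reference sequence."""
    return float(np.mean(np.asarray(predicted) != np.asarray(reference)))
```

The weights are computed once on a training sequence and then reused unchanged for the test sequence, mirroring the train-once procedure described above.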

For the implementation of the reservoir we consider a photonic topology that consists of a semiconductor laser (SL) with time-delayed optical feedback of controllable strength²⁰ and time-dependent external optical injection that carries the input information (Fig. 1b). In such a system, the identification of the characteristic time T of our nonlinear element is a nontrivial task. The definition of a single characteristic time for the system based on the solitary SL alone would be misleading and inaccurate. Depending on conditions related to laser biasing, optical feedback strength, feedback time-delay, optical injection strength and time-scales of the injected external signal, the characteristic frequency T⁻¹ of the response system can be well above the relaxation oscillation frequency of the solitary laser. A solitary SL inherently exhibits a dynamical operating bandwidth of several GHz, which is particularly useful for generating fast transient states when injecting external perturbations. This value can further increase up to tens of GHz, as it has been shown in various schemes that SLs under optical injection can exhibit bandwidth-enhanced responses^56,57. In our experiment, depending on the operating conditions of the reservoir and the SNR available for signal detection, we observe characteristic frequencies T⁻¹ of the reservoir response of up to ∼10 GHz. Accordingly, the virtual node spacing θ was selected to be 50 ps, which satisfies the condition θ < T.

A similar topology is also used to implement the ELM approach, which is obtained by simply minimizing the optical feedback strength in the reservoir’s loop¹⁰. In our investigations, the performance of the RC and ELM topologies is benchmarked against simplified (no masking of the input, Fig. 1c) or absent (Fig. 1d) photonic processing.

Input signal considerations

The versatile strategies that have been developed for the different needs of communicating counterparts – mainly distinguished by the coverage range – have promoted a diversity of systems. We will demonstrate that our approach is generic and applicable on signals with deterministic distortion that emerge from differently structured transmission systems. We focus here our investigations on two fundamental systems with different ranging applications that operate at the telecom C-band (1550 nm) and use a single channel carrier (Methods). The first is a short-reach transmission system in which the use of any inline component (dispersion compensation fibre - DCF, optical amplification) is avoided. This type of connection is advantageous for data centre intra- and inter- connections⁵⁸, as well as for the next generation DCF-free metro networks, minimizing the cost and complexity of the communication⁵⁹. For this system we consider a non-return-to-zero (NRZ), pulse amplitude modulation (PAM) encoding at a bit rate of R₁ = 25 Gb/s (Supplementary Fig. 1), a target bit-rate for the IEEE 802.3 communication standards⁶⁰. The task of the RC process for this system is to mitigate two coexisting phenomena: a linear distortion caused by chromatic dispersion and a nonlinear distortion caused by the Kerr effect. The second system we investigate is a long-haul transmission link with the same encoding, at R₂ = 10 Gb/s, including dispersion post-compensation and filtered optical amplification every 100 km span (Supplementary Fig. 2). The task of the RC process for this system is to mitigate the Kerr nonlinearity in presence of stochastic noise that originates from the optical amplification modules. In both systems we consider high-power launched optical signals (10 mW), which are usually not favoured in conventional transmission systems. 
In this way we obtain a high optical SNR (36.6 dB, more than 10 dB higher than conventional systems) for the transmitted signals that allows us to extend the SSMF transmission distance z. Even if the data recovery bit-error-rate (BER) becomes higher than 0.1, by considering incoherent detection in absence of any DSP, this is not due to the limited optical SNR but due to deterministic effects that can in principle be compensated for. Specifically, in the investigated scenario for the short-reach transmission we consider a transmission length of z₁ = 45 km (Supplementary Figs 3 and 4). In the long-haul transmission scenario, we consider a transmission length of z₂ = 4000 km (Supplementary Figs 3 and 4). Both systems have been numerically simulated using the coupled nonlinear Schrödinger equation (CNLSE) model²⁸ (Methods), adjusted to include also effects such as polarization mode dispersion and inter-channel nonlinear effects (four-wave mixing, cross-phase modulation). The latter do not apply in our investigation with a single transmission channel. The output signals after simulating transmission and photodetection are used to feed the input of the experimentally built photonic reservoir. Other modulation formats (e.g. PAM4) and higher data bit rates (e.g. 56 Gb/s) have also been tested, with analogous performance, but their description goes beyond the scope of this manuscript.

Implementation of the photonic reservoir

The implemented photonic reservoir is presented in Fig. 2a, following the simplest possible experimental topology. Details for the experimental implementation can be found in Methods. The reservoir consists of a 1542 nm discrete-mode quantum-well SL (response laser), and an optical fibre delay loop (τ = 66.000 ± 0.025 ns) from which the laser receives delayed feedback. A distributed feedback (DFB) SL (injection laser) is used to generate the optical carrier that carries the masked signal from the simulated transmission systems into the reservoir (Fig. 2b,c). Its emission wavelength is temperature-tuned relative to the wavelength of the response laser, allowing us to control the reservoir properties⁶¹. The implementation of the photonic reservoir is based on commercially available fibre-based components, including an optical attenuator, couplers, circulator and polarization controller. The fibre lengths of the input/output ports of such devices result in a long time delay within the photonic reservoir (τ = 66 ns) significantly longer than actually needed for our computations. Given the node separation of θ = 50 ps, we can define in total N = 1320 virtual nodes within one time delay. Ultimately, we only use the first k = 66 virtual nodes of the delay loop and we assign one sample from each bit’s analogue pattern to one virtual node (j = k). In this way, one time delay is only partially filled with a sequential input signal (Fig. 2d) and only a small part of the reservoir’s available virtual nodes are being used. The states of the other virtual nodes in the unused part of the reservoir are omitted from training and testing, while the next bit pattern follows after one time delay (τ) (Fig. 2e). Therefore, the unused part of the fibre does not contribute to the reservoir response and a system with correspondingly shorter delay would perform equally, at the same time providing higher data throughput in the experiment. 
In this work we will also show numerically how shorter reservoirs – e.g., based on photonic integrated circuits – can perform with equivalent efficiency, while speeding up processing significantly. In the presented scheme of sequential input feeding of the reservoir, one bit pattern from the transmission signal is assigned to one time delay of the reservoir. This means that one bit of information is time-stretched to fit into the k nodes of the reservoir within one time delay. This offline time-stretch defines the so far remaining “speed penalty” of our processing method (Methods). The response output of the reservoir is recorded using an 80 GSa/s real-time oscilloscope and is used to train and test the linear classifier (Methods). A weighted summation of reservoir responses provides the estimated bit sequence of the initial data. In the current investigation the classifier is calculated offline. However, the linear regression approach was selected since it can be implemented with hardware approaches that have been reported lately, based on all-optical implementations of temporal integrators^62,63 and fast analogue summation techniques²⁵.
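The node bookkeeping of the experimental reservoir follows directly from N = τ/θ and can be checked with a few lines. This is a minimal sketch with hypothetical helper names; it uses only the values quoted above (τ = 66 ns, θ = 50 ps, k = 66).

```python
def virtual_nodes(tau_s, theta_s):
    """Number of virtual nodes N = tau / theta along one delay loop."""
    return round(tau_s / theta_s)

def used_fraction(k, n_total):
    """Fraction of the available virtual nodes that carry input samples."""
    return k / n_total

# Experimental values from the text.
N = virtual_nodes(66e-9, 50e-12)     # total nodes within one time delay
fraction = used_fraction(66, N)      # share of nodes filled by one bit pattern
print(N, fraction)
```

With these values only one in twenty virtual nodes is used, which is why a correspondingly shorter delay loop is expected to perform equally well.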

Results and Discussion

In bit streams without temporal cross-talk, a classifier would be trained on the currently evaluated bit only, by solely considering its timeframe. Nevertheless, the presence of deterministic fibre transmission impairments results in patterns that contain information from neighbouring bits also. Contemporary DSP methods commonly use this information by considering one sample per bit in order to improve detection capabilities. Here we follow the same approach, but using multiple samples per bit – the patterns of the neighbouring bit timeframes – in order to improve classification performance (Methods). The optimal number of timeframes to be considered for training depends on the extent of chromatic dispersion and the Kerr nonlinearity. In the notation we adopt, training the classifier with n-bits means that we consider a reservoir’s response of duration n∙τ that includes those bit timeframes as defined in Fig. 3a. The number of transient responses that participate in the training is then n∙k.
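Training with n bits can be pictured as stacking the reservoir responses of n neighbouring timeframes into one feature vector of length n∙k per evaluated bit. The sketch below is an illustration under assumptions: the function name is hypothetical, and the split into previous and subsequent timeframes is passed explicitly (e.g. 4 and 4 for 9-bit training), since the exact alignment is defined in Fig. 3a, which is not reproduced here.

```python
import numpy as np

def window_features(states, n_prev, n_next):
    """Stack the responses of neighbouring bit timeframes.

    states: (n_bits, k) array, one row of k node responses per bit
    n_prev: number of preceding timeframes to include
    n_next: number of subsequent timeframes to include
    Returns an array of shape (n_bits - n + 1, n * k), n = n_prev + 1 + n_next.
    Bits too close to either end of the sequence are skipped.
    """
    states = np.asarray(states, dtype=float)
    n_bits = states.shape[0]
    rows = [
        states[i - n_prev : i + n_next + 1].ravel()
        for i in range(n_prev, n_bits - n_next)
    ]
    return np.array(rows)

# Usage: 9-bit training with k = 66 gives 9 * 66 = 594 features per bit.
features = window_features(np.zeros((20, 66)), n_prev=4, n_next=4)
```

The matching training targets are the bits with indices `n_prev` to `n_bits - n_next - 1`, so that each feature row is paired with the bit at the centre of its window.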

Short-reach transmission

When considering the short-reach transmission scenario, an extended distance of z₁ = 45 km leads to a BER value above 0.1 (incoherent detection in absence of any DSP). Training the linear classifier directly on the output signal from transmission (benchmark test of Fig. 1d) within 1-bit timeframe, the BER measured for the test sequence is 0.1 (Fig. 3b, black rectangles). Even though the pattern within one bit duration provides more information than a single sample in the same duration, BER is not reduced significantly. When considering for training an extended sequence of 9-bits, the BER is reduced to 10⁻², driven by the effect of the pattern recognition that uses information from neighbouring bits. Equivalent results are obtained for the benchmark classification test of Fig. 1c. In these tests, the number of samples per bit is equal to the number of nodes that will contribute to the reservoir computation (j = k = 66). Thus, for 9-bit training, 594 reservoir outputs contribute to the linear regression model. When we incorporate the reservoir in the system and optimize its performance with respect to feedback strength and laser frequency detuning (Supplementary Fig. 5), we obtain significantly improved BER values as low as 1.8∙10⁻⁴ (Fig. 3b, red dots). Even when minimizing the feedback in the reservoir loop and operating the system as an ELM, we still obtain an improvement compared to the benchmark tests, with a BER value as low as 7∙10⁻⁴ (Fig. 3b, blue triangles). These findings are very encouraging and indicate two contributing mechanisms of the photonic reservoir to the improvement of the binary classification performance. The first one is attributed to the nonlinear transformation of the injected signal into the response laser (ELM operation) and the second one is attributed to the inherent fading memory⁶¹ of the reservoir (RC operation). 
Thus, when comparing a temporal pattern of 9-bit duration (4 previous, the current and 4 subsequent bits) we obtain significant improvement compared to the benchmark tests of Fig. 1c,d. Nevertheless, the optimal number of the neighbouring bits we may consider is not fixed; it is related to the extent of transmission impairments and the transmission length, thus it may be selected accordingly.

Long-haul transmission

As a second task, we test our scheme considering a long-haul communication link with z₂ = 4000 km, following the same methodology. Training directly on the transmission output signal we obtain for 1-bit training a BER value as high as 0.056 (Fig. 3c, black rectangles). By considering the photonic reservoir, but without masking the input (benchmark test of Fig. 1c), we obtain the same performance. However, after masking the input signal and optimizing the reservoir’s operating conditions, the BER value we measure is significantly reduced to 1.7∙10⁻³. Here, it is obtained considering only 4-bit or 5-bit timeframes for training (Fig. 3c, red dots and Supplementary Fig. 6). In this transmission scenario, the number of neighbouring bits that affect the current bit profile is smaller, since chromatic dispersion compensation is applied to this system. Also in this type of links, the presence of stochastic optical amplification noise (here we considered an optical amplification noise figure of 5 dB) is an additional limiting factor for data recovery. Reservoir computing cannot compensate for such stochastic processes. Yet again, the RC approach with optimized feedback conditions yields significant nonlinearity mitigation, with slightly better results than the ELM approach (Fig. 3c, blue triangles).

Extension of communication range

The found improvements of the BER level of the detected signals can be directly translated into an excess in usable transmission distance. We focus on BER values around 10⁻³; at this decoding BER threshold, a hard-decision forward error correction (FEC) method can provide an error-free data recovery. FEC codes impose an overhead to the initial data sequence, depending on the data bit-rate (typically 12% for R₂ and 7% for R₁)⁶⁴. In this work, we consider the initial data rates. Evaluating the short-reach transmission scheme, as an example, we obtained a BER improvement of almost two orders of magnitude with respect to the benchmark tests. The resulting gain in transmission distance using the RC post-processing is 75.9% when compared to classifying the transmission output and 200% when compared to the direct detection performance without any processing (Fig. 4). These are remarkable extensions, illustrating the power and potential of the presented approach, extracting the bits via a pattern recognition method based on photonic reservoir computing.

Towards faster reservoirs

Reservoirs with much smaller delays than experimentally demonstrated here can be designed by exploiting photonic integrated circuits, without losing any classification performance but improving on the speed of processing. Nevertheless, the number of virtual nodes that can be defined in short-delay reservoirs, with a given node spacing θ, is limited by the time delay itself. Appropriate designs can take a priori into consideration a suitable number of virtual nodes and transient states that will be needed for optimized training. To assess the limitations imposed by the reservoir size, we numerically simulate the behaviour of short reservoirs with the same virtual node spacing (θ = 50 ps) and various feedback time-delays (τ). We simulate a system that is analogous to the experimental topology of Fig. 2, based on a rate equation model for the response laser dynamics (Methods). Phase dependencies within the short feedback loops are also taken into account. As input signal to the reservoir, we consider the output signal from the short-reach transmission scenario we used before, but for a slightly increased transmission distance (${z^{\prime} }_{1}$ = 50 km). At first, we shorten the reservoir delay loop to τ = 1.6 ns (k = 32) and use an equal number of samples to describe each bit pattern (j = 32). Large values for j can be obtained by oversampling each bit profile. In fact, initial sampling of j = 4 or j = 8 is adequate to describe the attributes of the obtained binary patterns. We obtain data recovery without any errors in the test bit sequence for several operating conditions of the reservoir (Fig. 5). This performance is consistent even when the input signal sampling is j = 4. In all cases that j < k, each sample contributes as input to more than one virtual node (Methods). The sampling rate of j = 4 for 25 Gb/s pulses is possible to obtain with the current state-of-the-art detection equipment, eliminating the need for oversampling. 
However, further reduced sampling (j = 2) drastically degrades the classification performance. This indicates that the pattern profiles within each bit duration include critical information for the classification task. The speed penalty of this processing approach, for τ = 1.6 ns, is τ/R₁⁻¹ = 1.6 ns/40 ps = 40 (Methods). This value can be further reduced when considering even shorter delays in the proposed time-multiplexed approach, associated with a smaller number of virtual nodes. Nevertheless, numerical simulations show that this comes at the expense of the BER improvement (Fig. 5). A combination of our presented time-multiplexing approach with spectral or spatial encoding could eventually overcome this restriction.
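The speed penalty and the case j < k can be illustrated with a short sketch. The speed penalty follows the ratio given above. How each sample is distributed over several virtual nodes is specified in the Methods; the plain repetition (sample-and-hold) used in `expand_samples` is an assumption made here for illustration, and both function names are hypothetical.

```python
def speed_penalty(tau_s, bit_rate_bps):
    """Ratio of the reservoir delay to the bit period, tau / R^-1."""
    return tau_s * bit_rate_bps

def expand_samples(samples, k):
    """Spread j samples over k virtual nodes by repeating each sample k/j times.

    Assumption: simple sample-and-hold repetition; k must be a multiple of j.
    """
    j = len(samples)
    if j == 0 or k % j != 0:
        raise ValueError("k must be a positive multiple of the sample count j")
    repeat = k // j
    return [s for s in samples for _ in range(repeat)]

# tau = 1.6 ns at R1 = 25 Gb/s, and j = 4 samples feeding k = 32 nodes.
penalty = speed_penalty(1.6e-9, 25e9)
node_inputs = expand_samples([0.1, 0.7, 0.9, 0.3], 32)
```

The expanded vector is then multiplied by the length-k mask, so that nodes sharing the same sample still receive different input weights.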

For the case of τ = 1.6 ns and (j, k) = [4, 32], we evaluated the obtained BER as a function of the received optical power after a fibre transmission length of $z'_{1}$ = 50 km. When considering a direct detection system without including any dispersion compensation, the BER threshold of 10⁻³ – where FEC techniques can apply – can never be reached. Even when considering a linear classifier with 9-bit training on the received signal, this threshold is still not reached (Fig. 6, blue triangles). In contrast, the BER threshold is reached when considering the same classifier with 9-bit training on the photonic reservoir output, at a received power as low as −17.5 dBm (Fig. 6, red dots). As a reference for comparison, we show in Fig. 6 (black rectangles) the performance of an optimized transmission system that also includes physical dispersion compensation (i.e. DCF), for the same transmission distance $z'_{1}$. Even though we determine a power penalty of 5.8 dB when considering the RC-based detection, we show that this method has a remarkable potential to mitigate both linear and nonlinear transmission phenomena. The performance of the reservoir for this case (τ = 1.6 ns, $z'_{1}$ = 50 km) and for reservoir sampling (j, k) = [4, 32] has also been evaluated as a function of the optical SNR of the received signal after transmission (Supplementary Fig. 7).

Conclusions

Following a pattern recognition approach, our photonic RC-based hardware platform can efficiently process fibre transmission signals that suffer from deterministic transmission impairments. The adopted processing concept is generic and might be applied to various contextual pattern recognition tasks. In the context of communication systems, it can be extended for transmission systems with advanced modulation formats at even higher bit-rates. The concept can also be applied to transmission systems with quadrature modulation formats or coherent receivers. Information that may be encoded in the phase space can be easily converted into an amplitude signal and fed into the photonic reservoir as a microwave modulating signal, as presented in Fig. 2. This approach has proven to be efficient since the reservoir’s states are not disturbed by the phase properties of the transmission signal. Moreover, the polarization state of the received signal after transmission does not interfere with the active polarization state of the reservoir. A future challenge of the proposed scheme is to extend it to a wavelength division multiplexed (WDM) transmission environment, with the presence of nonlinearities that originate from neighbouring channels. The high launched optical power conditions we considered in this work will induce four-wave mixing effects that might degrade the detection performance. The training might need to be performed also on the patterns of neighbouring channels that affect a specific channel’s bit sequence. In a scenario like this, the required training data sets are expected to be significantly larger in order to achieve efficient training. It is an open question to what extent the photonic reservoir’s nonlinear transformation and fading memory will offer improved detection capabilities. Finally, the presented time-multiplexing approach that feeds the input signal into the reservoir represents only one possible coding method. 
Complementary methods – based on spectral or spatial multiplexing⁶⁵ – for high-dimensional mapping of the input to the reservoir states are envisaged to minimize speed penalty and eventually lead to real-time binary classification.

Methods

Numerical simulation of fibre-optic transmission systems

We model numerically the fibre transmission using a coupled nonlinear Schrödinger equation (CNLSE) propagation model²⁸,⁶⁶ in the presence of two orthogonal polarization modes, also including fibre attenuation, chromatic dispersion, Kerr nonlinearity, inter-channel nonlinear effects (cross-phase modulation and four-wave mixing) and stimulated Raman scattering. However, some of the above phenomena are not activated in our investigations due to the selected properties of the transmission systems. For example, we consider only single-channel transmission; thus inter-channel nonlinear effects are not present. Moreover, the output optical signal from transmission is photodetected and the electrically converted signal is used as an input to the photonic reservoir. In this case, the reservoir system becomes robust to the polarization state of the received optical signal. Hence, for the sake of simplicity we write the nonlinear Schrödinger equation in a form that describes only the phenomena that critically affect the transmitted signal. The slowly varying optical field E_tr(t, z) that travels along the SSMF is then given by:

$$i\frac{\partial {E}_{tr}}{\partial z}+i\frac{{a}_{loss}}{2}{E}_{tr}-\frac{{\beta }_{2}}{2}\frac{{\partial }^{2}{E}_{tr}}{\partial {t}^{2}}+\gamma {|{E}_{tr}|}^{2}{E}_{tr}=0$$

(1)

where z is the distance in km, t is the relative time in the frame that moves with the envelope velocity, a_loss is the fibre transmission loss coefficient, β₂ is the chromatic dispersion coefficient and γ is the instantaneous Kerr nonlinearity coefficient. We consider a DFB single-mode laser that emits 10 mW of optical power at 1550 nm, with a linewidth of 0.1 MHz. This launched power level induces Kerr nonlinearity in the transmission line. We consider a typical parameter set for the SSMF transmission: a_loss = 0.2 dB/km, β₂ = 21.7 ps²∙km⁻¹ and γ = 3∙10⁻²⁹ km²∙mW⁻¹. In the long-haul transmission modules, the parameter set for the DCF is: α_DCF = 0.6 dB/km, β_2,DCF = −128 ps²∙km⁻¹ and γ_DCF = 2.6∙10⁻²⁹ km²∙mW⁻¹. The amplification unit provides a gain of G = 30.2 dB, with a noise figure NF = 5 dB, while the optical filter has a Gaussian profile with a 3-dB optical bandwidth equal to 4 times the signal bit-rate. The received optical signals are photodetected with a typical PIN photoreceiver with a responsivity of 0.9 A/W. Thermal noise and dark current noise are included in the photodetection stage. Finally, the signals are electrically filtered with a low-pass, 4th-order Bessel filter with a cut-off frequency of 0.8 times the data bit-rate. The numerically generated signals are used to feed our experimentally built reservoir, as well as the numerical investigations of the RC operation.
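Equation (1) is commonly integrated with a symmetric split-step Fourier scheme. The following Python sketch shows one such integrator; it is an illustration under stated assumptions, not the simulation code used in this work. The function name `split_step`, the number of steps and the value of `gamma_per_mw_km` (a typical SSMF order of magnitude expressed in mW⁻¹·km⁻¹, chosen for illustration and not taken from the parameter set above) are assumptions. The sign conventions follow Eq. (1) as written, and the loss is converted from dB/km to a linear power coefficient.

```python
import numpy as np

def split_step(E0, dt_ps, length_km, a_loss_db_km=0.2, beta2_ps2_km=21.7,
               gamma_per_mw_km=1.3e-3, n_steps=500):
    """Propagate a field envelope (|E|^2 in mW) through a fibre, following Eq. (1).

    gamma_per_mw_km is an assumed illustrative value, not the one quoted in the text.
    """
    alpha = a_loss_db_km * np.log(10.0) / 10.0              # power loss in 1/km
    omega = 2.0 * np.pi * np.fft.fftfreq(E0.size, d=dt_ps)  # angular frequency, rad/ps
    dz = length_km / n_steps
    # Linear half step: field attenuation and chromatic dispersion.
    # From Eq. (1): dE/dz = -(alpha/2) E - i (beta2/2) d2E/dt2 + i gamma |E|^2 E,
    # and d2/dt2 becomes -omega^2 in the Fourier domain.
    half_linear = np.exp((-alpha / 2.0 + 0.5j * beta2_ps2_km * omega**2) * dz / 2.0)
    E = np.asarray(E0, dtype=complex)
    for _ in range(n_steps):
        E = np.fft.ifft(half_linear * np.fft.fft(E))
        E = E * np.exp(1j * gamma_per_mw_km * np.abs(E)**2 * dz)  # Kerr phase rotation
        E = np.fft.ifft(half_linear * np.fft.fft(E))
    return E

# Hypothetical test pulse: Gaussian, 10 mW peak power, 20 ps 1/e half-width.
t = (np.arange(2048) - 1024) * 1.0                          # time grid in ps
pulse = np.sqrt(10.0) * np.exp(-t**2 / (2.0 * 20.0**2))
out = split_step(pulse, dt_ps=1.0, length_km=50.0)
loss_db = -10.0 * np.log10(np.sum(np.abs(out)**2) / np.sum(np.abs(pulse)**2))
print(f"energy loss over 50 km: {loss_db:.2f} dB")
```

With a_loss = 0.2 dB/km, the pulse energy after 50 km should drop by 10 dB, which gives a quick sanity check on the loss conversion. Noise, amplification and filtering stages are not included in this sketch.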

Input signal masking

Each bit is represented by an analogue pattern of j samples. If j = k, each sample a_jⁱ of the i^th bit is masked by a random value m_k ∈ [0, 1] and the dimensionality of the corresponding state space is preserved. This condition applies for our experimental implementation. Conventionally, the classification performance in machine learning techniques is achieved by nonlinearly mapping the input onto a higher dimensional state space. The higher dimensionality results in a higher chance of linear separability of the pattern classes that need to be distinguished. Here, we demonstrate that for this mechanism to work it is sufficient to maintain the state space dimensionality defined by the number of samples. Nevertheless, due to the masking, we create a higher dimensional representation of the pattern in the state space. If the analogue input is sampled with fewer samples than the reservoir’s virtual nodes (j < k) used for classification, each sample a_jⁱ is masked with two or more mask values of the m vector. In this way the dimensionality of the state space is increased. The condition mod(k, j) = 0 should be preserved so that all samples are equivalently masked. When considering short reservoirs with a small number of virtual nodes, the random mask vector m consists only of a few values. Thus, the dependence of the system performance on the chosen mask becomes significant. In the numerical results shown in Fig. 4, the selection of the mask is critical – especially when k ≤ 16. For this reason, 10 different random masks have been considered for evaluating the BER performance for each investigated (j, k) case. The maps presented in Fig. 4 refer to those masks that lead to the lowest BER values. In contrast, in all the experimentally investigated scenarios, the dependence of the BER performance on different tried random mask sequences was insignificant. When k value is small, i.e. 
for a mask vector of just a few values, we work in a lower-dimensional state space, and in this case the randomness of the mask is effectively lost. There are insights into how to select a good mask, and optimization procedures have also been proposed⁶⁷. However, here we chose a mask that consists of random values for reasons of simplicity.
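The masking step can be sketched in a few lines of Python. This is an illustration under assumptions, not the code used in this work: the function name `mask_bit` is hypothetical, and for j < k each sample is assumed to be held for k/j consecutive virtual nodes before being multiplied by the mask, which is one way to ensure that every sample is masked by the same number of mask values.

```python
import numpy as np

def mask_bit(samples, mask):
    """Map the j analogue samples of one bit onto k masked virtual-node inputs."""
    samples = np.asarray(samples, dtype=float)
    mask = np.asarray(mask, dtype=float)
    j, k = samples.size, mask.size
    if k % j != 0:
        raise ValueError("k must be a multiple of j, i.e. mod(k, j) = 0")
    held = np.repeat(samples, k // j)   # each sample feeds k/j consecutive nodes
    return held * mask                  # element-wise masking with m_k

rng = np.random.default_rng(seed=1)
mask = rng.random(32)                         # random mask values in [0, 1), k = 32
bit_pattern = np.array([0.1, 0.7, 0.9, 0.3])  # hypothetical samples of one bit, j = 4
node_inputs = mask_bit(bit_pattern, mask)     # 32 masked values for one delay loop
```

For j = k the call reduces to a plain element-wise product, so the state-space dimensionality set by the number of samples is preserved.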

Experimental reservoir configuration

All devices used in the experiment are commercial devices. The response laser is a discrete-mode quantum-well SL from Eblana Photonics emitting at 1542 nm, with a longitudinal mode separation of 145 GHz, a side mode suppression ratio of 40 dB and a threshold current of I_th = 11.2 mA. The response laser is biased at I_bias = 11.1 mA, corresponding to 1% below solitary threshold. For maximum feedback conditions of the reservoir (zero attenuation at the OA, within the feedback loop), the threshold current is reduced to I_th,fb = 10.5 mA. The injection laser is a DFB laser from Toptica, also emitting at 1542 nm, with >30 dB side mode suppression ratio and high power emission (up to 40 mW). Temperature and bias current were stabilized with 0.01 K and 0.01 mA accuracy, respectively. The masked signal from transmission is imposed on the emitted carrier of the injection laser through a 20 GHz Eospace Mach-Zehnder amplitude modulator (MZM). The output optical signal from the reservoir is amplified by a Covega semiconductor optical amplifier (SOA), filtered by a Santec OTF350 tunable optical filter and detected by a Miteq SCMR-100K20G avalanche photodiode with 20 GHz bandwidth and 2 kΩ transimpedance gain. Signal monitoring is performed by a Lecroy Wavemaster 816Zi oscilloscope and an Aragon Photonics high-resolution optical spectrum analyser with 10 MHz resolution. Optical isolators with 50 dB isolation are included in the optical path to suppress unwanted reflections.

Training and testing

We generate independent binary streams of 40960 bits using different random seeds. We use one stream for training and cross-validation and a second stream as the testing set. From the first stream, 75% of the bits are used for training and 25% as the cross-validation set. We use a ridge regression algorithm with 10 repetitions of Monte-Carlo cross-validation⁶⁸ and a ridge parameter of 0.01. The test set is finally used to evaluate the BER performance on an independent data set that the system was not trained on. In this way we avoid possible pitfalls of biased training⁶⁹. The reservoir’s node responses r_kⁱ that correspond to the i^th bit duration are used to train the linear classifier. In this case we consider only a 1-bit timeframe for training (that of the currently predicted i^th bit). However, the predicted bit stream has the generic form of a weighted sum of these responses: ${\tilde{b}}^{i}={\sum }_{k,n}{w}_{k}\cdot {r}_{k}^{n}$, where n indexes an extended range of neighbouring bit responses. Due to fibre nonlinearities, signal properties at neighbouring bit timeframes can include critical information that can be exploited by the classifier. Here n ∈ [i − m_p, i + m_c], with m_p ∈ ℕ being the number of previous bits and m_c ∈ ℕ the number of subsequent bits, all used for the i^th bit classification. The i^th bit prediction is made with a latency of m_c bits. In order to have access to the previous and subsequent bit timeframes, the bit stream considered is slightly shorter (40960 − m_p − m_c bits). The optimum weights are found by minimizing the difference between ${\tilde{b}}^{i}$ and bⁱ over all the bits in the training set. The optimal w_k values are determined using the linear Moore–Penrose pseudo-inverse operator (denoted by †), which remains applicable when the response matrix is singular. If the target matrix $\tilde{B}$ contains the targets ${\tilde{b}}^{i}$ for all of the reservoir’s loops τ that are used for the training, and the response matrix S contains all the reservoir’s responses for the same loops, then the matrix $W=\tilde{B}\cdot {S}^{\dagger }$ contains the optimal weights. Training is performed offline on a computer and takes no longer than several seconds.
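The readout training can be illustrated with a short Python sketch. It is a minimal example on synthetic data, not the training code used in this work: the function name `train_readout`, the matrix shapes and the random test data are assumptions. Without regularization it evaluates W = B·S† with the pseudo-inverse; with a ridge parameter λ it uses the standard closed form W = B·Sᵀ(S·Sᵀ + λI)⁻¹.

```python
import numpy as np

def train_readout(S, b, ridge=0.0):
    """Fit readout weights w such that w @ S approximates the targets b.

    S: response matrix, shape (n_features, n_bits), one column per bit.
    b: target bits, shape (n_bits,).
    """
    S = np.asarray(S, dtype=float)
    b = np.asarray(b, dtype=float)
    if ridge == 0.0:
        return b @ np.linalg.pinv(S)              # W = B . S^dagger
    gram = S @ S.T + ridge * np.eye(S.shape[0])   # regularized Gram matrix
    return np.linalg.solve(gram, S @ b)           # ridge regression solution

# Synthetic check (hypothetical data): recover known weights from noiseless targets.
rng = np.random.default_rng(seed=0)
S = rng.random((32, 2000))          # 32 node responses for 2000 bits
w_true = rng.normal(size=32)
b = w_true @ S
w_pinv = train_readout(S, b)
w_ridge = train_readout(S, b, ridge=0.01)
```

To include m_p previous and m_c subsequent bit timeframes, the responses of the neighbouring bits would be stacked as additional rows of S, so that each column holds k·(m_p + m_c + 1) features; that stacking is left out here for brevity.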

BER calculation

The BER is calculated from the predicted bit stream ${\tilde{b}}^{i}$, following the same procedure as for conventional communication systems. A decision threshold is set in a binary comparator so as to minimize the BER. Since 25% of the bit stream (10240 bits) is used for cross-validation, we can measure a minimum BER value of ~9.8∙10⁻⁵ for the training performance. The independent data streams that are used as test sets are evaluated using all 40960 bits, thus allowing a minimum BER value of ~2.4∙10⁻⁵. Depending on the values of m_p and m_c, the total length of the test bit sequence might be slightly shorter. However, this does not noticeably affect the minimum BER we can obtain from our measurements. Our interest focuses on BER levels around 10⁻³ – one order of magnitude higher than the minimum value we can measure – where hard-decision FEC methods operate and offer error-free operation.
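The BER evaluation can be sketched as follows. The helper names `bit_error_rate` and `best_threshold` and the scan over candidate thresholds are assumptions made for illustration; the text only states that the threshold is chosen to minimize the BER. The minimum measurable BER corresponds to a single error in the evaluated sequence, i.e. 1/N for N bits.

```python
import numpy as np

def bit_error_rate(predicted, bits, threshold):
    """Fraction of bits decided wrongly by a binary comparator at the given threshold."""
    decisions = (np.asarray(predicted) > threshold).astype(int)
    return float(np.mean(decisions != np.asarray(bits)))

def best_threshold(predicted, bits, candidates):
    """Scan candidate thresholds and return the one with the lowest BER."""
    bers = [bit_error_rate(predicted, bits, th) for th in candidates]
    i = int(np.argmin(bers))
    return candidates[i], bers[i]

# Smallest non-zero BER resolvable with the sequence lengths quoted above.
min_ber_validation = 1 / 10240   # cross-validation set
min_ber_test = 1 / 40960         # test set
```

These two ratios reproduce the quoted limits of about 9.8∙10⁻⁵ and 2.4∙10⁻⁵.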

Numerical model of the photonic reservoir

We implemented a reservoir nonlinearity that follows the Lang-Kobayashi rate equation model of a SL with time-delayed feedback⁷⁰, with an additional optical injection term with frequency detuning. An analytical description of the implemented model under optical feedback and optical injection of frequency detuned signals can be found in⁷¹. The modelled rate equations for the response SL operation are:

$$\frac{d{E}_{r}(t)}{dt}=\frac{1}{2}(1+ja)[{G}_{r}(t)-{t}_{ph}^{-1}]\cdot {E}_{r}(t)+\frac{{k}_{f}}{{t}_{in}}\cdot {E}_{r}(t-\tau ){e}^{j{\omega }_{0}\tau }+\frac{{k}_{inj}}{{t}_{in}}\cdot {E}_{inj}(t){e}^{-j{\rm{\Delta }}\omega t}+\sqrt{D}\xi (t)$$

(2)

$$\frac{d{N}_{r}(t)}{dt}=\frac{I}{e}-\frac{{N}_{r}(t)}{{t}_{s}}-{G}_{r}(t)\cdot {|{E}_{r}(t)|}^{2}$$

(3)

$${G}_{r}(t)={g}_{n}\cdot {[1+s{|{E}_{r}(t)|}^{2}]}^{-1}\cdot [{N}_{r}(t)-{N}_{0}]$$

(4)

The angular frequency detuning Δω = 2π∙Δf is defined between the emission frequencies of the injection and the response laser. A Gaussian white noise ξ(t) is included for the electrical field with amplitude D = 30 ns⁻¹. The bias current for all lasers is set to I = 15.3·10⁻³A (just below threshold current of solitary operation I_th = 15.37·10⁻³A), while the remaining set of parameters is: a = 3, s = 5·10⁻⁷, N₀ = 1.5·10⁸, g_n = 1.2·10⁻⁵ ns⁻¹, t_s = 2 ns, t_in = 10⁻² ns, t_ph = 2·10⁻³ ns, e = 1.602·10⁻¹⁰A·ns. These parameters do not simulate exactly the SLs that were used in the experiments; however, they describe the general performance of the system. The response SL’s angular frequency is ω₀ = 2πc/λ₀ where λ₀ = 1550 nm and c is the speed of light. We define an injection field with the parameters: k_inj = 0.15 and E_inj,0 = 100, so that after conversion this corresponds to an injection optical power of 0.6 mW. This coincides approximately with the optical power that we used in our experiment. When considering the modulation properties of the injection signal due to the masked input, the injected electrical field is of the form:

$${E}_{inj}(t)={E}_{inj,0}\cdot [{b}_{bias}+m(t)\cdot {\rm{a}}(t)]$$

(5)

Here, m(t) ∈ [0, 1] is the mask sequence, a(t) is the photodetected signal after transmission (also normalized to the range [0, 1]) and b_bias is a bias term that ranges from 0 to 1 and controls the average optical power that is injected into the response laser. The feedback ratio k_f in this model expresses the fraction of the response SL’s emitted electrical field that is redirected back to the SL. As presented in Fig. 4, a feedback value of k_f ~ 0.05 results in an optimal operation of the reservoir. When converting this electrical field ratio into optical power attenuation with respect to the optical power emitted by the response SL, we get a feedback attenuation of ~26 dB. This theoretically estimated value is consistent with the experimental optimal conditions we observed for the short-reach transmission system with the long reservoir delay. As shown in Supplementary Fig. 5, the optical attenuation for which we achieve the optimal BER performance (case of 9τ training) is 12 dB. The fibre loop itself has an initial loss of 7 dB, due to optical splitters and fibre components included in it. An additional loss of ~6 dB should also be considered from the dual-pass loss between the SL facet and the fibre pigtail. In total, the experimental feedback attenuation is estimated to be ~25 dB, very close to the numerically predicted value.
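Two of the numerical statements in this section can be checked with a few lines of Python. The sketch below is illustrative only; the helper names are hypothetical. The threshold estimate follows from Eqs. (3) and (4) at solitary threshold, where the gain equals the photon loss rate (G_r = 1/t_ph) and the field is negligible, so that N_th = N₀ + 1/(g_n·t_ph) and I_th = e·N_th/t_s. The value N₀ = 1.5·10⁸ is treated here as an assumption; it is the order of magnitude for which this estimate comes close to the quoted threshold of 15.37 mA. The attenuation estimate converts a field ratio k into a power attenuation of −20·log₁₀(k) dB.

```python
import math

# Model parameters as quoted in the text (time in ns, charge in A.ns).
e = 1.602e-10     # elementary charge, A.ns
t_s = 2.0         # carrier lifetime, ns
t_ph = 2e-3       # photon lifetime, ns
g_n = 1.2e-5      # differential gain, 1/ns
N_0 = 1.5e8       # carrier number at transparency (assumed value, see lead-in)

def threshold_current(e, t_s, t_ph, g_n, N_0):
    """Solitary threshold current from Eqs. (3)-(4) with G_r = 1/t_ph and |E_r|^2 -> 0."""
    N_th = N_0 + 1.0 / (g_n * t_ph)
    return e * N_th / t_s

def field_ratio_to_db(k):
    """Optical power attenuation in dB for an electrical field ratio k."""
    return -20.0 * math.log10(k)

I_th = threshold_current(e, t_s, t_ph, g_n, N_0)   # in A
feedback_db_model = field_ratio_to_db(0.05)        # numerical optimum k_f ~ 0.05
feedback_db_experiment = 12.0 + 7.0 + 6.0          # attenuator + loop + facet coupling, dB
print(f"I_th = {I_th * 1e3:.2f} mA")
print(f"model: {feedback_db_model:.1f} dB, experiment: {feedback_db_experiment:.0f} dB")
```

The small difference between the estimated threshold and the quoted 15.37 mA is attributed here to rounding of the listed parameters.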

Speed penalty

In the simplified approach we adopt in this work, the input layer of the reservoir is implemented by feeding the input sequence using a time-multiplexing approach (sequential introduction of bits). Specifically, we assign the analogue pattern of one bit to be included within one feedback delay of the reservoir. This condition allows the inherent memory of the reservoir to introduce connectivity between subsequent bits (samples that are one τ apart in time), but at the same time induces a speed penalty. In principle, the bit durations (R⁻¹) we are handling in optical communication systems are much smaller than the reservoir’s time-delay τ. In this work we make a time-stretch of the bit time duration R⁻¹ to fit in one τ duration offline. The speed penalty of this RC post-processing step is defined as τ/R⁻¹. In practice, several time-stretching methodologies⁷² could apply for an actual implementation of this step.
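As a worked example of the definition above, the speed penalty τ/R⁻¹ equals the product of the delay and the bit rate. The short Python sketch below uses τ = 1.6 ns, the delay considered in the results, together with a bit rate of 25 Gb/s that is chosen purely as a hypothetical illustration; the function name `speed_penalty` is likewise an assumption.

```python
def speed_penalty(tau_ns, bit_rate_gbps):
    """Ratio of the reservoir delay tau to the bit duration 1/R."""
    bit_duration_ns = 1.0 / bit_rate_gbps   # R^-1 in ns for R in Gb/s
    return tau_ns / bit_duration_ns

penalty = speed_penalty(tau_ns=1.6, bit_rate_gbps=25.0)  # hypothetical bit rate
print(f"speed penalty: {penalty:.0f}x")
```

Under these assumed numbers each bit has to be stretched by a factor of about 40 to fill one delay loop, which is the gap that spectral or spatial multiplexing of the input would aim to reduce.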

References

Shen, Y. et al. Deep learning with coherent nanophotonic circuits. Nat. Phot. 11, 441–446 (2017).
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
Bishop, C. M. Pattern recognition and machine learning (Springer-Verlag New York, Inc. Secaucus, NJ, USA, 2006).
Bahdanau, D., Cho, K. & Bengio, Y. Neural machine translation by jointly learning to align and translate. arXiv:1409.0473 (2014).
Sutskever, I., Vinyals, O. & Le, Q. V. Sequence to sequence learning with neural networks. Proc. Adv. Neural Inf. Process. Syst., pp. 3104–3112 (2014).
Krizhevsky, A., Sutskever, I. & Hinton, G. E. ImageNet classification with deep convolutional neural networks. Proc. Adv. Neural Inf. Process. Syst., pp. 1097–1105 (2012).
Torrejon, J. et al. Neuromorphic computing with nanoscale spintronic oscillators. Nature 547, 428–431 (2017).
Editorial. Big data needs a hardware revolution. Nature 554, 145–146 (2018).
Brunner, D., Soriano, M. C. & Fischer, I. High-speed optical vector and matrix operations using a semiconductor laser. IEEE Photon. Technol. Lett. 25(17), 1680–1683 (2013).
Ortín, S. et al. A unified framework for reservoir computing and extreme learning machines based on a single time-delayed neuron. Sci. Rep. 5, 14945 (2015).
Jaeger, H. & Haas, H. Harnessing non-linearity: predicting chaotic systems and saving energy in wireless communication. Science 304, 78–80 (2004).
Maass, W., Natschläger, T. & Markram, H. Real-time computing without stable states: A new framework for neural computation based on perturbations. Neural Comput. 14(11), 2531–2560 (2002).
Appeltant, L. et al. Information processing using a single dynamical node as complex system. Nat. Commun. 2, 468 (2011).
Hammer, B., Schrauwen, B. & Steil, J. J. Recent advances in efficient learning of recurrent networks. Proc. Eur. Symp. Artif. Neural Netw., 213–226 (2009).
Larger, L. et al. Photonic information processing beyond Turing: an optoelectronic implementation of reservoir computing. Opt. Express 20, 3241–3249 (2012).
Paquot, Y. et al. Optoelectronic reservoir computing. Sci. Rep. 2, 287 (2012).
Martinenghi, R., Rybalko, S., Jacquot, M., Chembo, Y. K. & Larger, L. Photonic nonlinear transient computing with multiple-delay wavelength dynamics. Phys. Rev. Lett. 108, 244101 (2012).
Larger, L. et al. High-speed photonic reservoir computing using a time-delay-based architecture: million words per second classification. Phys. Rev. X 7, 011015 (2017).
Vandoorne, K., Dambre, J., Verstraeten, D., Schrauwen, B. & Bienstman, P. Parallel reservoir computing using optical amplifiers. IEEE Trans. Neural Netw. 22, 1469–1481 (2011).
Brunner, D., Soriano, M. C., Mirasso, C. R. & Fischer, I. Parallel photonic information processing at gigabyte per second data rates using transient states. Nat. Commun. 4, 1364 (2013).
Hicke, K. et al. Information processing using transient dynamics of semiconductor lasers subject to delayed feedback. IEEE J. Sel. Top. Quantum Electron. 19, 1501610 (2013).
Vandoorne, K. T. et al. Experimental demonstration of a reservoir computing on a silicon photonics chip. Nat. Commun. 5, 3541 (2014).
Dejonckheere, A. et al. All-optical reservoir computer based on saturation of absorption. Opt. Exp. 22(9), 10868–10881 (2014).
Vinckier, Q. et al. High performance photonic reservoir computer based on a coherently driven passive cavity. Optica 2, 438–446 (2015).
Duport, F., Smerieri, A., Akrout, A., Haelterman, M. & Massar, S. Fully analogue photonic reservoir computer. Sci. Rep. 6, 22381 (2016).
Antonik, P. et al. Online training of an opto-electronic reservoir computer applied to real-time channel equalization. IEEE Trans. Neural Net. Learn. Syst. 28(11), 2686–2698 (2017).
Zhang, J., Yu, J. & Chien, C. 1.6Tb/s (4 × 400G) Unrepeatered transmission over 205-km SSMF using 65-GBaud PDM-16QAM with joint LUT pre-distortion and post DBP nonlinearity compensation. Proc. Optical Fiber Communication Conference 2017, Th2A.51 (2017).
Agrawal, G. P. Fibre-optic communication systems (Wiley-Blackwell, New York, 2010).
Cristofori, V. et al. 25-Gb/s transmission over 2.5-km SSMF by silicon MRR enhanced 1.55 μm III-V/SOI DML. IEEE Photon. Technol. Lett. 29(12), 960–963 (2017).
Motaghiannezam, S. M. R. et al. Single chip 52 Gb/s PAM4 transmission through −58 and +10 ps/nm chromatic dispersion using directly modulated laser. Proc. OFC 2016, Th2A.59 (2016).
Winzer, P. J. High-spectral-efficiency optical modulation formats. J. Lightwave Technol. 30, 3824–3835 (2012).
Torrengo, E. et al. Influence of pulse shape in 112-Gbit/s WDM PDM-QPSK transmission. IEEE Photon. Technol. Lett. 22, 1714–1716 (2010).
Bosco, G., Curri, V., Carena, A., Poggiolini, P. & Forghieri, F. On the performance of Nyquist-WDM terabit superchannels based on PM-BPSK, PM-QPSK, PM-8QAM or PM-16QAM subcarriers. J. Lightwave Technol. 29, 53–61 (2011).
Maher, R., Alvarado, A., Lavery, D. & Bayvel, P. Increasing the information rates of optical communications via coded modulation: a study of transceiver performance. Sci. Rep. 6, 21278 (2016).
Ip, E. & Kahn, J. M. Compensation of dispersion and nonlinear impairments using digital backpropagation. J. Lightwave Technol. 26, 3416–3425 (2008).
Savory, S. J. Digital filters for coherent optical receivers. Opt. Exp. 16, 804–817 (2008).
André, N. S., Habel, K., Louchet, H. & Richter, A. Adaptive nonlinear Volterra equalizer for mitigation of chirp-induced distortions in cost effective IMDD OFDM systems. Opt. Exp. 21, 26527–26532 (2013).
Derevyanko, S. A., Prilepsky, J. E. & Turitsyn, S. K. Capacity estimates for optical transmission on the nonlinear Fourier transform. Nat. Comm. 7, 12710 (2016).
Turitsyn, S. K. et al. Nonlinear Fourier transform for optical data processing and transmission: advances and perspectives. Optica 4, 307–322 (2017).
Zibar, D., Wymeersch, H. & Lyubomirsky, I. Machine learning under the spotlight. Nat. Phot. 11, 749–751 (2017).
Hunt, S. et al. Correcting errors in optical data transmission using neural networks. International Conference on Artificial Neural Networks 2010, Lecture Notes in Computer Science 6353, 448–457 (2010).
Thrane, J. et al. Machine learning techniques for optical performance monitoring from directly detected PDM-QAM signals. J. Lightwave Technol. 35, 868–875 (2017).
Zibar, D., Piels, M., Jones, R. & Schäeffer, C. G. Machine learning techniques in optical communication. J. Lightwave Technol. 34, 1442–1452 (2016).
Qin, J., Zhao, Q., Yin, H., Jin, Y. & Liu, C. Numerical simulation and experiment on optical packet header recognition utilizing reservoir computing based on optoelectronic feedback. IEEE Phot. J. 9(1), 7901311 (2017).
Jarajreh, M. A. et al. Artificial neural network nonlinear equalizer for coherent optical OFDM. IEEE Photon. Technol. Lett. 27(4), 387–390 (2015).
Gaiarin, S. et al. High speed PAM-8 optical interconnects with digital equalization based on neural network. In Proc. Asia Commun. Photon. Conf., AS1C-1 (2016).
Wang, D. et al. System impairment compensation in coherent optical communications by using a bio-inspired detector based on artificial neural network and genetic algorithm. Opt. Commun. 399, 1–12 (2017).
Owaki, S. & Nakamura, M. Equalization of optical nonlinear waveform distortion using neural-network based digital signal processing. In Proc. OptoElectron. Commun. Conf. (OECC)/Int. Conf. Photon. Switching (PS), WA2-40 (2016).
Shen, T. S. R. & Lau, A. P. T. Fiber nonlinearity compensation using extreme learning machine for DSP-based coherent communication systems. In Proc. Opto-Electron. Commun. Conf. (OECC), 816–817 (2011).
Rios-Müller, R., Estaran, J. M. & Renaudier, J. Experimental estimation of optical nonlinear memory channel conditional distribution using deep neural networks. In Proc. Opt. Fiber Commun. Conf., W2A–51 (2017).
Estaran, J. et al. Artificial neural networks for linear and non-linear impairment mitigation in high-baudrate IM/DD systems. In Proc. Eur. Conf. Opt. Commun. (ECOC), M.2.B.2 (2016).
Chen, E., Tao, R. & Zhao, X. Channel equalization for OFDM system based on the BP neural network. In Proc. Int. Conf. Signal Process., vol. 3 (2006).
Giacoumidis, E. et al. Fiber nonlinearity-induced penalty reduction in CO-OFDM by ANN-based nonlinear equalization. Opt. Lett. 40(21), 5113–5116 (2015).
Ahmad, S. T. & Kumar, K. P. Radial basis function neural network nonlinear equalizer for 16-QAM coherent optical OFDM. IEEE Photon. Technol. Lett. 28(22), 2507–2510 (2016).
Bauduin, M., Smerieri, A., Massar, S. & Horlin, F. Equalization of the non-linear satellite communication channel with an echo state network. Proc. IEEE 81st Veh. Technol. Conf., 1–5 (2015).
Simpson, T. B., Liu, J. M. & Gavrielides, A. Bandwidth enhancement and broadband noise reduction in injection-locked semiconductor lasers. IEEE Photon. Technol. Lett. 7(7), 709–711 (1995).
Lau, E. K. et al. Strong optical injection-locked semiconductor lasers demonstrating 100-GHz resonance frequencies and 80-GHz intrinsic bandwidths. Opt. Express 16(9), 6609–6618 (2008).
Lee, J. et al. Serial 103.125-Gb/s transmission over 1 km SSMF for low-cost, short-reach optical interconnects. Proc. OFC 2014, Th5A.5 (2014).
Kapinou, F. & Stojanovic, N. & Yu, Zhao. Toward cost-efficient 100 G metro networks IM/DD 10-GHz components, and MLSE receiver. J. Lightwave Technol. 33(19), 4109–4117 (2015).
http://www.ieee802.org/3/.
Bueno, J., Brunner, D., Soriano, M. C. & Fischer, I. Conditions for reservoir computing performance using semiconductor lasers with delayed optical feedback. Opt. Express 25(3), 2401–2412 (2017).
Ferrera, M. et al. On-chip CMOS-compatible all-optical integrator. Nat. Commun. 1, 29 (2010).
Liu, W. et al. A photonic temporal integrator with an ultra-long integration time window based on an InP-InGaAsP integrated ring resonator. J. Lightwave Technol. 32, 3654–3659 (2014).
Tzimpragos, G. et al. A survey on FEC codes for 100 G and beyond optical networks. IEEE Commun. Surv. & Tutor. 18(1), 209–221 (2014).
Bueno, J. et al. Reinforcement learning in a large scale photonic recurrent neural network. arXiv:1711.05133 (2017).
Newboult, G. K., Parker, D. F. & Faulkner, T. R. Coupled nonlinear Schrödinger equations arising in the study of monomode step‐index optical fibers. J. Math. Phys. 30, 930 (1989).
Nakayama, J. et al. Laser dynamical reservoir computing with consistency: an approach of a chaos mask signal. Opt. Exp. 24(8), 8679 (2016).
Picard, R. R. & Cook, R. D. Cross-validation of regression models. J. Am. Stat. Assoc. 79, 575–583 (1984).
Eriksson, T. A., Bülow, H. & Leven, A. Applying neural networks in optical communication systems: possible pitfalls. IEEE Photon. Technol. Lett. 29(23), 2091–2094 (2017).
Lang, R. & Kobayashi, K. External optical feedback effects on semiconductor injection laser properties. J. Quantum Electron. 16(3), 347–355 (1980).
Ohtsubo, J. Semiconductor lasers: Stability, instability and chaos. Springer series in optical sciences 111 (Springer international publishing, 4^th ed., 2017).
Mahjoubfar, A. et al. Time stretch and its applications. Nat. Photon. 11, 341–351 (2017).


Acknowledgements

We thank Claudio R. Mirasso, Miguel C. Soriano, Daniel Brunner, Moritz Pflüger and Silvia Ortín for helpful discussions. This work was supported by the Ministerio de Economía y Competitividad and FEDER via project IDEA (TEC2016-80063-C3), and by the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie contract 707068.

Author information

Authors and Affiliations

Instituto de Física Interdisciplinar y Sistemas Complejos IFISC (CSIC-UIB), Campus UIB, 07122, Palma de Mallorca, Spain
Apostolos Argyris, Julián Bueno & Ingo Fischer

Authors

Apostolos Argyris
Julián Bueno
Ingo Fischer

Contributions

A.A. and I.F. developed the idea, A.A., J.B. and I.F. planned the experiments, J.B. and A.A. performed the experiments, A.A. did the data analysis and A.A. and I.F. wrote the manuscript.

Corresponding author

Correspondence to Apostolos Argyris.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.


About this article

Cite this article

Argyris, A., Bueno, J. & Fischer, I. Photonic machine learning implementation for signal recovery in optical communications. Sci Rep 8, 8487 (2018). https://doi.org/10.1038/s41598-018-26927-y


Received: 03 April 2018
Accepted: 21 May 2018
Published: 31 May 2018
DOI: https://doi.org/10.1038/s41598-018-26927-y
