Abstract

Personalized recommendation methods are widely used to recommend commodities to target customers in the e-commerce sector. The core idea of personalized merchandise recommendation can also be applied to the financial field to achieve personalized stock recommendation. This paper proposes a new recommendation method that uses collaborative filtering based on user fuzzy clustering and predicts the trend of the resulting stocks based on money flow. We use an M/G/1 queueing system with multiple vacations and server close-down time to measure practical money flow. Based on the indicated money flow results, we select the more valuable stocks to recommend to investors. The experimental results show that the proposed method provides investors with reliable practical investment guidance and higher returns.

1. Introduction

The scale of the stock market keeps growing, and stock investment, as a high-risk and high-reward form of investment, attracts a great deal of attention. For investors, the best investment is one that yields the maximum benefit while bearing the lowest risk. However, in such an ever-changing and complicated stock market, institutions with strong financial strength can use state-of-the-art stock models for their investment operations and thus have obvious professional advantages [1]. The majority of ordinary small and medium-sized stock investors, by contrast, still face risks that come from falling stock prices, single information sources, and improper investment choices. Relying only on fundamental analysis and technical analysis for investment decision-making is not enough. Money flow, a common technical index in the stock market, is becoming increasingly mature in practical research. Therefore, a personalized recommendation method built on money flow is crucial.

Recommendation systems [2] realize personalized service well. They can be used not only in electronic commerce but also for recommending futures, stocks, and other products. Current stock recommendation methods fall mainly into two types: online stock recommendation methods based on stock comments and price forecasting models based on mathematical analysis. The former cannot meet investors’ demand for personalized recommendations, and the latter is relatively complex to apply and difficult for investors to understand and master.

Given this situation, many scholars at home and abroad have studied stock recommendation. Cowels (1993) [3] found, after long-term follow-up observation of the stocks recommended by U.S. stock organizations, that the performance of those stocks could not satisfy the expected profit of investors. Ye (2011) [4] studied the problem of personalized recommendation in financial marketing by applying content-based personalized recommendation built on a mean-constraint sequential pattern mining method. Xin et al. [5] gave a strategy for filtering out users with similar demand characteristics by using a collaborative recommendation algorithm with fuzzy clustering, which exhibits an excellent recommendation effect. Xu [6] used regression equations to analyze the relation between shareholders’ personality traits and their investment decisions and established a relational model among investment decisions, personality traits, and investment behavior.

However, the actual utility of stock recommendation methods based on stock comment information remains to be verified, and their blindness and lack of personalization are inherent weaknesses that form the bottleneck to their wider application. How to integrate data mining, personalized recommendation, and other nontraditional technologies into stock recommendation has become a hot topic in the financial field. Our goal in this paper is to propose a novel personalized stock recommendation method based on money flow. First, we use a collaborative filtering algorithm [7] based on user fuzzy clustering to generate an optimized stock list [8]. Fuzzy clustering can greatly improve the speed of the online nearest-neighbor search and meet the real-time requirement of a recommendation system [9–14]. Then we use an M/G/1 queueing system [15] with multiple vacations and server close-down time to measure practical money flow. Judging money flow is crucial to analyzing the stock market trend and the behavior of individual stocks.

To analyze the flow of funds quantitatively, we need to observe trading volume and turnover. In other words, money flow measures the ups and downs of the market by evaluating the current state of money inflow into or outflow from stocks in the stock market. The new personalized recommendation method based on money flow uses these indicators to let investors gauge the capital and the pulse of the whole market, and it also considers investors’ preferences and behavior characteristics, which remedies some deficiencies of current stock recommendation methods. In addition, the proposed method can analyze and filter the returns of the recommended stocks and improve the investment benefits of investors. The experimental results show that the proposed method provides investors with valid personalized recommendations.

The rest of this paper is organized as follows. Section 2 describes the collaborative filtering algorithm based on user fuzzy clustering. Section 3 presents a new model for studying money flow. Section 4 presents the simulation experiment of the proposed method. Finally, Section 5 draws conclusions and outlines future work.

2. Collaborative Filtering Algorithm Based on User Fuzzy Clustering

In this section, we review traditional personalized commodity recommendation and borrow its core idea for personalized stock recommendation. We use the collaborative filtering algorithm based on user fuzzy clustering to generate an optimized stock set. Before the personalized stock recommendation is made, investors are classified into groups according to a series of investor segmentation indexes, which is what guarantees relatively accurate recommendation results.

2.1. Fuzzy Clustering Analysis

Ordinary clustering methods divide the data samples into different categories. Fuzzy clustering analysis [16, 17], in contrast, constructs a fuzzy matrix from the attributes of the objects themselves and determines the classification relationship on that basis. It can therefore embody uncertainty and reflect the real world more objectively.

We subdivide the investors as follows. We build the investor clustering index system from age, work, years, income level, exchange amount, exchange cycle, and so on. Using these observable characteristics as the segmentation indexes, we can effectively distinguish the trading habits of different investors and cluster similar investors together. The resulting clusters group investors with similar trading dynamics, which provides more effective and accurate recommendation information. We then use the fuzzy clustering analysis method to subdivide the investors [18].

2.1.1. Data Collection

Assume that a stock scoring database exists that contains the scores given to stocks by investors. Given the characteristics of the investors under the above segmentation indexes, we define the set of investors as $X = \{x_1, x_2, \ldots, x_m\}$, and the membership degrees of the $i$th investor in the set are represented as a vector in a fixed order, $x_i = (x_{i1}, x_{i2}, \ldots, x_{in})$, where $x_i$ belongs to $X$, $i = 1, 2, \ldots, m$. $x_{ij}$ denotes the membership degree of the $j$th element in this set. Quantifying the investor subdivision indexes gives the feature vectors $x_i$, $i = 1, 2, \ldots, m$, and the feature indexes of all investors form a matrix:
$$X = (x_{ij})_{m \times n},$$
where $x_{ij}$ represents the $j$th characteristic index of the $i$th object.

2.1.2. Data Standardization

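Each column of the matrix, that is, each characteristic index, is standardized so that indexes measured in different units become comparable. After standardizing the data, we obtain the standardized feature index matrix, whose entry $x'_{ij}$ is the standardized value of the $j$th characteristic index of the $i$th investor.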
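The standardization step can be sketched as follows. The exact transformation is not recoverable from the text, so this sketch assumes a common choice: a column-wise standard deviation (z-score) transformation followed by a range (min-max) transformation into $[0, 1]$. The function name `standardize` and the sample data are hypothetical.

```python
import math


def standardize(matrix):
    """Column-wise z-score, then min-max scaling of each column into [0, 1].

    Assumption: this two-step transformation is a common textbook choice,
    not necessarily the one used in the paper.
    """
    n_rows, n_cols = len(matrix), len(matrix[0])
    result = [[0.0] * n_cols for _ in range(n_rows)]
    for j in range(n_cols):
        column = [row[j] for row in matrix]
        mean = sum(column) / n_rows
        std = math.sqrt(sum((x - mean) ** 2 for x in column) / n_rows)
        # A constant column carries no information, so it maps to zeros.
        z = [(x - mean) / std if std > 0 else 0.0 for x in column]
        low, high = min(z), max(z)
        for i in range(n_rows):
            result[i][j] = (z[i] - low) / (high - low) if high > low else 0.0
    return result


# Hypothetical investors described by (age, monthly income).
features = [[25, 3000.0], [40, 8000.0], [55, 5000.0]]
scaled = standardize(features)
```

After this step every index lies in $[0, 1]$, so an index with large raw values, such as income, no longer dominates the similarity computation.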

2.1.3. Constructing Fuzzy Similar Matrix

According to the standardized data, the similarity coefficient method is used to determine the similarity coefficient $r_{ij}$ between investor $i$ and investor $j$, $i, j = 1, 2, \ldots, m$, and to establish the fuzzy similar matrix.

If $r_{ij}$ is negative, or more generally does not belong to $[0, 1]$, it is transformed so that it falls within $[0, 1]$.

Thus, we establish an $m \times m$ fuzzy similar matrix $R = (r_{ij})$ between the $m$ shareholders.
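As an illustration of this step, the sketch below builds a fuzzy similar matrix under two assumptions that the text does not confirm: the similarity coefficient is the correlation coefficient between two investors’ standardized feature vectors, and a value $r$ is mapped into $[0, 1]$ by $(r + 1)/2$. The names `correlation` and `fuzzy_similar_matrix` are hypothetical.

```python
import math


def correlation(a, b):
    """Correlation coefficient of two equally long feature vectors."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    num = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    den = math.sqrt(sum((x - mean_a) ** 2 for x in a)) * math.sqrt(
        sum((y - mean_b) ** 2 for y in b)
    )
    return num / den if den > 0 else 0.0


def fuzzy_similar_matrix(rows):
    """Reflexive, symmetric matrix with entries in [0, 1]."""
    m = len(rows)
    r = [[1.0] * m for _ in range(m)]  # diagonal stays 1 (reflexivity)
    for i in range(m):
        for j in range(i + 1, m):
            # Assumed mapping from [-1, 1] to [0, 1]; clamp rounding noise.
            value = (correlation(rows[i], rows[j]) + 1.0) / 2.0
            r[i][j] = r[j][i] = max(0.0, min(1.0, value))
    return r


# Hypothetical standardized feature vectors of three investors.
investors = [[0.0, 0.5, 1.0], [0.1, 0.6, 0.9], [1.0, 0.5, 0.0]]
R = fuzzy_similar_matrix(investors)
```

Investors 1 and 2 have nearly parallel profiles and receive a similarity close to 1, while investors 1 and 3 have opposite profiles and receive a similarity close to 0.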

2.1.4. Clustering

With the fuzzy similar matrix $R$, we use the square self-synthesis method to compute the transitive closure $t(R)$: we form $R^2, R^4, \ldots$ in turn until $R^{2^k} \circ R^{2^k} = R^{2^k}$, and then $t(R) = R^{2^k}$.

Based on the actual situation, we choose an appropriate confidence level $\lambda$, $\lambda \in [0, 1]$, so as to get the $\lambda$-cut matrix of $t(R)$. Classifying according to this $\lambda$-cut matrix gives the equivalence classification at confidence level $\lambda$.
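The clustering step can be sketched in a few lines, assuming the usual max-min composition for fuzzy matrices. The function names and the sample matrix are hypothetical.

```python
def compose(a, b):
    """Max-min composition of two square fuzzy matrices."""
    n = len(a)
    return [
        [max(min(a[i][k], b[k][j]) for k in range(n)) for j in range(n)]
        for i in range(n)
    ]


def transitive_closure(r):
    """Square the matrix repeatedly (R, R^2, R^4, ...) until it is stable."""
    current = r
    while True:
        squared = compose(current, current)
        if squared == current:
            return current
        current = squared


def cut_classes(closure, level):
    """Equivalence classes of the cut matrix at the given confidence level."""
    classes, assigned = [], set()
    for i in range(len(closure)):
        if i in assigned:
            continue
        members = [j for j in range(len(closure)) if closure[i][j] >= level]
        assigned.update(members)
        classes.append(members)
    return classes


# Hypothetical fuzzy similar matrix of three investors.
R = [[1.0, 0.8, 0.3], [0.8, 1.0, 0.4], [0.3, 0.4, 1.0]]
closure = transitive_closure(R)
groups = cut_classes(closure, 0.5)
```

In this sketch a higher confidence level yields more, smaller classes: at level 0.9 every investor is alone, at 0.5 the first two investors merge, and at 0.4 all three form one class.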

An investor who is the target of stock recommendation is first added to the investor information database. We then subdivide the updated database into several groups with the fuzzy clustering method to determine which group the investor belongs to.

2.2. User-Based Collaborative Filtering Algorithm
2.2.1. Nearest Neighbors Search

The basic idea of the collaborative filtering algorithm [19, 20] based on user fuzzy clustering is to compare the behavioral similarity between the target investor and the other investors in the same cluster in order to find the set of neighbor investors with similar preferences more accurately. Then, according to the stock ratings of the nearest neighbors, we predict the stock ratings of the target investor and generate an optimized stock list.

First, we calculate the similarity between investors based on their historical scoring records. Common measures of the similarity between users include the Pearson correlation and cosine-based similarity. For the target investor $u$, we generate a neighbor set $N_u = \{u_1, u_2, \ldots, u_k\}$ that is ranked in descending order of the similarity $\mathrm{sim}(u, u_i)$, from $u_1$ to $u_k$. We then choose these $k$ investors as “neighbor users.” The size $k$ is determined in one of two ways: either every investor whose similarity exceeds a predetermined threshold is taken as a neighbor user, or a number of neighbors $k$ is fixed in advance and the top-$k$ most similar investors are taken as neighbor users.

By the nature of clustering, the target investor and his nearest neighbors belong to the same cluster, so there is no need to search for the nearest neighbors of the target investor in the whole user space. Therefore, the user-based clustering method can greatly improve the speed of the online nearest-neighbor search and meet the real-time requirement of a recommendation system.
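A minimal sketch of the neighbor search restricted to one cluster is given below. It uses cosine-based similarity over the stocks that two investors have both scored and supports both ways of fixing the neighbor set (a similarity threshold or a fixed number of neighbors). The investor names, ratings, and function names are hypothetical.

```python
import math


def cosine_similarity(ratings_a, ratings_b):
    """Cosine similarity over the stocks scored by both investors."""
    common = set(ratings_a) & set(ratings_b)
    if not common:
        return 0.0
    dot = sum(ratings_a[s] * ratings_b[s] for s in common)
    norm_a = math.sqrt(sum(ratings_a[s] ** 2 for s in common))
    norm_b = math.sqrt(sum(ratings_b[s] ** 2 for s in common))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def nearest_neighbors(target, cluster_ratings, k=None, threshold=None):
    """Rank the other members of the target's cluster by similarity."""
    scored = [
        (other, cosine_similarity(cluster_ratings[target], ratings))
        for other, ratings in cluster_ratings.items()
        if other != target
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    if threshold is not None:
        scored = [pair for pair in scored if pair[1] > threshold]
    if k is not None:
        scored = scored[:k]
    return scored


# Hypothetical ratings of the investors in the target's cluster only.
cluster = {
    "u": {"A": 5, "B": 3, "C": 4},
    "v1": {"A": 5, "B": 3, "C": 4, "D": 2},
    "v2": {"A": 1, "B": 5, "C": 2},
    "v3": {"A": 4, "B": 3, "D": 5},
}
top_two = nearest_neighbors("u", cluster, k=2)
```

Because only the members of one cluster are scanned, the cost of the search grows with the cluster size rather than with the size of the whole user space.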

2.2.2. Predict and Generate Stock Set

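With the above method, we obtain the nearest neighbors of the target investor and then refer to the neighbor investors’ actual scores for each stock to predict scores for the stocks that the target investor has not yet scored. The rating of investor $u$ on an unscored stock $i$ can be forecast by the following formula [21]:
$$P_{u,i} = \bar{R}_u + \frac{\sum_{n=1}^{k} \mathrm{sim}(u, n)\,(R_{n,i} - \bar{R}_n)}{\sum_{n=1}^{k} |\mathrm{sim}(u, n)|}.$$
The cosine-based similarity is as follows:
$$\mathrm{sim}(u, v) = \frac{\sum_{i \in I_{uv}} R_{u,i} R_{v,i}}{\sqrt{\sum_{i \in I_{uv}} R_{u,i}^2}\,\sqrt{\sum_{i \in I_{uv}} R_{v,i}^2}}, \tag{6}$$
where $\bar{R}_u$ denotes target investor $u$’s average rating for all stocks, $m$ denotes the number of investors, $k$ is the number of nearest neighbors, $\mathrm{sim}(u, v)$ (see formula (6)) denotes the level of similarity between investor $u$ and investor $v$, $I_{uv}$ denotes the set of items scored by both investor $u$ and investor $v$, and $R_{u,i}$ and $R_{v,i}$ denote the scores of investor $u$ and investor $v$ for stock $i$, respectively. $R_{n,i}$ denotes neighbor investor $n$’s actual score for stock $i$, and $\bar{R}_n$ denotes neighbor investor $n$’s average rating for all stocks.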

Finally, according to the predicted values of the investor’s interest in the known and unknown stocks, we select the top-$N$ rated stocks as the stock set of greatest interest.
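The prediction and selection steps can be sketched as follows. The prediction is the target investor’s average rating plus the similarity-weighted average deviation of the neighbors’ scores from their own averages. As a design choice of this sketch, neighbors who have not scored a stock are skipped for that stock. All names and numbers are hypothetical.

```python
def mean_rating(ratings):
    return sum(ratings.values()) / len(ratings)


def predict_rating(target, stock, ratings, neighbors):
    """Target's mean plus the similarity-weighted mean neighbor deviation."""
    numerator = 0.0
    denominator = 0.0
    for other, similarity in neighbors:
        if stock in ratings[other]:  # skip neighbors without a score
            deviation = ratings[other][stock] - mean_rating(ratings[other])
            numerator += similarity * deviation
            denominator += abs(similarity)
    base = mean_rating(ratings[target])
    return base if denominator == 0 else base + numerator / denominator


def top_n_stocks(target, ratings, neighbors, n):
    """Rank known stocks by actual score and unknown ones by prediction."""
    scores = dict(ratings[target])
    for other, _ in neighbors:
        for stock in ratings[other]:
            if stock not in scores:
                scores[stock] = predict_rating(target, stock, ratings, neighbors)
    return sorted(scores, key=lambda s: (-scores[s], s))[:n]


# Hypothetical ratings and precomputed neighbor similarities.
ratings = {
    "u": {"A": 4, "B": 2},
    "v1": {"A": 5, "B": 3, "C": 4},
    "v2": {"A": 4, "B": 1, "C": 1, "D": 2},
}
neighbors = [("v1", 0.9), ("v2", 0.6)]
predicted_c = predict_rating("u", "C", ratings, neighbors)
best = top_n_stocks("u", ratings, neighbors, 3)
```

Subtracting each neighbor’s own average before weighting compensates for investors who systematically score high or low.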

3. Utilizing Money Flow for Stock Recommendation

Both online stock recommendation methods based on stock comments and price forecasting models based on mathematical analysis have drawbacks. Given this, we propose a new method for stock recommendation: recommending stocks based on money flow.

Money flow is an important indicator for stock investment because it partly indicates the direction of future stock price movements [22]. Judging money flow is complex and not easy to master. Therefore, we establish a new model [23] based on the M/G/1 queue to study money flow, which lets us infer the direction of future stock price fluctuation. In addition, we consider the stock-buying preferences of different groups of investors so that appropriate personalized stock recommendations can be made.

3.1. Model Description and Embedded Markov Chain

We introduce exhaustive service and multiple vacations into the classical M/G/1 queueing system, together with a close-down time policy. The server works as follows. After completing the service of the last customer in the system, the server enters a close-down period of length $C$. If no customer arrives during the close-down period, the server takes a vacation of length $V$ when the close-down period ends. If at least one customer arrives during the close-down period, the server ends the close-down period and serves customers until no customer is waiting in the system (the exhaustive service rule); it then enters a close-down period again and repeats the above process. If customers are in the system at the completion of a vacation, the server stops taking vacations and starts a new busy period. If no customer is waiting at the completion of a vacation, the server takes another vacation with the same independent distribution, and it continues to do so until customers are waiting at the completion of a vacation; the system then starts a new busy period. The PGF and LST of the stationary queue length and waiting time and their stochastic decomposition results are given. The system has a single server with the FCFS (first-come-first-served) queue discipline [24]. We assume that customers arrive according to a Poisson process with rate $\lambda$ and that the service time $B$ is a nonnegative random variable with mean $1/\mu$, general distribution function $B(t)$, and LST $B^*(s)$. The distribution functions of the vacation time and the close-down time are denoted by $V(t)$ and $C(t)$, with LSTs $V^*(s)$ and $C^*(s)$, respectively. The interarrival times, service times, vacation times, and close-down times are mutually independent.

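Let $L_n$ be the number of customers in the system just after the $n$th customer departs, and let $Q_b$ be the number of customers at the start of a busy period. Then $\{L_n, n \ge 1\}$ is the embedded Markov chain at the departure instants, where
$$L_{n+1} = \begin{cases} L_n - 1 + A, & L_n \ge 1, \\ Q_b - 1 + A, & L_n = 0. \end{cases}$$
The distribution of $Q_b$, written $q_j = P(Q_b = j)$, $j \ge 1$, is determined by the vacation time and close-down time distributions. The transition probability matrix of the Markov chain is given by
$$P = \begin{pmatrix} b_0 & b_1 & b_2 & \cdots \\ a_0 & a_1 & a_2 & \cdots \\ & a_0 & a_1 & \cdots \\ & & \ddots & \ddots \end{pmatrix},$$
where $b_j = \sum_{i=1}^{j+1} q_i a_{j+1-i}$, $j \ge 0$, $A$ denotes the number of customers arriving during a service period, and its probability distribution is given by $a_j = P(A = j) = \int_0^\infty \frac{(\lambda t)^j}{j!} e^{-\lambda t}\, dB(t)$, $j \ge 0$, with $\lambda$ the arrival rate and $B(t)$ the service time distribution function. By Foster’s criterion, the Markov chain is positive recurrent if and only if $\rho < 1$, where $\rho$ is the arrival rate multiplied by the mean service time; the proof can be found in [25].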
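To make the arrival distribution during a service period concrete, the sketch below evaluates it numerically under the simplifying assumption that the service time takes finitely many values, so that the integral over the service time distribution becomes a finite mixture of Poisson probabilities. The function names and parameter values are hypothetical.

```python
import math


def arrivals_during_service(arrival_rate, service_dist, max_j):
    """P(A = j), j = 0..max_j, for Poisson arrivals during one service.

    service_dist is a list of (duration, probability) pairs, an assumed
    discrete stand-in for the general service time distribution.
    """
    probs = []
    for j in range(max_j + 1):
        a_j = sum(
            p * math.exp(-arrival_rate * t) * (arrival_rate * t) ** j
            / math.factorial(j)
            for t, p in service_dist
        )
        probs.append(a_j)
    return probs


def traffic_intensity(arrival_rate, service_dist):
    """Arrival rate multiplied by the mean service time."""
    return arrival_rate * sum(t * p for t, p in service_dist)


# Hypothetical parameters: rate 0.5, service lasts 1 or 2 time units.
service = [(1.0, 0.5), (2.0, 0.5)]
rho = traffic_intensity(0.5, service)
a = arrivals_during_service(0.5, service, 30)
is_stable = rho < 1
```

The mean of this distribution equals the traffic intensity, which is why the stability condition can be read as "on average, fewer than one customer arrives per service".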

3.2. Stochastic Decomposition of Stationary Queue Length and Waiting Time

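Theorem 1. If $\rho < 1$, the stationary queue length $L_v$ can be decomposed into the sum of two independent random variables; namely, $L_v = L_0 + L_d$, in which $L_0$ denotes the stationary queue length of the classical M/G/1 queue without vacations, whose PGF is given by
$$L_0(z) = \frac{(1 - \rho)(1 - z) B^*(\lambda(1 - z))}{B^*(\lambda(1 - z)) - z},$$
and the probability generating function (PGF) of the additional queue length $L_d$ is given by
$$L_d(z) = \frac{1 - Q_b(z)}{E(Q_b)(1 - z)},$$
where $Q_b(z)$ and $E(Q_b)$ are the PGF and the mean of the number of customers $Q_b$ at the start of a busy period, $\lambda$ is the arrival rate, $B^*(s)$ is the LST of the service time, and $\rho$ is the arrival rate multiplied by the mean service time.

Theorem 2. If $\rho < 1$, the stationary waiting time $W_v$ can be decomposed into the sum of two independent random variables; namely, $W_v = W_0 + W_d$, in which $W_0$ denotes the stationary waiting time of the classical M/G/1 queue without vacations, whose Laplace-Stieltjes transform (LST) has been given in [26] by
$$W_0^*(s) = \frac{(1 - \rho) s}{s - \lambda(1 - B^*(s))}.$$
The LST of the additional delay $W_d$ is given by
$$W_d^*(s) = \frac{\lambda \left[ 1 - Q_b(1 - s/\lambda) \right]}{E(Q_b)\, s},$$
with the same notation as in Theorem 1.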
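The decomposition of the queue length can be illustrated numerically. The sketch below assumes that the additional queue length has the standard form for an M/G/1 queue whose busy periods start with $Q_b$ customers, that is, PGF $(1 - Q_b(z)) / (E(Q_b)(1 - z))$, which gives the mean $E[Q_b(Q_b - 1)] / (2E(Q_b))$. The classical part uses the Pollaczek-Khinchine mean. The distribution of $Q_b$ and all parameter values are hypothetical.

```python
def mean_classical_queue_length(arrival_rate, service_mean, service_second_moment):
    """Pollaczek-Khinchine mean queue length of the classical M/G/1 queue."""
    rho = arrival_rate * service_mean
    if rho >= 1:
        raise ValueError("the queue is unstable unless rho < 1")
    return rho + arrival_rate ** 2 * service_second_moment / (2 * (1 - rho))


def mean_additional_queue_length(start_dist):
    """Mean additional queue length from the distribution of Q_b.

    start_dist maps j >= 1 to P(Q_b = j); the formula used is an assumed
    standard form, E[Q_b (Q_b - 1)] / (2 E[Q_b]).
    """
    mean_qb = sum(j * p for j, p in start_dist.items())
    second_factorial = sum(j * (j - 1) * p for j, p in start_dist.items())
    return second_factorial / (2 * mean_qb)


# Hypothetical example: exponential service with mean 1 (second moment 2).
classical = mean_classical_queue_length(0.5, 1.0, 2.0)
additional = mean_additional_queue_length({1: 0.7, 2: 0.2, 3: 0.1})
total = classical + additional
```

If every busy period starts with exactly one customer, the additional term vanishes and the model reduces to the classical queue, which is a quick sanity check on the decomposition.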

From the results of the above money flow model, we can infer the direction of future stock price movements. Generally speaking, the price of a stock with large capital inflows will continue to rise. Conversely, the share price gradually declines under an ongoing outflow of funds. Then, according to the indicated money flow results, we select the optimal stocks from the above top-$N$ recommendation set to recommend to investors.
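The final filtering step can be sketched as below. It assumes that the money flow model has already produced a series of net money flows per stock (positive for inflow, negative for outflow) and, as a simplification of this sketch, keeps only the candidates with a positive cumulative net inflow, ranked from largest to smallest. The function name and data are hypothetical.

```python
def select_by_money_flow(candidates, net_flows):
    """Keep candidates with positive cumulative net inflow, largest first.

    candidates: stocks from the top-N collaborative filtering list.
    net_flows: stock -> list of periodic net money flows (assumed input).
    """
    inflow_stocks = [s for s in candidates if sum(net_flows.get(s, [])) > 0]
    return sorted(inflow_stocks, key=lambda s: sum(net_flows[s]), reverse=True)


# Hypothetical weekly net flows for three candidate stocks.
candidates = ["A", "B", "C"]
flows = {
    "A": [1.0, 2.0, -0.5],
    "B": [-1.0, -2.0, 0.5],
    "C": [3.0, 1.0, 1.0],
}
recommended = select_by_money_flow(candidates, flows)
```

Stock B is dropped because funds are flowing out of it overall, even though the investor’s predicted interest placed it in the candidate list.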

4. Simulation Experiment

4.1. Experimental Data and Method

In this section, we empirically study the performance of the proposed method. The market data come from the real stock market. However, because shareholders’ historical transaction information involves the privacy of individual investments, simulated investor data are used in this experiment. The basic data used to calculate money flow are freely available market data between 2013/9/2 and 2013/9/30. To keep the data consistent, we removed the stocks that were open for trading on fewer than five days during a five-day trading period.

We use stock return as the stock assessment indicator and analyze the investments of one target investor in order to compare the returns obtained under different stock recommendation methods. Whether the proposed recommendation method is profitable is judged by stock yields: the higher the yields the investor gets, the better the recommendation method performs.
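The assessment indicator can be sketched as a simple rate of return. The text does not state the exact formula, so this sketch assumes the plain holding-period return, (sell price minus buy price) divided by buy price, and a cost-weighted version for several positions. All names and prices are hypothetical.

```python
def rate_of_return(buy_price, sell_price):
    """Simple holding-period return of one stock (assumed definition)."""
    return (sell_price - buy_price) / buy_price


def portfolio_return(positions):
    """Return of several positions given as (buy, sell, shares) tuples."""
    cost = sum(buy * shares for buy, _, shares in positions)
    value = sum(sell * shares for _, sell, shares in positions)
    return (value - cost) / cost


# Hypothetical prices for two stocks held over the test period.
single = rate_of_return(10.0, 11.0)
combined = portfolio_return([(10.0, 11.0, 100), (20.0, 19.0, 50)])
```

Comparing `combined` across recommendation methods for the same investor and period is one way to read the return comparison described above.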

4.2. Data Preprocessing

First, 1000 investors are clustered according to their information while the clustering threshold is adjusted. If the threshold is set too high, there are fewer classes, which increases the complexity of computing user neighbors. Conversely, if the threshold is set too low, there are more classes, which decreases the accuracy of computing user neighbors. Therefore, we chose an appropriate clustering threshold through several trials in the experiment, and the 1000 investors are divided into five categories (see Table 1).

After the clustering is complete, we use the user-based collaborative filtering recommendation method to find the nearest neighbors and the preferred stock set. We chose one investor as the target for analysis and found that the target investor belongs to cluster 1 after clustering. We use the cosine-based similarity method to calculate the similarity between the target investor and the other members of the same cluster in order to find the nearest neighbors (see Table 2).

From the proofs and the results of the above experiment, we know that, by the nature of clustering, the target investor and his nearest neighbors belong to the same cluster, and there is no need to search for the nearest neighbors of the target investor in the whole user space. Thus, fuzzy clustering can greatly improve the speed of the online nearest-neighbor search and meet the real-time requirement of a recommendation system.

According to the stock ratings evaluated for the target investor, we select the stocks with higher scores as the candidates (see Table 3).

We study several stocks with higher scores by using the proposed money flow model to predict their trends during the interval from 2013/9/2 to 2013/9/30. We then use the model to filter the candidate stocks, and the stocks with higher investment value are selected as the recommendation set (see Table 4).

4.3. Experimental Results

In this part, we apply the above model to study the effectiveness of the stock recommendation. We select several candidate stocks at random and process their weekly money flow data with the proposed method to predict stock movements. By observing the relationship between the money flow and the closing price, we can verify whether the proposed method can predict stock movements. We take China Unicom, Vanke A, and Merchants Bank as the research stocks (see Figure 1).

Figure 1 shows the weekly cash flow of China Unicom. The red bars at the bottom indicate cash inflows, and the blue bars at the bottom indicate cash outflows. The green line indicates the fluctuation of the closing price. If the red bars begin to shorten, the stock price will move lower. If the blue bars begin to shorten, the stock price will move higher. When a blue bar changes to a red bar, the stock price will rise and we can follow the trend.
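The reading rules for the bars can be written down as a small function. This is a sketch under the assumption that each week is summarized by one net money flow value, positive for an inflow (red bar) and negative for an outflow (blue bar); cases the text does not cover return "hold". The function name and labels are hypothetical.

```python
def flow_signal(previous_flow, current_flow):
    """Expected price direction from two consecutive weekly net flows."""
    if previous_flow < 0 and current_flow > 0:
        return "up"  # blue bar changes to red bar
    if 0 < current_flow < previous_flow:
        return "down"  # red bar begins to shorten
    if previous_flow < current_flow < 0:
        return "up"  # blue bar begins to shorten
    return "hold"  # not covered by the stated rules


# Hypothetical weekly net flows of one stock.
weekly_flows = [-3.0, -1.0, 2.0, 1.5]
signals = [
    flow_signal(prev, cur) for prev, cur in zip(weekly_flows, weekly_flows[1:])
]
```

For the sample series the outflow first weakens, then turns into an inflow, and finally the inflow weakens, which gives two "up" signals followed by one "down" signal.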

Figure 2 shows the weekly cash flow of Vanke A, and Figure 3 shows the weekly cash flow of Merchants Bank. We can see that the price of a stock with large capital inflows moves higher and that the share price gradually declines under an ongoing outflow of funds. These results further illustrate the effectiveness of the proposed algorithm. Money flow reflects the variation of stock prices, and we can choose stocks with high investment value based on the money flow indicator.

According to the above research results, we assume that the target investor purchased two stocks in September 2013. Different stock recommendation methods bring different investment returns (see Figure 4). Figure 4 compares the rates of return. Under the traditional recommendation method, the target investor bought the stocks of China Unicom and Vanke A over the five weeks, and the red line indicates the rate of return of these two stocks. If the target investor had instead bought the stocks of China Unicom and Merchants Bank under the proposed recommendation method, the investor would have received a much higher return, as the blue line indicates.

Figure 5 compares the returns obtained by different numbers of investors who used the proposed recommendation method and the traditional recommendation method, respectively, in September 2013. The blue line indicates the returns of investors using the proposed recommendation method, and the red line indicates the returns of investors using the traditional recommendation method. The proposed recommendation method clearly brings higher returns than the traditional method. The comparison across different numbers of investors thus supports the universality and effectiveness of the proposed recommendation method.

The above experimental results show that cash flow is very useful in practical investment. It can guide us effectively in determining market trends.

5. Conclusions

In this paper, we proposed a personalized stock recommendation method based on a money flow model. The proposed method not only satisfies investors’ investment preferences but also filters out stocks with low investment value so that investors can choose the more valuable stocks and obtain higher returns. The experimental results show that, compared with the traditional recommendation method, the proposed method can analyze and filter the returns of the recommended stocks and improve the investment benefits of investors.

In the future, to increase the accuracy of recommendation, we will choose the most comprehensive and representative investor characteristic indexes so as to obtain finer investor segmentation results. Considering how changes in investors’ preferences affect investor segmentation, we will also improve the recommendation method accordingly. Furthermore, the proposed recommendation model will be applied to more stock markets to recommend stocks with higher investment value to investors.

Conflict of Interests

The authors declare that they have no conflict of interests in connection with the work submitted.

Acknowledgments

The project was supported by the National Natural Science Foundation of China (no. 61402185 and no. 61370186), Science and Technology Application Foundation Program of Guangzhou (no. 2014J4100032), Ministry of Education and China Mobile Research Fund (no. MCM20121051), and 2012 Knowledge Construction Special Foundation Item of Guangdong Institution of Higher Learning College and University (no. 2012KJCX0079).