天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

基于特征抽取和分步回歸算法的資金流入流出預測模型

發(fā)布時間:2018-05-25 06:41

  本文選題:資金流預測 + 特征抽取; 參考:《中國科學技術大學》2017年碩士論文


【摘要】:商業(yè)公司的金融平臺往往擁有千萬乃至上億位服務會員,公司的金融業(yè)務場景每天必然會涉及大量的資金流入和流出,面對如此龐大的金融數(shù)據(jù),資金管理壓力會非常大。在既保證資金流動性風險最小,又滿足日常業(yè)務運轉(zhuǎn)的情況下,精準地預測資金的流入流出情況顯得尤為重要。但金融數(shù)據(jù)的變動往往受社會,政治,經(jīng)濟,重大事件等多方面因素影響,數(shù)據(jù)趨勢不穩(wěn)定而且包含多噪聲,給資金流量的預測帶來了困難。本文以金融平臺用戶的資金流量預測為研究背景,旨在構建一個準確、有效的資金流入流出的預測模型,以最大程度上貼近資金流量的真實值,便于資金管理。本文的主要研究內(nèi)容與成果如下:1.本文針對資金流入流出數(shù)據(jù)集初始特征不明顯的特點,利用特征抽取方法挖掘出相關特征,并采取特征選擇策略選出最優(yōu)特征子集。主要是從時間、用戶、利率三個不同角度構造與目標值相關的多個特征,再利用皮埃爾相關系數(shù)法進行初步篩選出最為相關的特征。隨后用特征選擇策略進一步篩選,剔除次相關特征和冗余特征,形成最優(yōu)特征子集。實驗結(jié)果表明,特征抽取方法所選的特征子集對不同回歸算法的預測效果的影響不同,在最終申購值的12列特征、贖回值的10列特征時達到最佳子集,對大多數(shù)不同的回歸算法可以得到較好的預測效果。因此可以確定此特征子集作為下一步算法預測的最優(yōu)特征子集。2.為解決數(shù)據(jù)集不穩(wěn)定,多噪聲的問題,采用分步回歸算法對特征子集進行訓練學習,提高回歸預測準確率。本文提出的是兩步特征預測方法,即單步特征預測是運用灰度預測、時間序列算法對未來時間的未知特征進行預測,將預測的特征添加到未來時段的已知特征子集中。隨后結(jié)合BP神經(jīng)網(wǎng)絡對所有特征集合進行訓練建模,得到最終的預測結(jié)果。將該算法與集成學習方法對比,運用基于Adaboost的梯度提升回歸樹和基于Bagging的隨機森林回歸算法分別對數(shù)據(jù)集進行訓練。由實驗結(jié)果分析,發(fā)現(xiàn)兩步特征預測算法較其他算法減小了預測誤差,部分算法比集成學習方法的預測效果更佳。3.本文對離散類型的特征子集進行one-hot稀疏編碼,考慮因子分解機算法在處理稀疏數(shù)據(jù)集時作用顯著,運用該算法進行回歸預測。由于因子分解機算法可以較好地表達變量間的相互作用,相當于在原有特征變量的基礎上還增加了二次交叉特征,更好地刻畫數(shù)據(jù)集的特點。此外,因子分解機的算法復雜度不太高,且運行效率高。實驗表明,因子分解機算法在一定程度上可以提高資金流入流出量的預測準確率。
[Abstract]:The financial platform of a commercial company often has tens of millions or even hundreds of millions of service members. The financial business scenario of the company is bound to involve a large amount of capital inflow and outflow every day. In the face of such huge financial data, the pressure of capital management will be very great. It is very important to predict the inflow and outflow of funds accurately under the condition that the liquidity risk is minimum and the daily business operation is satisfied. However, the change of financial data is often affected by social, political, economic, major events and other factors. The trend of data is unstable and contains many noises, which makes it difficult to predict the flow of funds. The purpose of this paper is to construct an accurate and effective forecasting model of capital inflow and outflow in order to get close to the real value of capital flow to the greatest extent and to facilitate capital management. The main contents and results of this paper are as follows: 1. In view of the fact that the initial features of the inflow and outflow data sets are not obvious, this paper uses the feature extraction method to find out the relevant features, and adopts the feature selection strategy to select the optimal feature subset. Several features related to the target value are constructed from three different angles of time, user and interest rate, and the most relevant features are preliminarily selected by using Pierre correlation coefficient method. Then the feature selection strategy is used to further screen the subcorrelation feature and redundant feature to form the optimal feature subset. The experimental results show that the feature subset selected by the feature extraction method has different effects on the prediction effect of different regression algorithms, and reaches the best subset when the final purchase value is 12 column feature, the redemption value is 10 column feature. Good prediction results can be obtained for most different regression algorithms. Therefore, this feature subset can be determined as the optimal feature subset. 2. In order to solve the problem of unstable and noisy data sets, stepwise regression algorithm is used to train and learn feature subsets to improve the accuracy of regression prediction. In this paper, a two-step feature prediction method is proposed, that is, single-step feature prediction is based on gray prediction, time series algorithm is used to predict unknown features of future time, and the predicted features are added to the subset of known features in the future period. Then the BP neural network is used to train and model all the feature sets, and the final prediction results are obtained. The algorithm is compared with the ensemble learning method, and the data sets are trained by using the gradient lifting regression tree based on Adaboost and the stochastic forest regression algorithm based on Bagging. By analyzing the experimental results, it is found that the two-step feature prediction algorithm reduces the prediction error compared with other algorithms, and some of the algorithms have better prediction effect than the integrated learning method. In this paper, one-hot sparse coding for discrete feature subsets is carried out, and the factor factoring algorithm is used to predict the sparse data sets. Because the factoring machine algorithm can better express the interaction between variables, it is equivalent to the addition of quadratic cross features on the basis of the original feature variables, which can better describe the characteristics of the data set. In addition, the algorithm complexity of factoring machine is not too high, and the running efficiency is high. The experimental results show that the factor factoring algorithm can improve the accuracy of the forecast of the inflow and outflow of funds to some extent.
【學位授予單位】:中國科學技術大學
【學位級別】:碩士
【學位授予年份】:2017
【分類號】:F224;F832.39;F724.6

【參考文獻】

相關期刊論文 前10條

1 徐子偉;王傳啟;王鵬;黃海;;基于分步特征提取和組合分類器的電信客戶流失預測模型[J];微型機與應用;2016年13期

2 王子豪;徐桂瓊;;基于高階偏差的因子分解機推薦算法[J];計算機應用研究;2017年02期

3 高曉波;方獻梅;李石君;;基于因子分解機的信任感知商品推薦[J];山東大學學報(理學版);2016年01期

4 張玲;劉波;;基于殘差統(tǒng)計的時間序列加性離群點檢測算法研究[J];電子技術應用;2015年09期

5 胡亞慧;李石君;余偉;楊莎;方其慶;;一種結(jié)合文化和因子分解機的快速評分預測方法[J];南京大學學報(自然科學);2015年04期

6 張勇;趙曉輝;;人工神經(jīng)網(wǎng)絡結(jié)合灰度關聯(lián)分析用于吉林省地方稅收收入預測研究[J];電腦知識與技術;2015年20期

7 肖蘇;熊焱;;基于灰度統(tǒng)計和神經(jīng)網(wǎng)絡的物流業(yè)稅收預測模型[J];物流技術;2013年23期

8 劉斌;;淺析營業(yè)稅改征增值稅的影響[J];哈爾濱師范大學社會科學學報;2012年05期

9 謝小璐;;基于小波神經(jīng)網(wǎng)絡的Shibor預測研究[J];金融理論與實踐;2012年08期

10 車金星;王廣富;;基于粒子群最優(yōu)化下BP神經(jīng)網(wǎng)絡的短期電價預測(英文)[J];南昌工程學院學報;2012年01期

相關博士學位論文 前1條

1 李紅權;資本市場的非線性動力學特征與風險管理研究[D];湖南大學;2005年

相關碩士學位論文 前10條

1 韓莉;基于LM-BP神經(jīng)網(wǎng)絡股票預測研究[D];東北農(nóng)業(yè)大學;2016年

2 徐子偉;基于分步特征選擇和組合分類器的電信客戶流失預測模型[D];中國科學技術大學;2016年

3 張方;基于BP神經(jīng)網(wǎng)絡的稅收預測研究[D];長安大學;2016年

4 林慶添;基于人工智能算法的上海銀行間同業(yè)拆放利率預測[D];蘭州大學;2016年

5 樊嬌;省級電網(wǎng)公司融資預測分析研究[D];華北電力大學(北京);2016年

6 師小偉;基于人工智能優(yōu)化的組合模型在銀行間拆借利率預測中的應用研究[D];蘭州大學;2014年

7 肖堅;基于隨機森林的不平衡數(shù)據(jù)分類方法研究[D];哈爾濱工業(yè)大學;2013年

8 余秋宏;基于因子分解機的社交網(wǎng)絡關系推薦研究[D];北京郵電大學;2013年

9 袁小星;基于支持向量機集成的高新技術企業(yè)財務預警研究[D];哈爾濱工業(yè)大學;2012年

10 彭曉;基于灰色Logistic回歸的上市公司財務困境預測研究[D];重慶理工大學;2010年

,

本文編號:1932472

資料下載
論文發(fā)表

本文鏈接:http://www.sikaile.net/jingjilunwen/guojimaoyilunwen/1932472.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權申明:資料由用戶70a4b***提供,本站僅收錄摘要或目錄,作者需要刪除請E-mail郵箱bigeng88@qq.com