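This book discusses how to use SAS to address specific financial research questions, especially those involving large amounts of data. It assumes that readers already know SAS very well, so it does not cover the basics. Although the SAS code in this book has been carefully edited to suit big data processing, its building blocks are as simple and elementary as those in introductory SAS books. Do not be misled by the simple form of the code, however. To take full advantage of SAS's ability to process big data, a deep understanding of how SAS runs is essential. This book revisits all the basic coding techniques in SAS from a different, deeper perspective, so that readers are equipped to carry out big data analysis.
The first chapter covers how to set up SAS to begin big data analysis, including accurately importing data from various data sources and different computer platforms, SAS coding efficiency, and how to make code more robust. The second chapter reviews useful tools such as loops, grouping, and summarizing. The third chapter discusses how to manipulate tables in various research scenarios. The fourth chapter is devoted to macros, which are indispensable for repetitive research work. The last three chapters discuss how to conduct specific kinds of research, such as panel data regressions that do not satisfy the standard regression assumptions, mutual fund research, and challenging market microstructure research. Each of the last three chapters also discusses the methods and conclusions of the literature in that field.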

Preface
Financial research relies extensively on data. Mastering a statistical tool capable of handling the huge quantity of financial data is a necessary skill for every financial empiricist. There are many such tools, for example, Matlab, Stata, R, SQL, and Python. As a veteran in the field of empirical financial research, I have been using SAS for quite a long time. Over those long years, I have gradually come to appreciate SAS's powerful yet smart ability to help me navigate the ocean of financial data. I should admit that I have very limited, though not zero, knowledge of other statistical software. Nevertheless, I still feel obliged to compare SAS with other software in the context of financial research. The most distinctive advantage of SAS over other software is its ability to handle big data. This advantage is all the more meaningful when it comes to financial data. Let me explain why.
Big data analysis has become trendy during the past five to ten years, largely due to the rapid development of information technology. To analyze data, we need to find the data in the first place. In this regard, financial data have long been collected, compiled, and distributed in a systematic, thorough, and scientific way. Most financial data originate from exchanges and from the periodic reports that companies are legally required to publish. In many countries, financial information follows a uniform format. These features make financial data especially easy to collect and convert into commercial databases. Many providers offer such databases, such as WRDS, Thomson Reuters, CSMAR, and WIND. The Nobel laureate Professor Eugene Fama once mentioned that he had been using the WRDS database to conduct research since the 1970s. In this sense, big data has been in place in financial research for about half a century, long ahead of the recent boom in big data analysis. To some degree, financial data define the research topics, methodologies, and even sub-disciplines of today's financial research.
According to my personal observation, researchers' choice of statistical software in university business and economics schools varies from school to school. Interestingly, those in finance departments are more likely to choose SAS, while those in economics and econometrics are more likely to choose Stata. This pattern has a reason. SAS treats data as a table, which it stores and processes line by line. Therefore, SAS can in theory deal with any number of lines, although processing necessarily takes a long time once the data become prohibitively large. In contrast, Stata treats data as a matrix, which requires, at least in theory, reading all the data into the computer's memory before processing starts. As a result, Stata's ability to handle data is only as powerful as the computer's memory is large. However, Stata's treatment of data as a matrix has a deeper rationale: most modern econometric models are expressed in matrices, so processing data in matrix form is more natural in econometric research. Partly for this reason, when my students ask me why I choose SAS over Stata, I often answer, half seriously and half jokingly: "That is because I am not an econometrician."
I can add another interesting observation to corroborate my argument that econometricians and those with strong econometric backgrounds tend to choose Stata. A similar pattern shows up among students, depending on the background of the professor who teaches them econometrics. Empirical research methods have become compulsory courses in many business, economics, and finance programs at the undergraduate, graduate, and doctoral levels in many universities. However, the professors teaching econometrics have different backgrounds. In many schools, it is an econometrics professor who teaches students the concepts and methodologies of empirical research, whether or not the students major in econometrics. Needless to say, econometricians teaching econometrics can provide students with the most advanced, thorough, and rigorous knowledge of the field. But when it comes to applying empirical research methods to a specific economics or finance question, researchers in that particular field typically prefer certain econometric methods. Top-tier economics and finance journals often reject papers whose main contribution is merely to apply a better econometric method to an old research question. In other words, high-quality economics and finance research puts more weight on ideas than on econometric techniques. Therefore, a new norm has gradually emerged in which the econometrics course is taught by an economics or finance professor who does not major in econometrics, statistics, or math, but specializes in specific research areas. Over the past years, I have taught and worked with many students. It seems that those who learn econometrics from professors majoring in econometrics tend to choose Stata, while those who learn econometrics from professors majoring in finance tend to choose SAS.
Given that SAS does not treat data as matrices, it seems to lose to Stata in incorporating the newest statistical methodologies into the software in a timely manner. But on the flip side, the nearly unlimited ability to process any number of lines of data does make SAS more suitable, and in certain studies the only choice, for financial research. Those who have no experience in handling financial data may not fully appreciate its massive quantity. Let me put it in perspective. Compared to the U.S. financial market, the Chinese financial market has a very short history. We have just celebrated the thirtieth anniversary of the Chinese stock market as I finish writing this book. Even so, there are already more than 47 million trading-day observations for all Chinese bonds, and more than 11 million trading-day observations for all Chinese stocks. For microstructure (tick-by-tick) data, the number of observations is thousands of times larger than for daily data. The exceedingly large size of financial data poses a series of challenges. For example, sorting data is a routine process, yet most personal computers will have difficulty sorting a data set with hundreds of millions of observations by several variables. Matching data is another simple yet challenging task for big data. The typical way of matching data is to first form a Cartesian product and then eliminate the pairs that do not match. A Cartesian product of two tables with x and y lines generates a temporary table with x times y lines. Imagine how daunting the task would be if you were trying to match two tables that each have 10 million lines: the temporary table would have 100 trillion lines. In these scenarios, we need to find smarter ways to conduct the data processing. Fortunately, SAS provides many handy tools for this purpose.
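To make the cost of Cartesian-product matching concrete, here is a minimal sketch in Python rather than SAS. It is only an illustration of the general idea and does not show how SAS itself performs a match. The table contents, ticker codes, and the function names `cartesian_match` and `hash_match` are all hypothetical. The first function pairs every line of one table with every line of the other and then filters; the second builds a lookup index on one table first, which is one common "smarter way" to avoid the x times y blow-up.

```python
from collections import defaultdict
from itertools import product


def cartesian_match(left, right):
    """Pair every left row with every right row, then keep equal keys.

    Returns the matched rows and the number of row pairs examined,
    which is always len(left) * len(right).
    """
    comparisons = 0
    matched = []
    for l_row, r_row in product(left, right):
        comparisons += 1
        if l_row[0] == r_row[0]:
            matched.append((l_row[0], l_row[1], r_row[1]))
    return matched, comparisons


def hash_match(left, right):
    """Index the right table by key once, then look up each left row."""
    index = defaultdict(list)
    for key, value in right:
        index[key].append(value)
    matched = []
    for key, l_value in left:
        # .get avoids inserting empty entries for keys with no match
        for r_value in index.get(key, []):
            matched.append((key, l_value, r_value))
    return matched


# Hypothetical toy tables: (ticker, price) rows.
trades = [("000001", 10.5), ("000002", 20.1), ("600000", 8.3)]
quotes = [("000002", 20.0), ("600000", 8.2), ("600519", 1700.0)]

pairs, n_compared = cartesian_match(trades, quotes)
print(n_compared)                             # 3 * 3 row pairs examined
print(sorted(pairs) == sorted(hash_match(trades, quotes)))
```

Both functions return the same matched rows, but the first examines every one of the x times y pairs, while the second makes one pass over each table plus a lookup per row. The index must fit in memory, so this is a trade-off rather than a free improvement.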
This book summarizes my experience in using SAS to conduct financial research on big data. One of my research areas is market microstructure, which studies tick-by-tick data on trades and quotes. As mentioned above, market microstructure data are especially large and hence especially demanding for researchers. I often wait a whole day to get one result. Although often frustrated and disappointed by the tedious calculation process, I have learned a lot from these studies. Most of all, I have gradually mastered the skills that enable me to decipher the regularities hidden in the tremendous amount of data. There are many books discussing how to use SAS, and I do not intend to add to this long list. This book instead focuses on how to use SAS to conduct sophisticated financial research, especially in the context of big data. I assume that readers already understand the basics of SAS. The topics of this book mainly cover advanced research and coding issues that are seldom discussed in general-purpose and introductory-level SAS books. I hope the experience shared in this book can help you conduct high-quality financial research.
Han Yan was supported by the National Natural Science Foundation of China under grant number 71772013.

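Han Yan holds a PhD in Management from Renmin University of China and is an associate professor and chair of the Department of Economics in the School of Humanities and Social Sciences at Beijing Institute of Technology, where she also supervises master's students. She has published more than ten academic papers in various Chinese- and English-language journals. She has been principal investigator on two National Natural Science Foundation of China projects and one Ministry of Education project, and has taken part in several key National Natural Science Foundation of China research projects. She mainly teaches undergraduate and graduate courses in international finance, financial management, and financial economics. Her graduate teaching covers a wide range of topics at the frontier of financial research, much of it concerning empirical research methods. Most financial research today is empirical, which means researchers must be able to use statistical software. She chose SAS partly by chance, but mostly because of SAS's ability to process big data. Over more than a decade of empirical research, she has accumulated extensive experience in handling large volumes of financial data. One of her research areas is market microstructure, a field that places demands on researchers' data analysis ability, so she has many skills, insights, and suggestions about data processing to share with the younger generation of financial researchers. Financial data are vast, and the more data are brought together, the more knowledge can potentially be gained. However, the sheer size of financial data makes analyzing them entirely different from analyzing small data. For example, a simple sorting task becomes extremely difficult on 1 TB of data. Handling big data effectively therefore requires a different set of skills. Han Yan's long experience in empirical financial research will help readers committed to big data analysis.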
