Recent Trends in Machine Learning with a Focus on Applications in Finance
Kalina, Jan; Neoral, Aleš
2022 - English
Machine learning methods penetrate to applications in the analysis of financial data, particularly to supervised learning tasks including regression or classification. Other approaches, such as reinforcement learning or automated machine learning, are not so well known in the context of finance yet. In this paper, we discuss the advantages of an automated data analysis, which is beneficial especially if a larger number of datasets should be analyzed under a time pressure. Important types of learning include reinforcement learning, automated machine learning, or metalearning. This paper overviews their principles and recalls some of their inspiring applications. We include a discussion of the importance of the concept of information and of the search for the most relevant information in the field of mathematical finance. We come to the conclusion that a statistical interpretation of the results of theautomatic machine learning remains crucial for a proper understanding of the knowledge acquired by the analysis of the given (financial) data.
Keywords:
statistical learning; automated machine learning; metalearning; financial data analysis; stock market investing
Fulltext is available at external website.
Recent Trends in Machine Learning with a Focus on Applications in Finance
Machine learning methods penetrate to applications in the analysis of financial data, particularly to supervised learning tasks including regression or classification. Other approaches, such as ...
The 2020 Election In The United States: Beta Regression Versus Regression Quantiles
Kalina, Jan
2021 - English
The results of the presidential election in the United States in 2020 desire a detailed statistical analysis by advanced statistical tools, as they were much different from the majority of available prognoses as well as from the presented opinion polls. We perform regression modeling for explaining the election results by means of three demographic predictors for individual 50 states: weekly attendance at religious services, percentage of Afroamerican population, and population density. We compare the performance of beta regression with linear regression, while beta regression performs only slightly better in terms of predicting the response. Because the United States population is very heterogeneous and the regression models are heteroscedastic, we focus on regression quantiles in the linear regression model. Particularly, we develop an original quintile regression map, such graphical visualization allows to perform an interesting interpretation of the effect of the demographic predictors on the election outcome on the level of individual states.
Keywords:
elections results; electoral demography; quantile regression; heteroscedasticity; outliers
Fulltext is available at external website.
The 2020 Election In The United States: Beta Regression Versus Regression Quantiles
The results of the presidential election in the United States in 2020 desire a detailed statistical analysis by advanced statistical tools, as they were much different from the majority of available ...
City simulation software for modeling, planning, and strategic assessment of territorial city units
Svítek, M.; Přibyl, O.; Vorel, J.; Garlík, B.; Resler, Jaroslav; Kozhevnikov, S.; Krč, Pavel; Geletič, Jan; Daniel, Milan; Dostál, R.; Janča, T.; Myška, V.; Aralkina, O.; Pereira, A. M.
2021 - English
SVÍTEK, M., PŘIBYL, O., VOREL, J., GARLÍK, B., RESLER, J., KOZHEVNIKOV, S., KRČ, P., GELETIČ, J., DANIEL, M., DOSTÁL, R., JANČA, T., MYŠKA, V., ARALKINA, O., PEREIRA, A. M. City simulation software for modeling, planning, and strategic assessment of territorial city units. 1.1. Prague: CTU & ICS CAS, 2021. Technical Report. ABSTRACT: The Smart Resilience City concept is a new vision of a city as a digital platform and eco-system of smart services where agents of people, things, documents, robots, and other entities can directly negotiate with each other on resource demand principals providing the best possible solution. It creates the smart environment making possible self-organization in sustainable or, when needed, resilient way of individuals, groups and the whole system objectives.
Keywords:
Smart city; City simulation; Energy resource-demand modelling; Environmental modelling; Synthetic population; Transport modelling
Available on request at various institutes of the ASCR
City simulation software for modeling, planning, and strategic assessment of territorial city units
SVÍTEK, M., PŘIBYL, O., VOREL, J., GARLÍK, B., RESLER, J., KOZHEVNIKOV, S., KRČ, P., GELETIČ, J., DANIEL, M., DOSTÁL, R., JANČA, T., MYŠKA, V., ARALKINA, O., PEREIRA, A. M. City simulation software ...
Score matching filters for Gaussian Markov random fields with a linear model of the precision matrix
Turčičová, Marie; Mandel, J.; Eben, Kryštof
2021 - English
We present an ensemble filter that provides a rigorous covariance regularization when the underlying random field is Gaussian Markov. We use a linear model for the precision matrix (inverse of covariance) and estimate its parameters together with the analysis mean by the Score Matching method. This procedure provides an explicit expression for parameter estimators. The resulting analysis step formula is the same as in the traditional ensemble Kalman filter.
Keywords:
Score matching; ensemble filter; Gaussian Markov random field; covariance modelling
Available at various institutes of the ASCR
Score matching filters for Gaussian Markov random fields with a linear model of the precision matrix
We present an ensemble filter that provides a rigorous covariance regularization when the underlying random field is Gaussian Markov. We use a linear model for the precision matrix (inverse of ...
Assessment of Independent EEG Components Obtained by Different Methods for BCI Based on Motor Imagery
Húsek, Dušan; Frolov, A. A.; Kerechanin, J. V.; Bobrov, P.D.
2021 - English
Eight methods of decomposition of a multichannel EEG signal are compared in terms of their ability to identify the most physiologically significant components. The criterion for the meaningfulness of a method is its ability to reduce mutual information between components; to create components that can be attributed to the activity of dipoles located in the cerebral cortex; find components that are provided by other methods and for this case; and at the same time, these components should most contribute to the accuracy of the BCI based on imaginary movement. Independent component analysis methods AMICA, RUNICA and FASTICA outperform others in the first three criteria and are second only to the Common Spatial Patterns method in the fourth criterion. The components created by all methods for 386 experimental sessions of 27 subjects were combined into more than 100 clusters containing more than 10 elements. Additionally, the components of the 12 largest clusters were analyzed. They have proven to be of great importance in controlling BCI, their origins can be modeled using dipoles in the brain, and they have been detected by several degradation methods. Five of the 12 selected components have been identified and described in our previous articles. Even if the physiological and functional origins of the rest of identified components’ are to be the subject of further research, we have shown that their physiological nature is at least highly probable.\n
Keywords:
brain–computer interface; motor imagery; blind source separation; independent component analysis; common spatial patterns; cluster analysis; EEG pattern extraction; EEG analysis; ICA; CSP; BCI; motor imagery
Available at various institutes of the ASCR
Assessment of Independent EEG Components Obtained by Different Methods for BCI Based on Motor Imagery
Eight methods of decomposition of a multichannel EEG signal are compared in terms of their ability to identify the most physiologically significant components. The criterion for the meaningfulness of ...
Application Of Implicitly Weighted Regression Quantiles: Analysis Of The 2018 Czech Presidential Election
Kalina, Jan; Vidnerová, Petra
2021 - English
Regression quantiles can be characterized as popular tools for a complex modeling of a continuous response variable conditioning on one or more given independent variables. Because they are however vulnerable to leverage points in the regression model, an alternative approach denoted as implicitly weighted regression quantiles have been proposed. The aim of current work is to apply them to the results of the second round of the 2018 presidential election in the Czech Republic. The election results are modeled as a response of 4 demographic or economic predictors over the 77 Czech counties. The analysis represents the first application of the implicitly weighted regression quantiles to data with more than one regressor. The results reveal the implicitly weighted regression quantiles to be indeed more robust with respect to leverage points compared to standard regression quantiles. If however the model does not contain leverage points, both versions of the regression quantiles yield very similar results. Thus, the election dataset serves here as an illustration of the usefulness of the implicitly weighted regression quantiles.
Keywords:
linear regression; quantile regression; robustness; outliers; elections results
Fulltext is available at external website.
Application Of Implicitly Weighted Regression Quantiles: Analysis Of The 2018 Czech Presidential Election
Regression quantiles can be characterized as popular tools for a complex modeling of a continuous response variable conditioning on one or more given independent variables. Because they are however ...
Nearly All Reals Can Be Sorted with Linear Time Complexity
Jiřina, Marcel
2021 - English
We propose a variant of the counting sort modified for sorting reals in a linear time. It is assumed that the sorting key and pointers to the items being sorted are moved and individual items remain at the same place in the memory (in place sorting). In this case, the space complexity of the new variant of the algorithm is the same as the complexity of the quicksort. We also quantify the practical limits for possible sorting reals in a linear time. This possibility is assured under additional assumptions on the distribution of the sorting key, mainly the independence and identity of the distribution. Here we give a more general criteria easily applicable in practice. We also show that the algorithm is applicable for data that do not fulfill criteria for linear time complexity but even that the computation is faster than the system quicksort. A new, faster version of the algorithm is attached.
Keywords:
sorting; algorithm; real sorting key; time complexity; linear complexity
Available in digital repository of the ASCR
Nearly All Reals Can Be Sorted with Linear Time Complexity
We propose a variant of the counting sort modified for sorting reals in a linear time. It is assumed that the sorting key and pointers to the items being sorted are moved and individual items remain ...
Multifractal approaches in econometrics and fractal-inspired robust regression
Kalina, Jan
2021 - English
While the mainstream economic theory is based on the concept of general economic equilibrium, the economies throughout the world have recently been facing serious transformations and challenges. Thus, instead of a convergence to equilibrium, the economies can be regarded as unstable, turbulent or chaotic with properties characteristic for fractal or multifractal processes. This paper starts with a discussion of recent data analysis tools inspired by fractal or multifractal concepts. We pay special attention to available data analysis tools based on reciprocal weights assigned to individual observations - these are inspired by an assumed fractal structure of multivariate data. As an extension, we consider here a novel version of the least weighted squares estimator of parameters for the linear regression model, which exploits reciprocal weights. Finally, we perform a statistical analysis of 31 datasets with economic motivation and compare the performance of the least weighted squares estimator with various weights. It turns out that the reciprocal weights, inspired by the fractal theory, are not superior to other choices of weights. In fact, the best prediction results are obtained with trimmed linear weights.
Keywords:
chaos in economics; fractal market hypothesis; reciprocal weights; robust regression; prediction
Available in digital repository of the ASCR
Multifractal approaches in econometrics and fractal-inspired robust regression
While the mainstream economic theory is based on the concept of general economic equilibrium, the economies throughout the world have recently been facing serious transformations and challenges. Thus, ...
On kernel-based nonlinear regression estimation
Kalina, Jan; Vidnerová, Petra
2021 - English
This paper is devoted to two important kernel-based tools of nonlinear regression: the Nadaraya-Watson estimator, which can be characterized as a successful statistical method in various econometric applications, and regularization networks, which represent machine learning tools very rarely used in econometric modeling. This paper recalls both approaches and describes their common features as well as differences. For the Nadaraya-Watson estimator, we explain its connection to the conditional expectation of the response variable. Our main contribution is numerical analysis of suitable data with an economic motivation and a comparison of the two nonlinear regression tools. Our computations reveal some tools for the Nadaraya-Watson in R software to be unreliable, others not prepared for a routine usage. On the other hand, the regression modeling by means of regularization networks is much simpler and also turns out to be more reliable in our examples. These also bring unique evidence revealing the need for a careful choice of the parameters of regularization networks
Keywords:
nonlinear regression; machine learning; kernel smoothing; regularization; regularization networks
Available in digital repository of the ASCR
On kernel-based nonlinear regression estimation
This paper is devoted to two important kernel-based tools of nonlinear regression: the Nadaraya-Watson estimator, which can be characterized as a successful statistical method in various econometric ...
Visual Images Segmentation based on Uniform Textures Extraction
Goltsev, A.; Gritsenko, V.; Húsek, Dušan
2020 - English
A new effective procedure for partial texture segmentation of visual images is proposed. The procedure segments any input image into a number of non-overlapping homogeneous ne-grained texture areas. The main advantages of the proposed procedure are as follows. It is completely unsupervised, that is, it processes the input image without any prior knowledge of either the type of textures or the number of texture segments in the image. In addition, the procedure segments arbitrary images of all types. This means that no changes to the procedure parameters are required to switch from one image type to another. Another major advantage of the procedure is that in most cases it extracts the uniform ne-grained texture segments present in the image, just as humans do. This result is supported by series of experiments that demonstrate the ability of the procedure to delineate uniform ne-grained texture segments over a wide range of images. At a minimum, image processing according to the proposed technique leads to a signficant reduction in the uncertainty of the internal structure of the analyzed image.
Keywords:
Texture feature; Texture window; Homogeneous ne-grained texture segment; Texture segment extraction; Texture segmentation
Available at various institutes of the ASCR
Visual Images Segmentation based on Uniform Textures Extraction
A new effective procedure for partial texture segmentation of visual images is proposed. The procedure segments any input image into a number of non-overlapping homogeneous ne-grained texture areas. ...
NRGL provides central access to information on grey literature produced in the Czech Republic in the fields of science, research and education. You can find more information about grey literature and NRGL at service web
Send your suggestions and comments to nusl@techlib.cz
Provider
Other bases